Is there any GPU memory optimization in TensorRT 8.6.1.6 compared to 8.4.0.6? #3743

Closed
Jsy0220 opened this issue Mar 27, 2024 · 3 comments
Labels: triaged (Issue has been triaged by maintainers)

Jsy0220 commented Mar 27, 2024

Hi, I tested the same model on the same machine with TensorRT 8.6.1.6 and 8.4.0.6, using the following steps:

  1. Use bin/trtexec to convert the ONNX model to an engine file with 8.6 and 8.4 respectively, using the same command: bin/trtexec --onnx=xxxx.onnx --saveEngine=xxx.engine (this model has fixed input shapes).
  2. Load the engine in the same C++ code and run it (see the sketch after this list).
  3. Check GPU memory usage with nvidia-smi.
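
For reference, step 2 is roughly equivalent to the following minimal sketch (TensorRT 8.x C++ API); the engine path, buffer sizes and error handling are simplified placeholders, not the exact code used here:

#include <NvInfer.h>
#include <cuda_runtime_api.h>
#include <fstream>
#include <iostream>
#include <vector>

// Minimal logger required by the TensorRT runtime.
class Logger : public nvinfer1::ILogger {
    void log(Severity severity, const char* msg) noexcept override {
        if (severity <= Severity::kWARNING) std::cout << msg << std::endl;
    }
} gLogger;

int main() {
    // Read the serialized engine produced by trtexec.
    std::ifstream file("xxx.engine", std::ios::binary);
    std::vector<char> blob((std::istreambuf_iterator<char>(file)),
                           std::istreambuf_iterator<char>());

    auto* runtime = nvinfer1::createInferRuntime(gLogger);
    auto* engine  = runtime->deserializeCudaEngine(blob.data(), blob.size());

    // createExecutionContext() allocates the activation (scratch) memory
    // internally; this allocation is part of what nvidia-smi reports.
    auto* context = engine->createExecutionContext();

    // Allocate a device buffer per binding (sizes are model-specific;
    // 1 MiB is a placeholder).
    std::vector<void*> bindings(engine->getNbBindings(), nullptr);
    for (int i = 0; i < engine->getNbBindings(); ++i) {
        cudaMalloc(&bindings[i], 1 << 20);
    }

    cudaStream_t stream;
    cudaStreamCreate(&stream);
    context->enqueueV2(bindings.data(), stream, nullptr);
    cudaStreamSynchronize(stream);

    // Cleanup omitted for brevity.
    return 0;
}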

The following are the results
8.6

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 510.47.03    Driver Version: 510.47.03    CUDA Version: 11.6     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla T4            Off  | 00000000:3B:00.0 Off |                    0 |
| N/A   66C    P0    63W /  70W |    283MiB / 15360MiB |     61%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

8.4

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 510.47.03    Driver Version: 510.47.03    CUDA Version: 11.6     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla T4            Off  | 00000000:3B:00.0 Off |                    0 |
| N/A   69C    P0    73W /  70W |    853MiB / 15360MiB |     59%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

As can be seen above, GPU memory consumption is significantly reduced with 8.6 (283 MiB vs. 853 MiB).

So:

  1. Is there any GPU memory optimization in TensorRT 8.6.1.6 compared to 8.4.0.6?
  2. If not, is there any special parameter that should be set when using bin/trtexec in 8.4 but that is the default in 8.6?

Thanks !!!

zerollzeng (Collaborator) commented

We do keep optimizing build-time memory consumption across versions.

zerollzeng self-assigned this Mar 28, 2024
zerollzeng added the triaged label Mar 28, 2024
zerollzeng (Collaborator) commented

Closing this, feel free to reopen if you have any further questions.

Jsy0220 (Author) commented May 28, 2024

Hi, I have a further question about the difference between createExecutionContext and createExecutionContextWithoutDeviceMemory. Is there any GPU memory difference when using the corresponding context?
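
For reference, my rough understanding of the two APIs (a sketch based on the TensorRT 8.x headers, not the exact code in use here) is:

#include <NvInfer.h>
#include <cuda_runtime_api.h>

void makeContexts(nvinfer1::ICudaEngine* engine) {
    // Option A: TensorRT allocates the activation/scratch memory for the
    // context internally when it is created.
    nvinfer1::IExecutionContext* ctxA = engine->createExecutionContext();

    // Option B: TensorRT allocates no activation memory; the caller must
    // provide a buffer of at least getDeviceMemorySize() bytes before
    // enqueueing, e.g. one buffer shared by several contexts that never
    // run concurrently.
    nvinfer1::IExecutionContext* ctxB =
        engine->createExecutionContextWithoutDeviceMemory();
    void* scratch = nullptr;
    cudaMalloc(&scratch, engine->getDeviceMemorySize());
    ctxB->setDeviceMemory(scratch);

    // ... run inference, then free the scratch buffer and both contexts.
}

In other words, is the total GPU usage the same and just owned by the caller in the second case, or can it actually be reduced (for example by sharing one buffer across contexts)?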
