
failure of TensorRT 8.6 when running inferencing on GPU NVIDIA GeForce RTX 3070 with error [convBaseRunner.cpp::execute::295] Error Code 1: Cask (Cask convolution execution) #3658

Closed
sandeepgadhwal opened this issue Feb 7, 2024 · 5 comments


@sandeepgadhwal

Description

I converted an ONNX model to a TensorRT engine, which works well. But when I load the engine through the Python API, it fails inside context.execute_v2(self.allocations).
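
For reference, the load path looks roughly like this (a simplified sketch of my script; the engine path and variable names are illustrative):

```python
# Simplified sketch of the failing path (TensorRT 8.6 Python API).
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)

# Deserialize the engine saved by trtexec.
with open("engine.trt", "rb") as f:
    engine = runtime.deserialize_cuda_engine(f.read())

context = engine.create_execution_context()

# allocations: one device pointer per I/O binding, allocated elsewhere.
allocations = []  # filled with cudaMalloc'd pointers in my script
context.execute_v2(allocations)  # <-- the Cask error is raised here
```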

Environment

TensorRT Version: 8.6.1 (nv-tensorrt-local-repo-ubuntu2004-8.6.1-cuda-11.8_1.0-1_amd64.deb)

NVIDIA GPU: NVIDIA GeForce RTX 3070

NVIDIA Driver Version: 535.154.05

CUDA Version: 12.2

CUDNN Version:

Operating System:

Python Version (if applicable): 3.10

Tensorflow Version (if applicable):

PyTorch Version (if applicable):

Baremetal or Container (if so, version):

Relevant Files

Model link:

Steps To Reproduce

I converted the model using this command:

/usr/src/tensorrt/bin/trtexec --onnx=model.onnx --saveEngine=engine.trt

I tested the engine using this command:

/usr/src/tensorrt/bin/trtexec --loadEngine=engine.trt --batch=1

Both commands work fine.

But when I try to load the engine through the Python API, it does not work and results in the traceback attached as a.txt.

Commands or scripts:
The script I am using for inference is attached as tensorrt_test_fps.py.txt.

Have you tried the latest release?: yes

Can this model run on other frameworks? For example run ONNX model with ONNXRuntime (polygraphy run <model.onnx> --onnxrt): yes, it works fine with ONNX Runtime using the TensorRT execution provider.

@sandeepgadhwal (Author)

Closing the issue because I made a silly mistake in my code; it is working now.

@patrickhulce

@sandeepgadhwal what was your silly mistake? asking for a...friend :)

@sandeepgadhwal (Author) commented Feb 16, 2024

Hi @patrickhulce,

I had allocated the wrong size of memory for the buffers. After allocating the correct amount, it worked fine. I cross-checked the number of bytes allocated.
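
For anyone checking their own buffers, the required size of each binding can be derived from the engine roughly like this (a sketch against the TensorRT 8.6 Python API; assumes a deserialized engine with static shapes):

```python
# Sketch: compute the exact byte size each I/O tensor needs on the device.
import numpy as np
import tensorrt as trt

def io_tensor_nbytes(engine):
    sizes = {}
    for i in range(engine.num_io_tensors):
        name = engine.get_tensor_name(i)
        shape = engine.get_tensor_shape(name)             # e.g. (1, 3, 640, 640)
        dtype = np.dtype(trt.nptype(engine.get_tensor_dtype(name)))
        sizes[name] = trt.volume(shape) * dtype.itemsize  # elements * bytes/element
    return sizes
```

Comparing these numbers against what the script actually allocates is how I found the mismatch.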

@yaobaishen

Hi @sandeepgadhwal, I have encountered the same error. How did you find the "wrong size of memory allocated" issue? The context.execute_v2() API doesn't print any useful log to help with debugging...

@yaobaishen

To whoever runs into a similar issue:

@sandeepgadhwal mentioned it's related to the memory buffers you create. In my case, I used pycuda to allocate the model's input/output buffers. Unfortunately, as the link below says, "pytorch-gpu and pycuda may have conflicts", so I switched from pycuda to the CUDA Python API to allocate all the buffers, and the issue disappeared (see the sketch after the link).
Please refer to:
https://forums.developer.nvidia.com/t/tensorrt-error-cuda-runtime-invalid-resource-handle-torchaudio-on-gpu-trt-engine-get-wrong-result/199750/5
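
A minimal sketch of that switch (assumes the cuda-python package; the per-tensor nbytes comes from the engine as in the earlier sketch; error handling trimmed to asserts):

```python
# Sketch: allocate and fill device buffers with cuda-python instead of pycuda.
import numpy as np
from cuda import cudart

def device_malloc(nbytes):
    err, dptr = cudart.cudaMalloc(nbytes)
    assert err == cudart.cudaError_t.cudaSuccess, err
    return dptr  # raw device pointer, usable directly in execute_v2 bindings

def host_to_device(dptr, arr: np.ndarray):
    err, = cudart.cudaMemcpy(dptr, arr.ctypes.data, arr.nbytes,
                             cudart.cudaMemcpyKind.cudaMemcpyHostToDevice)
    assert err == cudart.cudaError_t.cudaSuccess, err

def device_free(dptr):
    err, = cudart.cudaFree(dptr)
    assert err == cudart.cudaError_t.cudaSuccess, err
```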
