The model may simply be too small to saturate the GPU; compare, for example, MobileNet against ResNet. You can increase throughput (not per-inference latency) by increasing the batch size or by running multi-threaded inference.
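To illustrate the multi-threaded-inference suggestion, here is a minimal sketch (not a drop-in solution) that runs two execution contexts created from one deserialized engine on separate CUDA streams, so their copies and kernels can overlap on the GPU. It assumes the TensorRT 8.x Python API with pycuda, a static-shape float32 engine whose first binding is the input, and a hypothetical engine file `yolov8.engine`; adapt the path, shapes, and dtypes to your model.

```python
import numpy as np
import pycuda.autoinit  # noqa: F401 -- creates a default CUDA context
import pycuda.driver as cuda
import tensorrt as trt

ENGINE_PATH = "yolov8.engine"  # hypothetical path to your serialized engine
NUM_WORKERS = 2                # number of inferences to run in parallel

logger = trt.Logger(trt.Logger.WARNING)
with open(ENGINE_PATH, "rb") as f:
    engine = trt.Runtime(logger).deserialize_cuda_engine(f.read())

def make_worker(engine):
    # One execution context, CUDA stream, device buffers, and dummy input
    # per worker. Assumes static shapes and float32 for all bindings.
    ctx = engine.create_execution_context()
    stream = cuda.Stream()
    bufs, host_in = [], None
    for i in range(engine.num_bindings):
        shape = engine.get_binding_shape(i)
        bufs.append(cuda.mem_alloc(trt.volume(shape) * np.float32().itemsize))
        if engine.binding_is_input(i) and host_in is None:
            host_in = np.random.rand(*shape).astype(np.float32)  # dummy input
    return ctx, stream, bufs, host_in

workers = [make_worker(engine) for _ in range(NUM_WORKERS)]

# Enqueue every inference before synchronizing, so work from different
# streams can overlap on the GPU. Assumes binding 0 is the input.
for ctx, stream, bufs, host_in in workers:
    cuda.memcpy_htod_async(bufs[0], host_in, stream)
    ctx.execute_async_v2([int(b) for b in bufs], stream.handle)

for _, stream, _, _ in workers:
    stream.synchronize()
```

Note that overlapping contexts this way raises GPU utilization and throughput; the latency of each individual inference does not shrink, which matches the distinction drawn above.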
Description
When I run a YOLOv8 model on an RTX 3060, only 7% of the GPU is used. I want to run more than one model in parallel in order to reduce the overall inference time.
Environment
TensorRT Version: 8.6
NVIDIA GPU: 3060
NVIDIA Driver Version:
CUDA Version: 12.1
CUDNN Version: 8.6
Operating System:
Python Version (if applicable):
Tensorflow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if so, version):
Relevant Files
Model link:
Steps To Reproduce
Commands or scripts:
Have you tried the latest release?:
Can this model run on other frameworks? For example, run the ONNX model with ONNXRuntime (
polygraphy run <model.onnx> --onnxrt
):
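For reference, a concrete form of the check the template asks for, assuming the model was exported to a hypothetical `yolov8.onnx`:

```
polygraphy run yolov8.onnx --onnxrt
```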