Description
I saw that in the official demo of SDXL, the VAE was not compiled. However, when I converted the VAE to plan format, the resulting file was only 98.22 MB. Yet after loading the VAE engine and querying device_memory_size to check the VRAM usage, it reported 12079599616 bytes (about 11.25 GiB), which means I don't have enough VRAM to load the entire model.
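For reference, a minimal sketch of how this number can be queried, assuming the standard TensorRT Python API (the plan path is hypothetical):

```python
# Load the serialized VAE engine and query its device_memory_size.
import tensorrt as trt

PLAN_PATH = "vae.plan"  # hypothetical path to the serialized engine

logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)
with open(PLAN_PATH, "rb") as f:
    engine = runtime.deserialize_cuda_engine(f.read())

# Reports 12079599616 bytes on the setup described above.
print(f"device_memory_size: {engine.device_memory_size} bytes")
```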
Environment
TensorRT Version: 9.2/9.3
NVIDIA GPU: RTX 4080
NVIDIA Driver Version: 535.161.07
CUDA Version: 12.2
CUDNN Version:
Operating System: Ubuntu 22.04.3 LTS
Python Version (if applicable): 3.10
Tensorflow Version (if applicable):
PyTorch Version (if applicable): 2.1.0
Baremetal or Container (if so, version):
Relevant Files
Model link: https://huggingface.co/stabilityai/stable-diffusion-xl-1.0-tensorrt
Steps To Reproduce
Commands or scripts:
Have you tried the latest release?: Yes
Can this model run on other frameworks? For example, run the ONNX model with ONNXRuntime (polygraphy run <model.onnx> --onnxrt):

An engine, on deserialization, allocates device memory to store the model weights. Since the serialized engine is almost all weights, its size is a good approximation of the amount of device memory the weights require, so you can calculate your VAE model's weight memory from the plan file size. The device_memory_size attribute, by contrast, reports the activation (scratch) memory an execution context needs, not the weight memory, which is why it can be so much larger than the plan file.
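A minimal sketch of that estimate (the plan path is hypothetical):

```python
# Estimate the VAE weight memory from the serialized plan's file size;
# the plan is almost all weights.
import os

PLAN_PATH = "vae.plan"  # hypothetical path to the serialized engine

weight_bytes = os.path.getsize(PLAN_PATH)
print(f"approx. weight memory: {weight_bytes / 2**20:.2f} MiB")  # ~98 MiB here
```

Comparing this figure with the engine's device_memory_size (queried as in the sketch above) separates the roughly 98 MiB of weights from the roughly 11 GiB of activation scratch memory.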