How to get closer accuracy with FP16 in demo/Diffusion 8-bits PTQ? #3723

TheBge12138 · 2024-03-19T02:21:38Z

Hello, notice that tensorrt 9.3 had updated the 8-bit quantization to accelerate diffusion models, it's excellent !
But when running the demo, i can't get the same result with fp16, I have modified the get_pipeline in model.py to my own model, and used over 1000 prompts for calibration, there is a big gap under some prmopts, and some prmopts are close but still not completely similar.

my sdxl model pipeline has 30steps, I compared the cosine similarity of the unet output tensor. In 60 prompt tests, compared with fp16, the similarity fluctuated between 89% and 97%.

in https://developer.nvidia.com/blog/tensorrt-accelerates-stable-diffusion-nearly-2x-faster-with-8-bit-post-training-quantization/
I saw you can get images that are nearly identical to original FP16 precision.
Is there anything I can do to improve the accuracy of my in8 model？

Thanks!

Tasks

Give feedback

No tasks being tracked yet.

Options

zerollzeng · 2024-03-22T05:48:49Z

@ttyio ^ ^

TheBge12138 · 2024-03-29T06:56:35Z

@jingyu-ml
thanks for your reply! i saw it on email
Can I ask when release/10.0 will be released?

jingyu-ml · 2024-04-01T05:53:42Z

@jingyu-ml thanks for your reply! i saw it on email Can I ask when release/10.0 will be released?

I don't have the release date, but I would say its in the near future and the team is working on it.

zerollzeng · 2024-04-06T13:52:59Z

TRT 10 EA has been released.

TheBge12138 · 2024-04-08T01:49:30Z

TRT 10 EA has been released.

Excellent, thanks for your work!

jingyu-ml · 2024-05-08T10:09:01Z

@TheBge12138 please check the GA release instead of the EA release
d3107c8

TheBge12138 · 2024-05-08T11:55:42Z

@TheBge12138 please check the GA release instead of the EA release d3107c8

thanks!

zerollzeng assigned ttyio Mar 22, 2024

zerollzeng added the triaged Issue has been triaged by maintainers label Mar 22, 2024

TheBge12138 changed the title ~~how to adjust the quantization parameters in demo/Diffusion 8-bits PTQ?~~ How to get more accurate accuracy in demo/Diffusion 8-bits PTQ? Mar 22, 2024

TheBge12138 changed the title ~~How to get more accurate accuracy in demo/Diffusion 8-bits PTQ?~~ How to get closer accuracy with FP16 in demo/Diffusion 8-bits PTQ? Mar 22, 2024

TheBge12138 closed this as completed Mar 29, 2024

TheBge12138 reopened this Mar 29, 2024

TheBge12138 closed this as completed Apr 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to get closer accuracy with FP16 in demo/Diffusion 8-bits PTQ? #3723

How to get closer accuracy with FP16 in demo/Diffusion 8-bits PTQ? #3723

TheBge12138 commented Mar 19, 2024 •

edited

Loading

Tasks

zerollzeng commented Mar 22, 2024

TheBge12138 commented Mar 29, 2024

jingyu-ml commented Apr 1, 2024

zerollzeng commented Apr 6, 2024

TheBge12138 commented Apr 8, 2024

jingyu-ml commented May 8, 2024

TheBge12138 commented May 8, 2024

How to get closer accuracy with FP16 in demo/Diffusion 8-bits PTQ? #3723

How to get closer accuracy with FP16 in demo/Diffusion 8-bits PTQ? #3723

Comments

TheBge12138 commented Mar 19, 2024 • edited Loading

Tasks

zerollzeng commented Mar 22, 2024

TheBge12138 commented Mar 29, 2024

jingyu-ml commented Apr 1, 2024

zerollzeng commented Apr 6, 2024

TheBge12138 commented Apr 8, 2024

jingyu-ml commented May 8, 2024

TheBge12138 commented May 8, 2024

TheBge12138 commented Mar 19, 2024 •

edited

Loading