Why fp8 generated video has half size of fp16 ? #144

Open
yjhong89 opened this issue Dec 19, 2024 · 2 comments

Comments

@yjhong89

Hi!

I ran the inference script with both the fp16 version and the fp8 version, and the results are very good in both cases.
However, the sizes of the generated videos differ between the two versions, and I wonder why.

image

  • The upper one was generated with the fp16 model and the lower one with the fp8 model. As shown, the size of the fp8-generated video is halved.
@mboboGO
Contributor

mboboGO commented Dec 19, 2024

Hi, thank you for trying out our fp8 version.
As for your question, I checked the output tensors from both fp16 and fp8 for 256x256x129f video generation. The generated tensors have the same shape and dtype: "torch.Size([1, 3, 129, 256, 256]) torch.float32".
The generated videos also have similar file sizes:
image
Please share your inference commands and any other relevant details so that I can reproduce your situation.
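For anyone who wants to repeat the shape and dtype check described above, here is a minimal sketch. The helper names (`describe`, `same_layout`, `raw_bytes`) and the stand-in objects are hypothetical, not part of the repository. The stand-ins only mimic the `.shape` and `.dtype` attributes that a `torch.Tensor` exposes, so the snippet runs without PyTorch installed.

```python
import math
from types import SimpleNamespace


def describe(tensor):
    """Return (shape, dtype) for any tensor-like object with .shape and .dtype."""
    return tuple(tensor.shape), str(tensor.dtype)


def same_layout(a, b):
    """True when both outputs have identical shape and dtype."""
    return describe(a) == describe(b)


def raw_bytes(shape, bytes_per_element):
    """Uncompressed size in bytes of a dense tensor with the given shape."""
    return math.prod(shape) * bytes_per_element


# Hypothetical stand-ins for the fp16 and fp8 pipeline outputs, using the
# shape and dtype reported in this thread. Replace them with real tensors.
fp16_out = SimpleNamespace(shape=(1, 3, 129, 256, 256), dtype="torch.float32")
fp8_out = SimpleNamespace(shape=(1, 3, 129, 256, 256), dtype="torch.float32")

print(describe(fp16_out))
print("same layout:", same_layout(fp16_out, fp8_out))
# float32 uses 4 bytes per element
print("raw bytes per output:", raw_bytes(fp16_out.shape, 4))
```

If both outputs report the same layout, their uncompressed size is identical, so any difference in the saved file size would have to come from the video encoding step rather than from the tensors themselves.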

@mboboGO
Contributor

mboboGO commented Dec 19, 2024

A possible reason is that the fp8 checkpoint filters out some high-frequency details that cannot be observed visually, so the video compression algorithm produces a much smaller file when saving the video.
This is just a guess ~
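The guess above can be illustrated with a small toy experiment. This is only a sketch under loud assumptions: it uses `zlib` as a stand-in for a real video codec, and a synthetic blocky grayscale frame with faint random noise as a stand-in for "high-frequency detail". It says nothing about what the fp8 checkpoint actually does. It only shows that two buffers of identical raw size can compress to very different sizes.

```python
import math
import random
import zlib

WIDTH, HEIGHT, BLOCK = 256, 256, 16


def make_frame(noise_amplitude, seed=0):
    """Build one 8-bit grayscale frame of flat 16x16 blocks plus optional noise."""
    rng = random.Random(seed)
    pixels = bytearray()
    for y in range(HEIGHT):
        for x in range(WIDTH):
            # Low-frequency content: one constant value per block.
            base = 128 + 60 * math.sin(x // BLOCK) * math.cos(y // BLOCK)
            # High-frequency content: small per-pixel perturbation.
            noise = rng.uniform(-noise_amplitude, noise_amplitude)
            pixels.append(max(0, min(255, round(base + noise))))
    return bytes(pixels)


detailed = make_frame(noise_amplitude=3.0)  # faint per-pixel detail present
smoothed = make_frame(noise_amplitude=0.0)  # same frame with the detail removed

# zlib is an assumption here, standing in for the encoder used when saving.
detailed_size = len(zlib.compress(detailed, 9))
smoothed_size = len(zlib.compress(smoothed, 9))

print("raw bytes:       ", len(detailed), len(smoothed))
print("compressed bytes:", detailed_size, smoothed_size)
```

Both frames hold the same number of raw bytes, yet the one without per-pixel noise compresses to far fewer bytes. A real video encoder works differently, but it is similarly sensitive to fine detail, so a comparable effect on file size would be plausible.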
