flux model engine_from_bytes(bytes_from_path(self.engine_path)) OutOfMemory #4207

Open
algorithmconquer opened this issue Oct 17, 2024 · 3 comments
Labels
Demo: Diffusion (Issues regarding demoDiffusion), triaged (Issue has been triaged by maintainers)

Comments

@algorithmconquer

from polygraphy.backend.trt import engine_from_bytes
When I run engine_from_bytes(bytes_from_path(self.engine_path)) for the flux-dev model on a single L40 GPU, it fails with OutOfMemory. How can I solve this?

@lix19937

Try using trtexec, with TensorRT version 8.6 or later.
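
Before retrying, it may help to check whether the serialized engine can plausibly fit in the GPU memory that is currently free. The following is a minimal sketch, not part of the demo: the helper names, the engine path, and the 1.2 headroom factor are all hypothetical, and it assumes the engine file size is only a rough lower bound on the device memory needed to deserialize it. It reads free memory through nvidia-smi's --query-gpu=memory.free option.

```python
from __future__ import annotations

import os
import subprocess


def parse_free_mib(nvidia_smi_output: str) -> list[int]:
    """Parse one free-memory value (MiB) per GPU from nvidia-smi CSV output."""
    return [int(line.strip()) for line in nvidia_smi_output.splitlines() if line.strip()]


def query_free_mib() -> list[int]:
    """Ask nvidia-smi for the free memory (MiB) of every visible GPU."""
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.free", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return parse_free_mib(out)


def engine_fits(engine_bytes: int, free_mib: int, headroom: float = 1.2) -> bool:
    """Rough check: engine size times a headroom factor must fit in free memory.

    The headroom factor is an assumption, not a measured TensorRT requirement.
    """
    return engine_bytes * headroom <= free_mib * 1024 * 1024


if __name__ == "__main__":
    engine_path = "transformer.plan"  # hypothetical path, replace with your engine file
    size = os.path.getsize(engine_path)
    for gpu_index, free in enumerate(query_free_mib()):
        verdict = "may fit" if engine_fits(size, free) else "unlikely to fit"
        print(f"GPU {gpu_index}: {free} MiB free, engine {size / 2**20:.0f} MiB, {verdict}")
```

If the check reports that the engine is unlikely to fit, the failure is probably due to the engine size itself rather than to how it is loaded, so changing the loading tool alone may not help.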

@yuanyao-nv added the triaged and Demo: Diffusion labels on Oct 18, 2024
@yuanyao-nv
Collaborator

Related issue: #4205

@asfiyab-nvidia
Collaborator

@algorithmconquer The flux demo should now run on L40S, as we have added memory optimizations in release/10.6. Could you please try again and report back here?
