NaN value in loss #8

Open
wusize opened this issue Jun 28, 2024 · 3 comments

Comments

@wusize

wusize commented Jun 28, 2024

Hi, thanks for your great work! I am trying to reproduce VQGAN on ImageNet by running this script (stage 1). However, training always collapses between 3k and 6k iterations with NaN in the losses. Is there any trick to avoid NaN during training?

@hyc9

hyc9 commented Jun 28, 2024

I have encountered this before and found that reducing the number of warm-up steps solves it.

@wusize
Author

wusize commented Jun 28, 2024

Thanks for the feedback! I have an additional question: why is the number of discriminator warm-up steps set to 500000 (--dis_warmup_steps 500000)? As I understand it, this means the discriminator loss is increased linearly across the whole training process.
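
To make the question concrete, here is a minimal sketch of what a linear warm-up of the discriminator loss weight could look like, assuming the flag behaves as I described above. The helper name `linear_warmup_weight` and the `max_weight` parameter are hypothetical and not taken from the repository; the actual schedule in the training script may differ.

```python
def linear_warmup_weight(step: int, warmup_steps: int, max_weight: float = 1.0) -> float:
    """Hypothetical helper: ramp a loss weight linearly from 0 to max_weight.

    The weight reaches max_weight at `warmup_steps` and stays there afterwards.
    """
    if warmup_steps <= 0:
        # No warm-up requested: use the full weight from the first step.
        return max_weight
    return max_weight * min(step / warmup_steps, 1.0)


if __name__ == "__main__":
    # With 500000 warm-up steps, the weight is still tiny in the 3k-6k range
    # where the NaN losses were observed.
    for step in (0, 3_000, 6_000, 250_000, 500_000):
        print(step, linear_warmup_weight(step, 500_000))
```

Under this reading, a smaller warm-up value only makes the ramp steeper, so the discriminator term reaches its full weight earlier in training.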

@wdrink
Collaborator

wdrink commented Jul 6, 2024

Could you share more details, e.g., what type of data you used? Thanks
