We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hey,
just a heads-up, there is a new version of the Prodigy optimizer that requires significantly less VRAM.
Details: konstmish/prodigy#22 konstmish/prodigy#23 konstmish/prodigy#32
OneTrainer PR: Nerogar/OneTrainer#550
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Hey,
just a heads-up, there is a new version of the Prodigy optimizer that requires significantly less VRAM.
saves 45% of VRAM overhead. There should be no downside. VRAM requirement on the level of AdamW
saves another 25% - this can have a quality impact, similar to Adafactor compared to AdamW
Details:
konstmish/prodigy#22
konstmish/prodigy#23
konstmish/prodigy#32
OneTrainer PR:
Nerogar/OneTrainer#550
The text was updated successfully, but these errors were encountered: