New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

dropout strategy #144

Open

ryancll opened this issue Nov 19, 2024 · 0 comments

ryancll commented Nov 19, 2024

          We adopt the same dropping strategy ("we randomly drop image only (zero image) for 5% of samples, drop text only (empty string) for 5% of samples, drop both of them for 5% of samples for dual-cross-attention.") in all training phases.

"We did not drop image conditional latent in VDG" means the concatenated frame latent with noise will not be dropped.

Originally posted by @Doubiiu in #8 (comment)

Have you ever trid randomly drop image conditional latent(concated latent) for training? I'm curious why you think this strategy is unnecessary.

The text was updated successfully, but these errors were encountered:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment