-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multimodal use of uvcgan2 ? #30
Comments
Hello @PerrinAntonin, Thank you for your interest in our work.
I actually thought about this a bit. The short answer is that currently it is not possible. I think, in principle, one can modify the generator architecture a bit to expose its style to the user. Then, one can implement a custom training setup, following MUnit or DRIT examples. If done correctly, I believe everything will work and make UVCGAN multimodal. All the modifications are rather straightforward, but they will take some time to implement and debug. And, unfortunately, we do not have resources to explore these modifications at the moment. |
Hi @usert5432 Thank you for your quick reply! I could also see that you set a different learning rate for the discriminator and the generator. If the generator rate is smaller, is it because the generator learns too quickly compared to the discriminator? |
I cannot say definitively, since it is more of an empirical question, but my intuition matches yours. Currently, UVCGAN is not trained to work correctly with mismatching styles, so I would expect it to break if some unexpected style is substituted.
Yes, this is my working hypothesis. Although, I am not sure it is 100% correct. |
Hello,
Congratulations for this very successful project! I wanted to ask you if you think a multimodal use of uvcgan2 is possible? In order to use it as an MUnit, where it's possible to generate different images from a single reference image. In MUnit, you simply choose a different style vector for a new generation, but in Uvcgan it's the VIT that generates it, and I was wondering how to play with that.
Sincerely,
Antonin
The text was updated successfully, but these errors were encountered: