-
Notifications
You must be signed in to change notification settings - Fork 91
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Machine Translation Task with DiffuSeq #74
Comments
Hi, |
Yeah makes sense, thanks! Are you referring to works like SeqDiffuSeq which builds on DiffuSeq directly? |
It depends on what your goal is using diffusion model for MT tasks. Follow-up works are not exactly the same with DiffuSeq. SeqDiffuSeq is based on encoder-decoder architecture, while RDM is based on discrete text diffusion. This work also involves pre-trained MLMs. If you're aiming the performance, you could refer to the SOTA model. |
@summmeer thanks, this is very helpful! in the paper DiNoiSer, the authors claim to have surpassed DiffuSeq's performance on the WMT14 EN->DE task, so I wanted to do a similar comparison between DiffuSeq and DiNoiSer on the IWSLT14 task, but DiffuSeq takes a long time to train. Sorry for the trivial question, your replies are really helpful, thanks! |
Hi, |
I will, thanks a lot! |
Hi @summmeer,
I was wondering how I might go about implementing a machine translation task with DiffuSeq. I have trained DiffuSeq for the paraphrase task, but I want to be able to use it for translation tasks.
Would supplying a translation dataset to the existing codebase (since it designed for seq2seq tasks) suffice or would further changes be required?
Would appreciate any advice, thanks!
The text was updated successfully, but these errors were encountered: