Hey, thanks a lot for this work, this is great. However, I wanted to use a different model (rjac/whisper-tiny-spanish) to transcribe a video in Spanish, and it did, but it then translated the whole thing to English. Could the translation be skipped?
Thanks for reaching out! I can look into integrating this feature when I do another PR on it. The main reason English-output was assumed (just so you know) is that a lot of the "cleanup" features (spell checking, punctuation, etc.) either originally only worked for English out of the box or were much easier to implement if you assumed that. However, I think this can be implemented in two parts:
1. Add option(s)/flags to skip all the correction/translation bits, and add functionality to the Whisper integration to specify the language (so it does not assume English; iirc it currently does). I can do this in a few weeks, hopefully, when I have time to work on it again :)
2. Add "real" multi-language support for the downstream bits I mentioned. Maybe one day... but to be honest, I'm not sure I will have time to implement this myself. We will see!
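
For anyone who needs a workaround in the meantime, here is a rough sketch of what the first part could look like. It assumes transcription goes through the Hugging Face `transformers` speech-recognition pipeline, which may not match how this project actually calls Whisper. `build_generate_kwargs` and `transcribe_file` are hypothetical names made up for illustration, not functions in this repo, and whether a given fine-tuned checkpoint accepts the `language` option depends on its generation config and on the installed `transformers` version.

```python
from typing import Optional


def build_generate_kwargs(language: Optional[str] = None, translate: bool = False) -> dict:
    """Hypothetical helper: build Whisper decoding options.

    task="transcribe" keeps the output in the spoken language;
    task="translate" asks Whisper to produce English instead.
    """
    kwargs = {"task": "translate" if translate else "transcribe"}
    if language is not None:
        # Only set the language when the caller gives one; otherwise
        # the model is left to detect it.
        kwargs["language"] = language.lower()
    return kwargs


def transcribe_file(audio_path: str,
                    model_name: str = "rjac/whisper-tiny-spanish",
                    language: Optional[str] = "spanish") -> str:
    """Hypothetical wrapper: transcribe one audio file without translating it."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import pipeline

    asr = pipeline(
        "automatic-speech-recognition",
        model=model_name,
        chunk_length_s=30,  # split long audio into 30-second chunks
    )
    result = asr(audio_path, generate_kwargs=build_generate_kwargs(language))
    return result["text"]
```

Calling `transcribe_file("talk.wav")` (the file name is only an example) should then return Spanish text, provided the English-only cleanup steps (spell checking, punctuation, etc.) are skipped afterwards.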