Transcribing and translating offline with whisper #1839
Replies: 1 comment 2 replies
-
Hello @hilderonny! This is very useful and would be a great addition to the project! We have a recently opened improvement to integrate Whisper: I was conducting tests to compare Whisper portuguese results to our current Wav2Vec2 portuguese transcription: I didn't finish it yet. But, actually, we don't have to wait for it and we can integrate Whisper and let users decide the algorithm choice (accuracy x speed). My idea would be to integrate Whisper in a slight different way, similar to the way we run Wav2Vec2 today, so IPED's transcription cluster service could use Whisper too, not sure if you know about it: For this, we would need a script like below. We could read a configurable parameter from At the moment, I think about integrating Faster-Whisper, not sure which Whisper implementation you are using... About translation, that is also very useful and would be an awesome addition! We have an old stale feature request about this: It can be very useful, I think not only for audios, but to translate documents too. So I think it could fit better as a separate task, independent of transcription, so users could translate documents independently of audio processing. Do you think it would be possible to split your script into 2, so translation would be a separate task? If yes and if you could submit a pull request, I would be happy to review and test! PS: I know Whisper does translation to english automatically, I think it would be possible to return the english transcription version from the cluster service too, changing a bit the communication protocol. Coincidently, today a colleague of mine shared this to me: I don't know how it compares to Argos about accuracy x speed... Thank you very much for sharing your work and for trying to help the project! Best regards, |
Beta Was this translation helpful? Give feedback.
-
Hello everybody,
this week I created a python task for IPED which uses Whisper to transcribe audio and video files in various languages. It then translates the results into english. Finally the script uses Argos Translate to translate the english result into german for our collegues.
The results are attached to the evidence items as metadata and look like that:
The script needs some additional installations and downloads but it rans completely offline which is important for our department.
If you are interested in this solution I could share the scripts and an installation instruction. Please tell me where and how.
Best regards,
Ronny
Beta Was this translation helpful? Give feedback.
All reactions