VAD inference time #949
Unanswered
kuruvachankgeorge
asked this question in
Q&A
Replies: 1 comment 1 reply
-
Thanks for your interest in pyannote. Improving inference time is definitely not the priority right now as pyannote.audio was not initially designed for real-time processing. That being said,
Feel free to contribute a proper (independent) benchmark. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi folks! I'm wondering if anyone has compared the inference time (latency) of the Pyannote VAD with other popular VAD algorithms, like Silero-VAD etc. I could find a comparison on their performance in terms of False alarm and Missed detection rates in the paper, but nothing is clearly mentioned about the latency or computational load. Would be great if you can share these details as well. May I also know if pyannote segmentation model supports onnx or tensorRT (for reducing the inference time) because my intention is to integrate the VAD to my ASR engine (by replacing the webrtc VAD for better speech-silence segmentation) for real-time inferencing. Thanks!
Beta Was this translation helpful? Give feedback.
All reactions