Is it possible to pass in precomputed word timings? #1058

laphang · 2022-08-16T06:57:56Z

laphang
Aug 16, 2022

I have an use case where I want to do speech recognition and then diarization, and I've been exploring using pyannote for the latter and seeing good accuracy.

I was wondering whether it's possible to pass in the word timings already obtained from speech recognition into pyannote (ie skip the VAD step / improve speed).

Answered by hbredin

Aug 16, 2022

Not out of the box. Also, the speaker diarization pipeline does not rely on a VAD step per se.
So you would be better off rewriting a pipeline from scratch.

I am also curious about your use case and the "good accuracy" you got.
Feel free to drop me an email to tell me a bit more :)

View full answer

hbredin · 2022-08-16T07:36:30Z

hbredin
Aug 16, 2022
Maintainer

Not out of the box. Also, the speaker diarization pipeline does not rely on a VAD step per se.
So you would be better off rewriting a pipeline from scratch.

I am also curious about your use case and the "good accuracy" you got.
Feel free to drop me an email to tell me a bit more :)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is it possible to pass in precomputed word timings? #1058

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Is it possible to pass in precomputed word timings? #1058

laphang Aug 16, 2022

Replies: 1 comment

hbredin Aug 16, 2022 Maintainer

laphang
Aug 16, 2022

hbredin
Aug 16, 2022
Maintainer