Skip to content

Increasing speed of Speaker Diarization pipeline with CPU #778

Answered by hbredin
Vanargh asked this question in Q&A
Discussion options

You must be logged in to vote

In some cases, real-time factor is between 0,5 and 1 with CPU. So it needs 30 to 60 minutes for a file of 1-hour duration. Are there any pre-trained pipelines that work nearly as well but are significantly faster?

No, there isn’t any.

I would like to know why it is so slow even though the downloaded models seem rather small. And is there some way, we can make it work faster?

You could increase the step to reduce the overlap between consecutive chunks in SAD and SCD step.

Currently, I believe that the audio file is provided as raw wave form to the end-to-end pipeline. Is it possible to provide, mfcc or other features extracted from the audio file, directly to the pipeline to obtain re…

Replies: 2 comments 3 replies

Comment options

You must be logged in to vote
2 replies
@Vanargh
Comment options

@asadullah797
Comment options

Answer selected by Vanargh
Comment options

You must be logged in to vote
1 reply
@prkumar112451
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
5 participants