Skip to content

Diarization result for "End-to-end speaker segmentation for overlap-aware resegmentation" #795

Answered by hbredin
ChokJohn asked this question in Q&A
Discussion options

You must be logged in to vote

Yes, that is probably correct.

The sum of FA and Miss. does match the numbers reported in the paper.
The high Conf. is due to the fact that the segmentation model is not capable of tracking speakers over time (it only works on small 5s chunks).

You'd have to combine the segmentation model with speaker embedding to perform proper diarization.
See this paper and code for a way to do that.

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
1 reply
@385614027
Comment options

Answer selected by ChokJohn
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants
Converted from issue

This discussion was converted from issue #794 on October 21, 2021 06:31.