Speaker diarization correctly identifies speakers in notebook but not when run on fastapi endpoint #1556
Replies: 1 comment
-
Seemed to be an issue of passing the data as a request body versus a file upload. The hash was coming through differently when passed a request body. The transcription from whisper worked fine on the same file so it was strange that the speaker identification was struggling. Regardless, passing it as a UploadFile solved it. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I have two versions of a program with the same intention for both: speaker diarization. One of the files is a python file that hosts a fastapi endpoint that can be curled to to diarize an audio file. The other is a .ipynb that contains the exact same code except the fastapi endpoint definition has been removed (should not affect anything) and the audio file is being passed differently. Here are the differences below:
.py
With
curl -F "file=@<filepath>" localhost:5000/diarize
being used to pass the file.ipynb
This is the only difference between the two files and they are run on the same kernels. The results for the .ipynb are very good and identify the speakers swapping back and forth throughout the audio. The .py however only identifies one speaker for 99% of the file and then finally identifies the other speaker at the very end of the audio. What am I missing? Any help appreciated.
Beta Was this translation helpful? Give feedback.
All reactions