Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Diarization pipeline fails at end of audio file (RuntimeError: Sizes of tensors must match except in dimension 0.) #1752

Open
ccmilne opened this issue Aug 26, 2024 · 3 comments

Comments

@ccmilne
Copy link

ccmilne commented Aug 26, 2024

Tested versions

  • Reproducible in 3.3.1

System information

Ubuntu 22.04.4 LTS - pyannote.audio 3.3.1 - EC2 g5.4xlarge

Issue description

Receiving this error when running the diarization pipeline on an mp3 file:

RuntimeError: Sizes of tensors must match except in dimension 0. Expected size 160000 but got size 147200 for tensor number 12 in the list.

Code to reproduce:

image

audio file can be found on Supreme Court's website: https://www.supremecourt.gov/oral_arguments/audio/2023/23-334

Full error:

image

Minimal reproduction example (MRE)

https://colab.research.google.com/drive/1odeZBhMTI7Ku4umLZ12VJkqrlVk0MRLk?usp=sharing

@qalabeabbas49
Copy link

Hi, I am not sure but try converting mp3 to wav and trying again.

@ccmilne
Copy link
Author

ccmilne commented Sep 9, 2024

Hi, I am not sure but try converting mp3 to wav and trying again.

Converting to a WAV file worked. Not sure why, but thanks!

@qalabeabbas49
Copy link

qalabeabbas49 commented Sep 9, 2024

Hi, I am not sure but try converting mp3 to wav and trying again.

Converting to a WAV file worked. Not sure why, but thanks!

It has something to do with torachaudio backend. Sometimes it doesn't work well with mp3 format.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants
@qalabeabbas49 @ccmilne and others