Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

no punctuation marks when transcribe Chinese. #2532

Open
SteveTanggithub opened this issue Nov 4, 2024 · 2 comments
Open

no punctuation marks when transcribe Chinese. #2532

SteveTanggithub opened this issue Nov 4, 2024 · 2 comments

Comments

@SteveTanggithub
Copy link

model:whisper-large-v3-turbo-v1.0.0.bin
we use the model to transcribe chinese speech, but there is no any punctuation marks in the output. And we only get the comma marks after we add config like "--prompt "punctuation." "
How can I get the correct punctuation marks for chinese?

@mrfragger
Copy link

--prompt "你可以帮我吗?谢谢你的帮忙!议地点在哪里?我昨天看了一部电影。"

basically just include a few sentences with punctuation included.

@SteveTanggithub
Copy link
Author

thank you! We now get a new issue. We input a wav file about 3 seconds but we got this:
69c28396bdb18ad922c30a16049e555
47c178b49db035f431cc8a35ee932c7

it output some words don't exist in the speech and expand the duration to 33 seconds! We use main.cpp in the main fold. What's the problem? the input audio must longer than 30 seconds? or some config setting we did't change?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants