Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Whisper Vamp Plugin v1 #3

Merged
merged 2 commits into from
Aug 9, 2024
Merged

Whisper Vamp Plugin v1 #3

merged 2 commits into from
Aug 9, 2024

Conversation

pierreguillot
Copy link
Collaborator

The first version of the speech-to-text Whisper Vamp plugin.

The Whisper Vamp plug-in is an implementation of the Whisper speech recognition model developed by OpenAI as a Vamp plug-in. The Whisper Vamp plug-in analyses the text in the audio stream and generates markers corresponding to the tokens (words and/or syllables) found.

The lightweight ggml-tiny model is embedded in the plugin (so you don’t have to download anything to start experimenting), but it is possible to download and use other models that may be more appropriate to your needs. You’ll find all the information in the user manual. Don’t hesitate to send me your feedback.

The plugin lets you define an input marker track to segment the analysis. This feature can be useful in avoiding the biases of certain models, such as the generation or repetition of words not present in the audio stream.

@pierreguillot pierreguillot merged commit 9ee39f0 into main Aug 9, 2024
12 checks passed
@pierreguillot pierreguillot deleted the dev/1.0.0 branch August 9, 2024 10:20
@pierreguillot pierreguillot self-assigned this Aug 9, 2024
@pierreguillot pierreguillot added the enhancement New feature or request label Aug 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant