Whisper Vamp Plugin v1 #3

pierreguillot · 2024-08-09T10:06:52Z

The first version of the speech-to-text Whisper Vamp plugin.

The Whisper Vamp plug-in is an implementation of the Whisper speech recognition model developed by OpenAI as a Vamp plug-in. The Whisper Vamp plug-in analyses the text in the audio stream and generates markers corresponding to the tokens (words and/or syllables) found.

The lightweight ggml-tiny model is embedded in the plugin (so you don’t have to download anything to start experimenting), but it is possible to download and use other models that may be more appropriate to your needs. You’ll find all the information in the user manual. Don’t hesitate to send me your feedback.

The plugin lets you define an input marker track to segment the analysis. This feature can be useful in avoiding the biases of certain models, such as the generation or repetition of words not present in the audio stream.

pierreguillot added 2 commits August 9, 2024 11:32

Bump version to 1.0.0

5cada46

Add automatic release note generation in .github/workflows/ci.yml

69fee10

pierreguillot force-pushed the dev/1.0.0 branch from 5d547ed to 69fee10 Compare August 9, 2024 10:07

pierreguillot merged commit 9ee39f0 into main Aug 9, 2024
12 checks passed

pierreguillot deleted the dev/1.0.0 branch August 9, 2024 10:20

pierreguillot self-assigned this Aug 9, 2024

pierreguillot added the enhancement New feature or request label Aug 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Whisper Vamp Plugin v1 #3

Whisper Vamp Plugin v1 #3

pierreguillot commented Aug 9, 2024

Whisper Vamp Plugin v1 #3

Whisper Vamp Plugin v1 #3

Conversation

pierreguillot commented Aug 9, 2024