Skip to content

Commit

Permalink
docs: update whisper-manual.md
Browse files Browse the repository at this point in the history
  • Loading branch information
pierreguillot committed Dec 13, 2024
1 parent 28aa526 commit 7785990
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions docs/whisper-manual.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ The Whisper plugin is an implementation of the [Whisper](https://github.com/open

The Whisper plugin analyses the text in the audio stream and generates markers corresponding to phrases, words or tokens (depending on the *Split Mode* parameter). The *Suppress Non-Speech Tokens* parameter controls whether non-speech tokens are generated (only usable with *Split Mode* on *Tokens*).

The lightweight ggml-base-q5_1 model is embedded in the plugin and the other q5 models (tiny, small, medium, and large_v2) will be installed on your system. The *Model* parameter is used to select which model to use. You can also download and use other models that may be more appropriate to your needs. Please, refer to the following section dedicated to models.
The lightweight ggml-base-q5_1 model is embedded in the plugin and the other models will be installed on your system. The *Model* parameter is used to select which model to use. You can also download and use other models that may be more appropriate to your needs. Please, refer to the following section dedicated to models.

The Whisper Vamp Plugin has been designed for use in the free audio analysis application [Partiels](https://forum.ircam.fr/projects/detail/partiels/).

Expand All @@ -44,7 +44,7 @@ Use the installer for your operating system. The plugin dynamic library (*whispe

## Models

By default, the plugin embeds the ggml-base-q5_1, which requires little space and offers fairly fast calculation and the other q5 models (tiny, small, medium, large_v2, and large_v3) are installed on your system.
By default, the plugin embeds the ggml-base-q5_1, which requires little space and offers fairly fast calculation and other models small-q5_1, medium-q5_0, large-v2-q5_0, and large-v3-turbo-q5_0 (Mac & Linux only) are installed on your system.

It is possible to download other models that are potentially more robust or more adapted to your needs. To do this, you can use the `download-ggml-model.sh/cmd` scripts from Georgi Gerganov's [Whisper.cpp](https://github.com/ggerganov/whisper.cpp) project and provided with the plugin package. Once downloaded, these models must be installed in the repository:
- Linux: `~/.config/Ircam/whispermodels` or `/opt/Ircam/whispermodels`
Expand Down

0 comments on commit 7785990

Please sign in to comment.