[feature request] whisper #1251

0wwafa · 2024-12-05T22:05:27Z

Since you are the only one (my hero!) who still supports CLBLAST, and since you already use whisper, can you make kobold.cpp act like whisper.cpp, so I can use it to transcribe some long italian movies and translate them to english?

LostRuins · 2024-12-06T02:14:23Z

It already exists! You just need to load the whisper model with --whispermodel. See https://github.com/LostRuins/koboldcpp/wiki#what-is-whisper

KoboldCpp can also transcribe wav audio files. using the OpenAI compatible endpoint. For example:

curl --request POST \
  --url http://localhost:5001/v1/audio/transcriptions \
  --header 'Content-Type: multipart/form-data' \
  --form file=@/path/to/file/audio.wav\

0wwafa · 2024-12-06T15:14:42Z

I see. it would be somewhat more useful to have it directly like:
./kobold.cpp -wm model.bin -ot time_offset -p "additional prompt" input.wav

or something like that.
but I'll try it out in server mode.

LostRuins · 2024-12-20T06:05:41Z

Hi, can you please try the latest version 1.80, I've added a feature to upload files to transcribe from the GUI

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feature request] whisper #1251

[feature request] whisper #1251

0wwafa commented Dec 5, 2024

LostRuins commented Dec 6, 2024

0wwafa commented Dec 6, 2024

LostRuins commented Dec 20, 2024

[feature request] whisper #1251

[feature request] whisper #1251

Comments

0wwafa commented Dec 5, 2024

LostRuins commented Dec 6, 2024

0wwafa commented Dec 6, 2024

LostRuins commented Dec 20, 2024