A Python script for transcribing large audio files using the Groq API, with support for:
- Handling files larger than 25MB
- Audio downsampling
- Chunk-based transcription
- Flexible language and prompt options
- Python 3.8+
- FFmpeg installed
- Groq API Key
git clone https://transcript-with-whisper-groq.git
cd audio-transcription-tool
python3 -m venv venv
source venv/bin/activate # On Windows use `venv\Scripts\activate`
pip install -r requirements.txt
Create a .env
file in the project root:
GROQ_API_KEY=your_groq_api_key_here
python transcribe.py input_audio.mp3 output_transcription.txt --language en
--language
: Specify audio language (ISO 639-1 code)--prompt
: Add context for transcription
- Zip the project including:
transcribe.py
requirements.txt
- Installed dependencies
- Set
GROQ_API_KEY
as an environment variable in Lambda
- Ensure FFmpeg is installed
- Check API key permissions
- Verify audio file format
[Your License Here]