Oncetold Podcast Transcriber

This is a clone of the Modal Podcast Transcriber

This is a complete application that uses OpenAI Whisper to transcribe podcasts. Modal spins up 100-300 containers for a single transcription run, so hours of audio can be transcribed on-demand in a few minutes.

Modal.com LIVE App: You can find the app here: https://modal-labs--whisper-pod-transcriber-fastapi-app.modal.run/
Oncetold App: https://gagglepod--whisper-pod-transcriber-fastapi-app.modal.run

Architecture

The entire application is hosted serverlessly on Modal and consists of 3 components:

React + Vite SPA (pod_transcriber/frontend/)
FastAPI server (pod_transcriber/api.py)
Modal async job queue (pod_transcriber/main.py)

Developing locally

Requirements

account approved Modal.com account
npm
modal installed in your current Python virtual environment

Podchaser Secret

To run this on your own Modal account, you'll need to create a Podchaser account and create an API key.

Once you have the Podchaser account and API keys, then, create a Modal Secret with the following keys:

PODCHASER_CLIENT_SECRET
PODCHASER_CLIENT_ID

It doesn't matter what you call the Modal Secret block -- what matters is that both KEYS (with VALUES) are listed in the block (Note: This will not work locally).

You can find both on their API page.

Vite build

cd into the pod_transcriber/frontend directory, and run:

npm install
npx vite build --watch

The last command will start a watcher process that will rebuild your static frontend files whenever you make changes to the frontend code.

Serve on Modal

Once you have vite build running, in a separate shell run this at the app root to start an ephemeral app on Modal:

modal serve oncetold-podcast-transcriber.main

Pressing Ctrl+C will stop your app.

Deploy to Modal

Once your happy with your changes, cd to the root of the project and run modal deploy oncetold-podcast-transcriber.main to deploy your app to Modal.

Testing

Modal.com deployment allows you to transcribe HTML only at about $.15/60-minutes of audio. However, you can cut and paste what looks like an SRT version of the transcript to your own *.SRT file.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
__pycache__		__pycache__
frontend		frontend
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
api.py		api.py
config.py		config.py
main.py		main.py
podcast.py		podcast.py
search.py		search.py
transcribe_check.py		transcribe_check.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Oncetold Podcast Transcriber

Architecture

Developing locally

Requirements

Podchaser Secret

Vite build

Serve on Modal

Deploy to Modal

Testing

About

Releases

Packages

Languages

gagglepod/oncetold-podcast-transcriber

Folders and files

Latest commit

History

Repository files navigation

Oncetold Podcast Transcriber

Architecture

Developing locally

Requirements

Podchaser Secret

Vite build

Serve on Modal

Deploy to Modal

Testing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages