Speech-To-Text-Converter

A simple program that uses Whisper, an AI speech recognition model, to convert spoken words into text in real time. The goal of this project is to learn how to implement speech recognition AI models so I can use them properly when building my upcoming AI assistant, I.V.E.N.
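As a rough idea of what the Whisper side can look like, here is a minimal sketch that transcribes a recorded audio file. It assumes the `openai-whisper` package (`pip install openai-whisper`) and FFmpeg are installed. The function name `transcribe_file` and the default `"base"` model size are illustrative choices, not necessarily what this project uses.

```python
from pathlib import Path


def transcribe_file(audio_path, model_name="base"):
    """Transcribe a recorded audio file with Whisper and return the text.

    Hypothetical helper: the name and the default model size are assumptions.
    """
    path = Path(audio_path)
    if not path.is_file():
        raise FileNotFoundError(f"No audio file at {path}")

    # Imported lazily because loading Whisper (and PyTorch) is slow.
    import whisper

    model = whisper.load_model(model_name)   # downloads the weights on first use
    result = model.transcribe(str(path))     # Whisper decodes the audio through FFmpeg
    return result["text"].strip()
```

Calling `transcribe_file("recording.wav")` (a placeholder file name) would return the recognized text as a string. Larger model sizes are generally more accurate but slower.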

Reference Videos

Technologies to Download

  • Python (version 3.13 at the time of development)
  • PyTorch
  • Package Manager (Chocolatey for Windows or Homebrew for Mac)
  • FFmpeg
  • Vosk (Realtime Speech Recognition) => pip install vosk
  • Whisper AI (Recorded Audio Files)
  • PyAudio
  • ipywidgets
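
Once everything above is installed, a quick check like the following can confirm that Python can see it. This is only a sketch: the import names (`torch`, `vosk`, `whisper`, `pyaudio`, `ipywidgets`) are the usual module names for these packages, and `check_dependencies` is a made-up helper name, not part of this project.

```python
import importlib.util
import shutil

# Assumed import names for the packages listed above.
PYTHON_MODULES = {
    "PyTorch": "torch",
    "Vosk": "vosk",
    "Whisper": "whisper",
    "PyAudio": "pyaudio",
    "ipywidgets": "ipywidgets",
}


def check_dependencies():
    """Return a dict mapping each dependency name to True if it was found."""
    status = {
        name: importlib.util.find_spec(module) is not None
        for name, module in PYTHON_MODULES.items()
    }
    # FFmpeg is a command-line tool, so look for it on PATH instead.
    status["FFmpeg"] = shutil.which("ffmpeg") is not None
    return status


if __name__ == "__main__":
    for name, found in check_dependencies().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

Anything reported as missing can be installed with `pip`, or with Chocolatey or Homebrew in the case of FFmpeg.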