🚀 Live Demo on Hugging Face: Click Here
This is a multifunctional speech and text application built with Streamlit. It combines three features:
- Converting Speech to Text
- Generating Speech from Text
- Translating Text between languages
The tool suits language learning, accessibility work, or simply experimenting with text and speech conversions.
- Speech-to-Text:
  - Upload an audio file (.wav, .mp3, .ogg) and convert it into text using SpeechRecognition.
- Text-to-Speech:
  - Enter text and convert it to speech using gTTS (Google Text-to-Speech). Supports multiple languages.
- Text Translation:
  - Translate input text into another language using deep-translator. Supports translation to/from English, French, Spanish, Hindi, and German.
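As a rough illustration of how the Text-to-Speech and Text Translation features could be wired up, here is a minimal sketch. It assumes the app maps the five listed languages to ISO 639-1 codes and calls gTTS and deep-translator's GoogleTranslator directly; the names LANGUAGE_CODES, language_code, translate_text, and synthesize_speech are hypothetical and are not taken from the project's source.

```python
from io import BytesIO

# Display names from the feature list mapped to ISO 639-1 codes.
# Assumption: the real app may store or name its languages differently.
LANGUAGE_CODES = {
    "English": "en",
    "French": "fr",
    "Spanish": "es",
    "Hindi": "hi",
    "German": "de",
}


def language_code(name: str) -> str:
    """Return the code for a supported language name, or raise ValueError."""
    try:
        return LANGUAGE_CODES[name]
    except KeyError:
        raise ValueError(f"Unsupported language: {name!r}") from None


def translate_text(text: str, target: str) -> str:
    """Translate text into the target language (needs network access)."""
    # Imported lazily so the helpers above work without the package installed.
    from deep_translator import GoogleTranslator

    translator = GoogleTranslator(source="auto", target=language_code(target))
    return translator.translate(text)


def synthesize_speech(text: str, language: str = "English") -> bytes:
    """Return MP3 bytes for the text using gTTS (needs network access)."""
    from gtts import gTTS

    buffer = BytesIO()
    gTTS(text=text, lang=language_code(language)).write_to_fp(buffer)
    return buffer.getvalue()
```

Returning bytes from synthesize_speech keeps the audio in memory, which is convenient if the result is handed straight to an audio player or a download button instead of being written to disk.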
To set up and run the project locally, follow these steps:
- Clone the Repository:
  git clone https://github.com/vivekd16/Speech_Text_Processor.git
  cd Speech_Text_Processor
- Install Dependencies:
  Make sure you have Python 3.8+ installed, then run: pip install -r requirements.txt
- Run the Application:
  Launch the Streamlit app locally: streamlit run app.py
- Access the App:
  Open your web browser and navigate to: http://localhost:8501
- Explore the Features:
  - Speech-to-Text: Upload an audio file and convert it into text.
  - Text-to-Speech: Enter text and listen to it in the selected language.
  - Text Translation: Translate text into multiple supported languages.
- Stop the Application:
  To stop the Streamlit server, go back to the terminal and press: Ctrl + C
- Select a Feature:
  Use the sidebar to choose one of the three features:
  - Speech-to-Text: Upload an audio file in .wav, .mp3, or .ogg format to convert it into text.
  - Text-to-Speech: Enter text and convert it into speech. You can download the audio or listen to it directly.
  - Text Translation: Input text and select the target language to translate into.
- Interact with Results:
  - For Speech-to-Text, view the transcribed text in the app.
  - For Text-to-Speech, listen to the generated speech or save the audio file.
  - For Text Translation, copy or use the translated text directly.
- Real-Time Processing:
  All features are designed for immediate interaction, so results appear in the app as soon as processing finishes.
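The sidebar-driven flow described above could look roughly like the following Streamlit sketch. It is an illustration only: the widget labels, the use of a sidebar select box, and the placeholder messages are assumptions, and the actual app.py may be structured differently. The processing steps are left as placeholders so the sketch stays focused on feature selection.

```python
FEATURES = ("Speech-to-Text", "Text-to-Speech", "Text Translation")
AUDIO_TYPES = ["wav", "mp3", "ogg"]
TARGET_LANGUAGES = ["English", "French", "Spanish", "Hindi", "German"]


def main() -> None:
    # Imported lazily so the constants above can be reused without Streamlit.
    import streamlit as st

    st.title("Multifunctional Speech and Text Application")
    feature = st.sidebar.selectbox("Select a feature", FEATURES)

    if feature == "Speech-to-Text":
        upload = st.file_uploader("Upload an audio file", type=AUDIO_TYPES)
        if upload is not None:
            st.audio(upload)  # play back the uploaded audio
            # Placeholder: the real app would show the transcription here.
            st.write(f"Received {upload.name}; the transcription would appear here.")
    elif feature == "Text-to-Speech":
        text = st.text_area("Text to convert to speech")
        if st.button("Generate speech") and text.strip():
            # Placeholder: the real app would offer playback and a download here.
            st.write("The generated audio would be playable and downloadable here.")
    else:
        text = st.text_area("Text to translate")
        target = st.selectbox("Target language", TARGET_LANGUAGES)
        if st.button("Translate") and text.strip():
            # Placeholder: the real app would show the translated text here.
            st.write(f"The translation into {target} would appear here.")
```

To try the sketch, add a call to main() at the bottom of the file and start it with streamlit run.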
This project leverages the following technologies:
- Streamlit: Framework for building interactive web applications.
- gTTS: Library for generating speech from text.
- SpeechRecognition: For converting speech to text from uploaded audio files.
- deep-translator: For translating text between languages.
- PyDub: For handling audio file formats and conversions.
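To show how PyDub and SpeechRecognition can fit together, here is a minimal sketch of a transcription pipeline. SpeechRecognition's AudioFile reads WAV, AIFF, and FLAC, so the sketch converts an upload to WAV with PyDub first. The function names audio_format and transcribe are hypothetical, and the choice of the Google Web Speech backend (recognize_google) is an assumption, since the source does not say which recognizer the app uses. PyDub needs ffmpeg available to decode .mp3 and .ogg files.

```python
import os
import tempfile

SUPPORTED_EXTENSIONS = {".wav", ".mp3", ".ogg"}


def audio_format(filename: str) -> str:
    """Return the format name (for example "mp3") of a supported upload."""
    extension = os.path.splitext(filename)[1].lower()
    if extension not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"Unsupported audio file: {filename!r}")
    return extension.lstrip(".")


def transcribe(path: str, language: str = "en-US") -> str:
    """Convert the audio file to WAV and transcribe it (needs network access)."""
    # Imported lazily so audio_format works without these packages installed.
    import speech_recognition as sr
    from pydub import AudioSegment

    segment = AudioSegment.from_file(path, format=audio_format(path))
    with tempfile.TemporaryDirectory() as workdir:
        wav_path = os.path.join(workdir, "converted.wav")
        segment.export(wav_path, format="wav")

        recognizer = sr.Recognizer()
        with sr.AudioFile(wav_path) as source:
            audio = recognizer.record(source)  # read the whole file into memory

    # Assumption: the Google Web Speech backend; other recognizers exist.
    return recognizer.recognize_google(audio, language=language)
```

Checking the extension before decoding gives a clear error for unsupported uploads instead of a decoder failure later in the pipeline.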
Experience the application live on Hugging Face Spaces:
👉 Multifunctional Speech and Text Application
Follow the link to use the app without installing anything.