Multifunctional Speech and Text Application

🚀 Live Demo on Hugging Face: Click Here

About the Project

This is a Multifunctional Speech and Text Application built using Streamlit. It combines several powerful features like:

Converting Speech to Text
Generating Speech from Text
Translating Text between languages

This tool is perfect for language learning, accessibility, or just having fun experimenting with text and speech conversions!

Features

Speech-to-Text:
- Upload an audio file (.wav, .mp3, .ogg) and convert it into text using SpeechRecognition.
Text-to-Speech:
- Enter text and convert it to speech using gTTS (Google Text-to-Speech). Supports multiple languages.
Text Translation:
- Translate input text into another language using deep-translator. Supports translation to/from English, French, Spanish, Hindi, and German.

Installation

To set up and run the project locally, follow these steps:

Clone the Repository:

git clone https://github.com/vivekd16/Speech_Text_Processor.git
cd Speech_Text_Processor

Install Dependencies:
Make sure you have Python 3.8+ installed, then run:
```
pip install -r requirements.txt
```
Run the Application:
Launch the Streamlit app locally:
```
streamlit run app.py
```
Access the App:
Open your web browser and navigate to:
```
http://localhost:8501
```
Explore the Features:

Speech-to-Text: Upload an audio file and convert it into text.
Text-to-Speech: Enter text and listen to it in the selected language.
Text Translation: Translate text into multiple supported languages.

Stop the Application:
To stop the Streamlit server, go back to the terminal and press:
```
Ctrl + C
```

Usage

Select a Feature:
Use the sidebar to choose one of the three features:
- Speech-to-Text: Upload an audio file in .wav, .mp3, or .ogg format to convert it into text.
- Text-to-Speech: Enter text and convert it into speech. You can download the audio or listen to it directly.
- Text Translation: Input text and select the target language to translate into.
Interact with Results:
- For Speech-to-Text, view the transcribed text in the app.
- For Text-to-Speech, listen to the generated speech or save the audio file.
- For Text Translation, copy or use the translated text directly.
Real-Time Processing:
All features are designed for immediate interaction, making it user-friendly and efficient.

Technologies Used

This project leverages the following technologies:

Streamlit: Framework for building interactive web applications.
gTTS: Library for generating speech from text.
SpeechRecognition: For converting speech to text from uploaded audio files.
Deep-Translator: For translating text between languages.
PyDub: For handling audio file formats and conversions.

Live Demo

Experience the application live on Hugging Face Spaces:

👉 Multifunctional Speech and Text Application

Simply navigate to the link, and start using the app without any installation!

Let me know if you need further updates! 😊

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Multifunctional Speech and Text Application

🚀 Live Demo on Hugging Face: Click Here

Table of Contents

About the Project

Features

Installation

Usage

Technologies Used

Live Demo

Files

README.md

Latest commit

History

README.md

File metadata and controls

Multifunctional Speech and Text Application

🚀 Live Demo on Hugging Face: Click Here

Table of Contents

About the Project

Features

Installation

Usage

Technologies Used

Live Demo