Introducing Word2Vid, a groundbreaking solution designed to tackle the common struggles college students encounter with PDF reading and comprehension. Recognizing the prevalent preference for video content over traditional text, Word2Vid offers a transformative approach, enabling users to effortlessly convert textual material into captivating video presentations. With this innovative tool, users can delve into their study materials through engaging visual content, enhancing understanding and retention while fostering creativity and adaptability.
Yes, Word2Vid addresses the real-life problem of transforming textual content, such as PDF documents or prompts, into engaging video content. This solution caters to individuals, particularly visual learners, who prefer consuming information through videos rather than text-based materials. Additionally, it offers a more efficient way for users to convey information or ideas in a visually appealing format.
The problem solved by Word2Vid is highly marketable, especially in educational and professional settings where effective communication and content delivery are essential. By providing a seamless and innovative tool for creating videos from textual content, Word2Vid offers a competitive advantage over existing solutions. Its nominal pricing and additional feature of speech conversion further enhance its marketability, making it appealing to a wide range of users.
Yes, it is safe. It has been tested with a range of inputs, from long PDFs to short text prompts.
Yes, the project works without errors. Generating a video from a long text or PDF takes a while because of content parsing, so please be patient.
- Frontend: Utilizing React for a sleek and responsive user interface, ensuring an intuitive experience for users interacting with the platform.
- OpenAI API Integration: Leveraging the OpenAI API to seamlessly incorporate relevant pictures into the video content, enhancing visual appeal and comprehension.
- Backend Development: Employing Python for robust backend functionality, facilitating efficient processing and management of user requests and data.
- Creates videos from prompts and PDFs
- Nominal cost
- Better than the alternatives already available on the market
- Includes speech conversion
/backend
├── .__pycache__/
│ └── API_KEY
├── .idea/
│ └── modules and script
│ └── All downloaded dependencies
├── use.py - splits the pdf into smaller segments
├── text_gen.py
├── video_gen.py
└── final_vide.mp4 - the final output generated
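The tree above says that use.py splits the PDF into smaller segments, but the repository's actual logic is not shown here. The following is only a minimal sketch of one way such a split could work once the PDF text has already been extracted into a string. The function name `split_into_segments` and the `max_chars` parameter are hypothetical and are not taken from use.py.

```python
def split_into_segments(text, max_chars=1000):
    """Split text into whitespace-delimited segments of at most max_chars.

    Hypothetical helper: a single word longer than max_chars is kept whole,
    so that one segment may exceed the limit.
    """
    segments = []
    current = ""
    for word in text.split():
        candidate = f"{current} {word}".strip()
        if len(candidate) > max_chars and current:
            # The current segment is full; start a new one with this word.
            segments.append(current)
            current = word
        else:
            current = candidate
    if current:
        segments.append(current)
    return segments


if __name__ == "__main__":
    sample = "The OSI model has seven layers. " * 10
    for index, segment in enumerate(split_into_segments(sample, max_chars=80)):
        print(index, len(segment), segment)
```

Splitting on word boundaries rather than at a fixed character offset keeps words intact, which matters if each segment is later sent to a text or speech model.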
All commands are run from the root of the project, from a terminal:
| Command |
| --- |
| cd Frontend |
| npm i |
| npm start |
This repository contains a backend folder in which there are scripts for text generation and video generation from PDFs.
- text_gen.py: Script for generating text using OpenAI's GPT-3 model.
- textgen_pdf.py: Script for generating videos from PDFs.
- api_key.py: Python file to store your OpenAI API key.
- Osi_model.pdf: Sample PDF file used for text generation (you can replace it with your own PDF).
- Clone the repository:

      git clone <repository_url>

- Navigate to the project directory:

      cd <project_directory>

- Navigate to the backend directory:

      cd backend

- Install the required Python packages:

      pip install -r requirements.txt

- Create a file named api_key.py in the same directory as the scripts:

      echo "API_KEY = 'your_openai_api_key_here'" > api_key.py

  Replace 'your_openai_api_key_here' with your actual OpenAI API key.

- Run text_gen.py to generate text:

      python text_gen.py

- Run textgen_pdf.py to generate videos from a PDF file:

      python textgen_pdf.py

  Make sure you have a PDF file in the same directory. You can use any PDF file of your choice; Osi_model.pdf is provided as a sample.
- Ensure that the PDF file you want to use for text generation is in the same directory as the script.
- Replace your_openai_api_key_here with your actual OpenAI API key in api_key.py.
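The two notes above can be checked automatically before running the scripts. The sketch below is not part of the repository. It assumes api_key.py contains a single `API_KEY = '...'` assignment, as produced by the echo command in the setup steps, and the names `read_api_key` and `preflight` are hypothetical.

```python
import re
from pathlib import Path

# Placeholder text from the setup instructions.
PLACEHOLDER = "your_openai_api_key_here"


def read_api_key(directory):
    """Return the API_KEY string found in api_key.py, or None if absent."""
    key_file = Path(directory) / "api_key.py"
    if not key_file.is_file():
        return None
    match = re.search(r"""API_KEY\s*=\s*['"]([^'"]*)['"]""", key_file.read_text())
    return match.group(1) if match else None


def preflight(directory="."):
    """Return a list of setup problems; an empty list means both notes hold."""
    problems = []
    key = read_api_key(directory)
    if key is None:
        problems.append("api_key.py is missing or does not define API_KEY")
    elif key == PLACEHOLDER or not key.strip():
        problems.append("API_KEY still holds the placeholder value")
    if not any(Path(directory).glob("*.pdf")):
        problems.append("no PDF file found next to the scripts")
    return problems


if __name__ == "__main__":
    for problem in preflight():
        print("warning:", problem)
```

Running this from the backend directory prints one warning per unmet condition and prints nothing when the key has been replaced and a PDF is present.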
https://youtu.be/t5sqSTbHwYg?feature=shared
Presentation Link: Click here to go to our presentation.