For a demonstration of the project, click here
-
Download the Dataset
- Obtain the Flickr8k dataset and store the images in a folder named
Flicker8k_Dataset
within the project directory. - Each image should have the path structure:
Flicker8k_Dataset/<image_name>.jpg
.
- Obtain the Flickr8k dataset and store the images in a folder named
-
Generate Output and Checkpoints
- Open and execute each cell in the provided
.ipynb
notebook. This step allows you to view the output generated at each stage and to create checkpoints during model training. - The application will automatically save the 5 most recent checkpoints in a newly created
checkpoints
directory. - Note: If you only wish to run the application without additional training, pre-generated captions are stored in
data.json
.
- Open and execute each cell in the provided
-
Run the Server
- Start the application by running
server.py
, which contains the backend code.
- Start the application by running
Upon uploading images, the application generates captions and saves them along with the image name as a JSON object. Uploaded images are also stored in a gallery
folder for future reference.