[Project Website 🎯] [Paper 📃] [Code ]
This repository contrains code for the paper ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions.
-
Environment setup
- Use provided
Dockerfile
to build the environment or install the packages manually.docker build -t showhowto . docker run -it --rm -v $(pwd):$(pwd) -w $(pwd) --gpus=1 showhowto:latest bash
- The code, as written, requires a GPU.
- Use provided
-
Download ShowHowTo model weights
- Use
download_weights.sh
script or download the ShowHowTo weights manually.
- Use
-
Get predictions
- Run the following command to get example predictions.
python predict.py --ckpt_path ./weights/showhowto_2to8steps.pt --prompt_file ./test_data/prompt_file.txt --unconditional_guidance_scale 7.5
- To run the model on your images and prompts, replace
./test_data/prompt_file.txt
with your prompt file.
- Run the following command to get example predictions.
The training code will be released in the coming weeks.
The dataset will be released in the coming weeks.
@article{soucek2024showhowto,
title={ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions},
author={Sou\v{c}ek, Tom\'{a}\v{s} and Gatti, Prajwal and Wray, Michael and Laptev, Ivan and Damen, Dima and Sivic, Josef},
month = {December},
year = {2024}
}