Stable diffusion video modifier

What

This is a wrapper around neonsecret's modified Stable Diffusion model, which lets users with lower-end hardware (less than 8 GB of VRAM) run Stable Diffusion without upgrading their rig. The script I added takes a video file and performs a Stable Diffusion image operation on every frame. For performance reasons it can also lower the fps of the video, since you might not want to wait multiple days to render your 3 minute 60 fps video.
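To get a feel for why lowering the frame rate matters, here is a rough back-of-the-envelope sketch. The function name and the figure of 10 seconds per frame are assumptions for illustration only; the real per-frame time depends on your GPU, image size and sampler settings.

```python
def estimate_render_hours(duration_s: float, fps: float, seconds_per_frame: float) -> float:
    """Rough render time in hours when every frame is processed once."""
    frame_count = round(duration_s * fps)
    return frame_count * seconds_per_frame / 3600


# Assumed figure for illustration only: 10 s of processing per frame.
SECONDS_PER_FRAME = 10.0

full = estimate_render_hours(180, 60, SECONDS_PER_FRAME)     # 3 minute clip at 60 fps
reduced = estimate_render_hours(180, 15, SECONDS_PER_FRAME)  # same clip lowered to 15 fps
print(f"60 fps: {full:.1f} h, 15 fps: {reduced:.1f} h")
```

Under that assumption the 3 minute clip has 10800 frames at 60 fps but only 2700 at 15 fps, so the estimated wait shrinks by the same factor of four.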

How

I have replaced the existing Gradio user interface with a simple REST API for interacting with the Stable Diffusion model. This implementation is found in the dockerized version of the model. I prefer the dockerized version because I had difficulties installing the prerequisites for the bare-bones one, and I suspect others might run into the same issues. The video_altering script takes the input video, separates it into individual frames, lowers the frame rate on request and modifies the frames via the REST API. The resulting images are then reassembled into a video. Currently the audio of the video is lost.
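The flow described above (split, thin out, modify, reassemble) can be sketched roughly as follows. This is not the code from video_altering.py: the function names are made up for illustration, and modify_frame is a hypothetical stand-in for the REST call to the model, whose endpoint and payload are not described here. Only standard OpenCV calls from opencv-python are used.

```python
from typing import Callable, List


def select_frame_indices(total_frames: int, src_fps: float, target_fps: float) -> List[int]:
    """Return the source frame indices to keep when lowering src_fps to target_fps."""
    if src_fps <= 0 or target_fps <= 0:
        raise ValueError("frame rates must be positive")
    if target_fps >= src_fps:
        return list(range(total_frames))  # nothing to drop
    step = src_fps / target_fps  # e.g. 60 -> 15 fps keeps every 4th frame
    indices = []
    i = 0
    while int(i * step) < total_frames:
        indices.append(int(i * step))
        i += 1
    return indices


def alter_video(src_path: str, dst_path: str, target_fps: float, modify_frame: Callable) -> None:
    """Split a video into frames, thin them out, modify them and write a new video.

    modify_frame is a hypothetical stand-in for the REST call to the model:
    it receives one BGR frame (numpy array) and returns the altered frame.
    """
    import cv2  # from opencv-python; imported lazily so the helper above has no dependencies

    cap = cv2.VideoCapture(src_path)
    src_fps = cap.get(cv2.CAP_PROP_FPS)
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))  # as reported by the file's metadata
    keep = set(select_frame_indices(total, src_fps, target_fps))

    writer = None
    index = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        if index in keep:
            altered = modify_frame(frame)
            if writer is None:
                # Create the writer lazily, sized to the first altered frame.
                height, width = altered.shape[:2]
                fourcc = cv2.VideoWriter_fourcc(*"mp4v")
                writer = cv2.VideoWriter(dst_path, fourcc, min(src_fps, target_fps), (width, height))
            writer.write(altered)  # only the picture is written, so audio is lost
        index += 1

    cap.release()
    if writer is not None:
        writer.release()
```

For a quick dry run without the model, something like alter_video("in.mp4", "out.mp4", 15, lambda frame: frame) would only lower the frame rate. Because the writer receives nothing but image frames, this also shows why the audio track does not survive.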

Prerequisites

python3 (tested with 3.10.6)
pip install opencv-python
docker + compose

How to use

Only tested on Linux so far.
Create the folders specified in the docker-compose file in the parent directory of your clone of this repository ('../sd-data', '../sd-output', '../sd-input'). Do this beforehand, because Docker would otherwise create them as root, and they would then not be accessible without changing their ownership or permissions.
Get the pretrained Stable Diffusion weights from Hugging Face as described in the [official stable diffusion repo](https://github.com/CompVis/stable-diffusion#:~:text=The%20weights%20are%20available) and place (or symlink) them in the '../sd-data' directory. The model file must be named 'model.ckpt'.
Start the containers with 'docker compose up --build'. After the first successful build you only need to run 'docker compose up'.
Launch the video altering script with the following parameters: 'python video_altering.py PROJECT_NAME ABSOLUTE_VIDEO_INPUT_PATH ABSOLUTE_VIDEO_OUTPUT_PATH "PROMPT"'
PROJECT_NAME = arbitrary name used for folder structures, e.g. 'testproject'
ABSOLUTE_VIDEO_INPUT_PATH, ABSOLUTE_VIDEO_OUTPUT_PATH = absolute paths to the input and output video, e.g. '/home/user/Code/vid2vid/samplevid.mp4'
PROMPT = the prompt to alter the frames with, e.g. '"oil painting style"'

e.g.: 'python video_altering.py testproject /home/USER/inputvid.mp4 /home/USER/newinputvid.mp4 "comic style artwork"'

For more settings (like FPS adjustments, image size and so on) check out the parameterization at the end of the video_altering file.
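The folder and weights setup from the steps above could also be scripted. The following is a minimal sketch, assuming it is run as your normal user before the first 'docker compose up'; the function name prepare_folders is made up for illustration and is not part of this repository.

```python
from pathlib import Path
from typing import List

# Folder names taken from the setup steps; they live next to the repository clone.
REQUIRED_DIRS = ("sd-data", "sd-output", "sd-input")


def prepare_folders(repo_dir: Path) -> List[str]:
    """Create the sibling folders as the current user and report what is still missing."""
    parent = repo_dir.resolve().parent
    for name in REQUIRED_DIRS:
        (parent / name).mkdir(exist_ok=True)

    problems = []
    weights = parent / "sd-data" / "model.ckpt"
    # exists() follows symlinks, so a dangling symlink is reported as missing too.
    if not weights.exists():
        problems.append(f"missing weights: {weights}")
    return problems
```

Calling prepare_folders(Path.cwd()) from inside the repository clone creates the three folders with your own ownership and returns an empty list once 'model.ckpt' is in place.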

ToDo

Port over the audio of the video
