This is a codebase for preliminary explorations into leveraging diffusion models for image defogging.
This repository is based on the paper *Diffusion Models Beat GANs on Image Synthesis* and its corresponding repository, openai/guided-diffusion.
Below is one checkpoint for a model trained on the defogging task. Before using it, please review the corresponding model card to understand its intended use and limitations.
- 256x256 defogger: `256x256_defogger.pt`
To sample from this model, you can use the `defog_sample.py` script. We assume that you have downloaded the relevant model checkpoints into a folder called `models/`.
For these examples, we will generate 100 samples with batch size 4. Feel free to change these values.
```
SAMPLE_FLAGS="--batch_size 4 --num_samples 100 --timestep_respacing 250"
```
For these runs, we assume you have paired foggy-clear images in `foggy.npz` and `clear.npz`.
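If you need to create these files yourself, the sketch below is one way to package a directory of images into an `.npz` file. It assumes the loader accepts a single uint8 array per file saved under NumPy's default `arr_0` key; check `defog_sample.py` for the exact format it expects.

```python
import numpy as np
from PIL import Image
from pathlib import Path

def pack_images(image_dir, out_path, size=256):
    """Stack all PNGs in a directory into one uint8 array and save it as .npz.

    Assumes the foggy and clear directories sort into the same pair order.
    """
    paths = sorted(Path(image_dir).glob("*.png"))
    imgs = [np.array(Image.open(p).convert("RGB").resize((size, size))) for p in paths]
    arr = np.stack(imgs).astype(np.uint8)  # shape [N, size, size, 3]
    np.savez(out_path, arr)                # stored under NumPy's default key "arr_0"

pack_images("path/to/foggy/images", "foggy.npz")
pack_images("path/to/clear/images", "clear.npz")
```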
```
MODEL_FLAGS="--attention_resolutions 32,16,8 --diffusion_steps 1000 --image_size 256 --learn_sigma True --noise_schedule linear --num_channels 192 --num_heads 4 --num_res_blocks 2 --resblock_updown True --use_scale_shift_norm True"
python defog_sample.py $MODEL_FLAGS --model_path models/256x256_defogger.pt --foggy_data_dir foggy.npz --clear_data_dir clear.npz $SAMPLE_FLAGS
```
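In the upstream guided-diffusion scripts, sampling writes its outputs to an `.npz` file containing a uint8 array of shape `[N, H, W, 3]`. Assuming `defog_sample.py` keeps that convention, the samples can be inspected with a few lines of Python (the file name below is illustrative):

```python
import numpy as np
from PIL import Image

# Load the sampler output; in the upstream code the samples are stored
# under the default key "arr_0" as a uint8 array of shape [N, H, W, 3].
samples = np.load("samples_100x256x256x3.npz")["arr_0"]  # file name is illustrative

# Write each sample out as a PNG for visual inspection.
for i, img in enumerate(samples):
    Image.fromarray(img).save(f"sample_{i:03d}.png")
```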
This table summarizes our results on image defogging:
Dataset | Val set size | PSNR (dB) | SSIM | FID | IS | PD |
---|---|---|---|---|---|---|
NYU Depth Dataset V2 (synthetic) | 100 | 16.67 | 0.80 | 10.79 | 4.87 | 0.56 |
I-HAZE | 7 | 15.39 | 0.66 | 31.59 | 3.65 | 0.63 |
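The exact evaluation pipeline (including FID, IS, and PD) is not reproduced here, but PSNR and SSIM can be computed with standard scikit-image routines; the sketch below shows the per-image computation on uint8 RGB arrays.

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def psnr_ssim(pred, target):
    """Compute PSNR (dB) and SSIM between two uint8 RGB images of the same size."""
    psnr = peak_signal_noise_ratio(target, pred, data_range=255)
    ssim = structural_similarity(target, pred, channel_axis=-1, data_range=255)
    return psnr, ssim

# Example with random arrays standing in for a defogged output and its ground truth.
pred = np.random.randint(0, 256, (256, 256, 3), dtype=np.uint8)
target = np.random.randint(0, 256, (256, 256, 3), dtype=np.uint8)
print(psnr_ssim(pred, target))
```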
Below are sample qualitative results of our model:
- NYU Depth Dataset V2 (synthetic): foggy input, defogged output, and ground truth
- I-HAZE: foggy input, defogged output, and ground truth
To train a model, you can use the following sample hyperparameters:

```
MODEL_FLAGS="--num_channels 192 --num_res_blocks 2 --learn_sigma True --image_size 256 --num_heads 2 --attention_resolutions 32,16,8 --resblock_updown True"
DIFFUSION_FLAGS="--diffusion_steps 1000 --noise_schedule linear --rescale_learned_sigmas False --rescale_timesteps False --use_scale_shift_norm True"
TRAIN_FLAGS="--lr 3e-4 --batch_size 4 --lr_anneal_steps 700"
```

and launch training with the following command:

```
python scripts/defog_train.py --foggy_data_dir path/to/foggy/images --clear_data_dir path/to/clear/images $MODEL_FLAGS $DIFFUSION_FLAGS $TRAIN_FLAGS
```
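Before launching a run, it can be worth confirming that the two data directories actually contain matching pairs. The snippet below only compares file names and makes no assumption about how `defog_train.py` pairs images internally:

```python
from pathlib import Path

foggy = {p.name for p in Path("path/to/foggy/images").iterdir() if p.is_file()}
clear = {p.name for p in Path("path/to/clear/images").iterdir() if p.is_file()}

# Report files that exist in one set but not the other.
print("missing clear counterparts:", sorted(foggy - clear))
print("missing foggy counterparts:", sorted(clear - foggy))
print(f"{len(foggy & clear)} matched pairs found")
```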
More details on training can be found in the repository this codebase was forked from. For the specific hyperparameters the hosted checkpoint was trained with, please see `model-card.md`.