Code for How to Use Diffusion Priors under Sparse Views? (NeurIPS 2024)

Installation

Ubuntu 22.04, CUDA 11.3, PyTorch 1.12.1

conda env create --file environment.yaml
conda activate ipsm

pip install ./submodules/diff-gaussian-rasterization-confidence ./simple-knn

Pre-trained Models Preparation

mkdir pretrained_models
cd pretrained_models

Download StableDiffusion-v1.5, StableDiffusionInpainting-v1.5, MiDaS, BLIP to ./pretrained_models/. (NOTE: Stable Diffusion V1.5 and Stable Diffusion Inpainting V1.5 cannot be downloaded from the original repo, but the same weight can be obtained from other clone repo.)

Data Preparation

LLFF

Download LLFF from the official download link.
Run COLMAP to obtain initial point clouds with sparse views:
```
python tools/colmap_llff.py
```
Randomly select one image from sparse views and run BLIP to obtain its blip-based text results:
```
python ./scripts/script_for_blip.py
```

The data format is supposed to be:

|- <scene>
    |- 3_views
    |- images
    |- images_4
    |- images_8
    |- sparse
    |- blip_rst.txt
    |- poses_bounds.npy
    |- ...

DTU

Download DTU dataset
- Download the DTU dataset "Rectified (123 GB)" from the official website, and extract it.
- Download masks (used for evaluation only) from this link.
Preprocess following DNGaussian
- Poses: following gaussian-splatting, run convert.py to get the poses and the undistorted images by COLMAP.
- Render Path: following LLFF to get the poses_bounds.npy from the COLMAP data. (Optional)
Run COLMAP to obtain initial point clouds with sparse views:
```
python tools/colmap_dtu.py
```
Randomly select one image from sparse views and run BLIP to obtain its blip-based text results:
```
python blip_script.py
```

The data format is supposed to be:

|- <scene>
    |- 3_views
    |- images
    |- images_2
    |- images_4
    |- images_8
    |- mask
    |- sparse
    |- blip_rst.txt
    |- poses_bounds.npy
    |- ...

Training & Rendering & Evaluating

Train & Render & Evaluate IPSM-Gaussian on the LLFF dataset with 3 views:

python ./scripts/script_for_llff.py

Train & Render & Evaluate IPSM-Gaussian on the LLFF dataset with 3 views:

python ./scripts/script_for_dtu.py

Acknowledgement

This code is developed on gaussian-splatting, FSGS, and DNGaussian. Thanks for these great projects!

Citation

@inproceedings{
wang2024how,
title={How to Use Diffusion Priors under Sparse Views?},
author={Qisen Wang and Yifan Zhao and Jiawei Ma and Jia Li},
booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
year={2024},
url={https://openreview.net/forum?id=i6BBclCymR}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
arguments		arguments
configs		configs
gaussian_renderer		gaussian_renderer
lpipsPyTorch		lpipsPyTorch
scene		scene
scripts		scripts
submodules		submodules
tools		tools
utils		utils
LICENSE.md		LICENSE.md
README.md		README.md
environment.yaml		environment.yaml
metrics.py		metrics.py
metrics_dtu.py		metrics_dtu.py
render.py		render.py
sd_guidance.py		sd_guidance.py
train.py		train.py
train_dtu_mask.py		train_dtu_mask.py
train_record_npy.py		train_record_npy.py
warp.py		warp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Code for How to Use Diffusion Priors under Sparse Views? (NeurIPS 2024)

Installation

Pre-trained Models Preparation

Data Preparation

LLFF

DTU

Training & Rendering & Evaluating

Acknowledgement

Citation

About

Releases

Packages

Languages

License

iCVTEAM/IPSM

Folders and files

Latest commit

History

Repository files navigation

Code for How to Use Diffusion Priors under Sparse Views? (NeurIPS 2024)

Installation

Pre-trained Models Preparation

Data Preparation

LLFF

DTU

Training & Rendering & Evaluating

Acknowledgement

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages