Neural Fourier Filter Bank

This repository contains the code (in PyTorch Lightning) for the paper:

Neural Fourier Filter Bank
Zhijie Wu, Yuhe Jin, Kwang Moo Yi
CVPR 2023

Introduction

In this project, we propose to learn a neural field by decomposing the signal both spatially and frequency-wise. We follow the grid-based paradigm for spatial decomposition but, unlike existing work, encourage each grid to store specific frequencies via Fourier feature encodings. We then apply a multi-layer perceptron with sine activations, feeding these Fourier-encoded features in at the appropriate layers so that higher-frequency components are accumulated sequentially on top of lower-frequency ones. These components are summed to form the final output. We evaluate on 2D image fitting, 3D shape reconstruction, and neural radiance fields. All results were obtained on an NVIDIA RTX 3090.
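The idea described above can be illustrated with a toy 1D example. This is only a sketch under simplifying assumptions, not the implementation in this repository: the names `interpolate`, `fourier_encode`, and `ToyFilterBank` are hypothetical, each level uses a single scalar grid feature and one frequency, the weights are random rather than trained, and feeding the input coordinate into the first layer is an assumption.

```python
import math
import random


def interpolate(grid, x):
    """Linearly interpolate a 1D feature grid at x in [0, 1]."""
    pos = x * (len(grid) - 1)
    i = min(int(pos), len(grid) - 2)
    t = pos - i
    return (1.0 - t) * grid[i] + t * grid[i + 1]


def fourier_encode(feature, freq):
    """Encode a scalar grid feature as a (sin, cos) pair at one frequency."""
    phase = 2.0 * math.pi * freq * feature
    return [math.sin(phase), math.cos(phase)]


class ToyFilterBank:
    """Toy model: one grid, one frequency, one sine layer, one head per level."""

    def __init__(self, num_levels=3, base_res=4, seed=0):
        rng = random.Random(seed)
        self.levels = []
        for level in range(num_levels):
            res = base_res * 2 ** level  # finer grid at each level
            self.levels.append({
                "grid": [rng.uniform(-1, 1) for _ in range(res)],
                "freq": 2.0 ** level,  # higher frequency at finer levels
                "weight": [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(2)],
                "head": [rng.uniform(-1, 1) for _ in range(2)],
            })

    def forward(self, x):
        hidden = [x, x]  # assumption: the hidden state starts from the coordinate
        outputs = []
        for lvl in self.levels:
            enc = fourier_encode(interpolate(lvl["grid"], x), lvl["freq"])
            w = lvl["weight"]
            # Sine-activated layer on the previous state, plus this level's
            # Fourier-encoded grid feature.
            hidden = [
                math.sin(w[r][0] * hidden[0] + w[r][1] * hidden[1]) + enc[r]
                for r in range(2)
            ]
            # Per-level linear head; the final output is the sum over levels.
            outputs.append(lvl["head"][0] * hidden[0] + lvl["head"][1] * hidden[1])
        return sum(outputs), outputs


if __name__ == "__main__":
    total, per_level = ToyFilterBank().forward(0.3)
    print(total, per_level)
```

Each level contributes one output, so inspecting `per_level` shows how coarse and fine components add up to the final value.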

If you have any questions, please feel free to contact Zhijie Wu ([email protected]).

Key Requirements

Python 3.8
CUDA 11.6
PyTorch 1.12.0
PyTorch Lightning
torch-scatter
apex
tiny-cuda-nn
Install the requirements with pip install -r requirements.txt

Note: Our current implementation is heavily based on the ngp-pl repo. For further details, please also refer to that codebase.
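Before training, it can help to confirm that the packages listed above are importable. The snippet below is a minimal sketch: the helper `check_requirements` is hypothetical and not part of this repository, and the import names in `REQUIRED_MODULES` are the ones these packages are assumed to install under.

```python
import importlib.util

# Display name -> import name (assumed) for the key requirements.
REQUIRED_MODULES = {
    "PyTorch": "torch",
    "PyTorch Lightning": "pytorch_lightning",
    "torch-scatter": "torch_scatter",
    "apex": "apex",
    "tiny-cuda-nn": "tinycudann",
}


def check_requirements(modules=REQUIRED_MODULES):
    """Return {display name: True if the module can be located, else False}."""
    return {
        name: importlib.util.find_spec(module) is not None
        for name, module in modules.items()
    }


if __name__ == "__main__":
    for name, found in check_requirements().items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

This only checks that each module can be located; it does not verify versions or that the CUDA extensions were built for the local GPU.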

Novel View Synthesis

A quickstart:

python train_nerf.py --root_dir <path/to/lego> --exp_name Lego --num_epochs 30 --lr 2e-2 --eval_lpips --no_save_test

This trains the Lego scene for 30k steps. The --no_save_test flag disables saving the synthesized images.

More options can be found in opt.py and FFB_config.json under the config folder.

To compute the metrics for the eight Blender scenes, please run the script benchmark_synthetic_nerf.sh in the benchmarking folder.
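For readers who prefer to drive the runs from Python, the following sketch builds one quickstart-style command per scene. It is not the content of benchmark_synthetic_nerf.sh, which may use different settings. The helper `build_commands` and the dataset location `data/nerf_synthetic` are hypothetical, and reusing the quickstart flags for every scene is an assumption.

```python
from pathlib import Path

# The eight scenes of the NeRF-synthetic (Blender) dataset.
BLENDER_SCENES = [
    "chair", "drums", "ficus", "hotdog",
    "lego", "materials", "mic", "ship",
]


def build_commands(data_root, num_epochs=30, lr="2e-2"):
    """Return one train_nerf.py command (as an argument list) per scene."""
    commands = []
    for scene in BLENDER_SCENES:
        commands.append([
            "python", "train_nerf.py",
            "--root_dir", str(Path(data_root) / scene),
            "--exp_name", scene.capitalize(),
            "--num_epochs", str(num_epochs),
            "--lr", lr,
            "--eval_lpips",
            "--no_save_test",
        ])
    return commands


if __name__ == "__main__":
    # "data/nerf_synthetic" is a hypothetical dataset location.
    for cmd in build_commands("data/nerf_synthetic"):
        print(" ".join(cmd))
```

The commands are only printed here; each argument list could be passed to `subprocess.run` to launch the runs one after another.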

2D Image Fitting

python train_img.py --config <path/to/config.json> --input_path <path/to/image>

The model is currently trained for 50k iterations, but in our experience it already reaches comparable results after about 20k iterations.
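One common way to compare reconstructions from different iteration counts is PSNR against the input image. The helper below is a minimal sketch assuming pixel values in [0, 1] flattened into plain Python lists. The function `psnr` is hypothetical and is not claimed to match how train_img.py reports its metrics.

```python
import math


def psnr(reference, prediction, max_value=1.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    if len(reference) != len(prediction):
        raise ValueError("reference and prediction must have the same length")
    mse = sum((r - p) ** 2 for r, p in zip(reference, prediction)) / len(reference)
    if mse == 0.0:
        return math.inf  # identical signals
    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    target = [0.0, 0.5, 1.0, 0.25]
    fitted = [0.01, 0.49, 0.98, 0.26]
    print(f"PSNR: {psnr(target, fitted):.2f} dB")
```

Computing this for the outputs at 20k and 50k iterations gives a quick numerical check of how close the two results are.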

3D Shape Fitting

python train_sdf.py --config <path/to/config.json> --input_path <path/to/shape>

As with 2D Image Fitting, the model is trained for 50k iterations to achieve improved geometric details. However, using size=100 instead of size=1000 for the train_dataset in train_sdf.py significantly accelerates training at the cost of slightly lower output quality.

Citation and License

@InProceedings{Wu_2023_CVPR,
    author    = {Wu, Zhijie and Jin, Yuhe and Yi, Kwang Moo},
    title     = {Neural Fourier Filter Bank},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2023},
    pages     = {14153-14163}
}

Our codebase is under the MIT License.

TODO

Finish the CUDA version

