VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition (3DV 2025)

Project page | Paper

We propose a novel Voxel-Cross-Pixel (VXP) approach, which establishes voxel and pixel correspondences in a self-supervised manner and brings them into a shared feature space. We achieve state-of-the-art performance in cross-modal retrieval on the Oxford RobotCar and ViViD++ datasets and on the KITTI benchmark, while maintaining high uni-modal global localization accuracy.
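
To make "cross-modal retrieval in a shared feature space" concrete, here is a minimal sketch of the retrieval step only. It is not the VXP implementation: the descriptors are hand-written toy vectors, and `cosine_similarity` and `retrieve_top_k` are hypothetical helper names chosen for illustration. The idea is simply that once image and LiDAR descriptors live in the same space, a query from one modality can be matched against a database from the other by nearest-neighbor search.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def retrieve_top_k(query, database, k=1):
    """Return the indices of the k database descriptors most similar to the query."""
    scores = [cosine_similarity(query, d) for d in database]
    ranked = sorted(range(len(database)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]


# Toy data (assumption): three LiDAR descriptors and one image descriptor.
lidar_database = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
image_query = [0.1, 0.9, 0.2]

print(retrieve_top_k(image_query, lidar_database, k=2))
```

In practice the descriptors would come from the trained image and LiDAR networks, and a dedicated nearest-neighbor library would replace the linear scan.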

Set up the environment

git clone https://github.com/yunjinli/vxp.git
cd vxp
conda create -n VXP python=3.10 -y
conda activate VXP
pip install torch==2.0.1 torchvision==0.15.2 numpy pandas tqdm tensorboard psutil scikit-learn==1.2.2 bitarray pytorch-metric-learning==0.9.94 torchinfo
pip install -U openmim
mim install mmengine==0.7.3 mmcv==2.0.0 mmdet==3.0.0 mmdet3d==1.1.0
pip install 'git+https://github.com/facebookresearch/detectron2.git'
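
Several packages above are pinned to exact versions, so it can help to confirm what actually got installed. The following is a minimal sketch, not part of the repository: `PINNED`, `installed_version`, and `check_pins` are hypothetical names, and the pins are copied from the commands above. It relies only on the standard-library `importlib.metadata` module.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the install commands above.
PINNED = {
    "torch": "2.0.1",
    "torchvision": "0.15.2",
    "scikit-learn": "1.2.2",
    "pytorch-metric-learning": "0.9.94",
    "mmengine": "0.7.3",
    "mmcv": "2.0.0",
    "mmdet": "3.0.0",
    "mmdet3d": "1.1.0",
}


def installed_version(name):
    """Return the installed version of a distribution, or None if it is missing."""
    try:
        return version(name)
    except PackageNotFoundError:
        return None


def check_pins(pins, lookup=installed_version):
    """Return {package: (expected, found)} for every missing or mismatched package."""
    mismatches = {}
    for name, expected in pins.items():
        found = lookup(name)
        # Local build tags such as "+cu117" are ignored in the comparison.
        if found is None or found.split("+")[0] != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    for name, (expected, found) in check_pins(PINNED).items():
        print(f"{name}: expected {expected}, found {found}")
```

Run inside the activated `VXP` environment, the script prints nothing when every pinned package matches.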

For sparse 3D convolution, we use the spconv library. You can follow the detailed installation guide in its repository, or simply install the prebuilt package that matches your CUDA version (we use CUDA 12.0):

pip install spconv-cu120
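
The package name above encodes the CUDA version (`cu120` for CUDA 12.0). As a small sketch of that naming pattern, the hypothetical helper `spconv_package_name` below builds the name from a version string. It only formats the string: it does not check that a prebuilt wheel exists for that CUDA version, so confirm the available packages in the spconv repository before installing.

```python
def spconv_package_name(cuda_version):
    """Build a pip package name such as 'spconv-cu120' from a CUDA version like '12.0'."""
    parts = cuda_version.strip().split(".")
    if len(parts) < 2 or not all(p.isdigit() for p in parts[:2]):
        raise ValueError(f"expected a version like '12.0', got {cuda_version!r}")
    major, minor = parts[0], parts[1]
    # Only major and minor are used; a patch component such as the last "0" in "11.8.0" is dropped.
    return f"spconv-cu{major}{minor}"


print(spconv_package_name("12.0"))
```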

Dataset Format / Creation

Please see here.

Training

Please see here.

Inference

Please see here.

BibTeX

@article{li2024vxp,
    title={VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition},
    author={Li, Yun-Jin and Gladkova, Mariia and Xia, Yan and Wang, Rui and Cremers, Daniel},
    journal={arXiv preprint arXiv:2403.14594},
    year={2024}
}
