We propose a novel approach that leverages lidar annotations to train image segmentation models directly on RGB images. The approach consists of four main parts: point cloud road annotation, data preparation, masked loss, and the segmentation model itself. The key innovation is the masked loss, which addresses the sparsity of ground-truth masks obtained from lidar point clouds. By computing the loss only at pixels where lidar points exist, the model learns road segmentation on images using lidar points as ground truth. The approach is flexible enough to mix lidar data with 2D ground truth, which further improves prediction quality.
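To make the idea concrete, here is a minimal PyTorch sketch of the masked loss. The function name, shapes, and the choice of binary cross-entropy are illustrative assumptions, not the repository's exact implementation:

```python
import torch
import torch.nn.functional as F

def masked_bce_loss(logits, lidar_gt, lidar_mask):
    """Road-segmentation loss computed only where projected lidar points exist.

    logits     -- raw model output, shape (B, 1, H, W)
    lidar_gt   -- sparse road labels projected from the point cloud, float, (B, 1, H, W)
    lidar_mask -- 1 where any lidar point was projected, 0 elsewhere, float, (B, 1, H, W)
    """
    per_pixel = F.binary_cross_entropy_with_logits(logits, lidar_gt, reduction="none")
    # Zero out pixels with no lidar coverage and average over the covered pixels only.
    covered = lidar_mask.sum().clamp(min=1.0)
    return (per_pixel * lidar_mask).sum() / covered
```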
Lidar Annotation Is All You Need
Dinar Sharafutdinov, Stanislav Kuskov, Saian Protasov, Alexey Voropaev
You can find a detailed description of our approach in the paper. The work was published in the IEEE Access journal. If you find our work useful for your research, please consider giving it a star ⭐ and citing the paper:
@article{sharafutdinov2023lidar,
author={Sharafutdinov, Dinar and Kuskov, Stanislav and Protasov, Saian and Voropaev, Alexey},
journal={IEEE Access},
title={Lidar Annotation Is All You Need},
year={2023},
volume={11},
number={},
pages={135820-135830},
doi={10.1109/ACCESS.2023.3337995}
}
Comparison of predictions of three models on the Waymo Open and KITTI-360 datasets. From top to bottom: 2D only, projected 3D only, mix 2D + projected 3D
Road segmentation results (% of IoU) on the validation split of the "Waymo full" dataset for models trained on different ratios of 2D image masks and lidar-based ground truth
Lidar-based road ground truth for three setups, shown from top to bottom: Waymo Open Dataset (five proprietary lidar sensors), KITTI-360 (Velodyne HDL-64E lidar), proprietary dataset (two Robosense RS-Helios lidars).
Datasets should be formatted in the following structure:
dataset
├── images # a folder with images (.jpg)
│ ├── train
│ └── val
├── seg_masks # a folder with 2D segmentation ground truth masks (.png)
│ ├── train
│ └── val
├── seg_points # a folder with lidar road points that were projected to the 2D mask (.npy)
│ ├── train
│ └── val
└── seg_points_total # a folder with all lidar points that were projected to the 2D mask (.npy)
├── train
└── val
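A hedged sketch of how this layout could be read into a dataset class. Matching samples by file stem and the exact array contents of the `.npy` files are assumptions, not the repository's actual dataset implementation:

```python
from pathlib import Path

import cv2
import numpy as np
from torch.utils.data import Dataset

class LidarRoadDataset(Dataset):
    """Reads images, optional 2D masks, and projected lidar points for one split."""

    def __init__(self, root, split="train"):
        self.root = Path(root)
        self.split = split
        self.images = sorted((self.root / "images" / split).glob("*.jpg"))

    def __len__(self):
        return len(self.images)

    def __getitem__(self, idx):
        img_path = self.images[idx]
        stem = img_path.stem
        image = cv2.cvtColor(cv2.imread(str(img_path)), cv2.COLOR_BGR2RGB)
        # Dense 2D road mask (may be absent for lidar-only samples).
        mask_path = self.root / "seg_masks" / self.split / f"{stem}.png"
        mask = cv2.imread(str(mask_path), cv2.IMREAD_GRAYSCALE) if mask_path.exists() else None
        # Road points and all projected points, stored as .npy files.
        road_pts = np.load(self.root / "seg_points" / self.split / f"{stem}.npy")
        all_pts = np.load(self.root / "seg_points_total" / self.split / f"{stem}.npy")
        return {"image": image, "mask": mask, "road_points": road_pts, "all_points": all_pts}
```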
You can download the filtered Waymo dataset ("Waymo with intersection" in the paper) from the link, or filter the full dataset yourself using our script:
Download the training and validation folders of waymo-open-dataset (we used version 1.4.0). In Waymo, 2D segmentation ground truth and point cloud segmentation ground truth are provided separately and not for every image. We therefore created two datasets for the paper: "Waymo with intersection" and "Waymo full". The first contains only the original images for which 2D and lidar annotations intersect. The second is built separately from the 2D and lidar annotations and then combined. To filter and save the dataset, use these commands:
pip install -r lib/waymo_process/requirments.txt
python3 lib/waymo_process/create_2d3d_dataset.py {path_to_training_or_validation_folder_of_waymo_dataset} --subset={'val' or 'train'}
- --subset - name of the folder the data will be saved into ('train' or 'val')
- --lidar_data_only=True - save only reprojected point cloud points, for both the road (ground truth) and other classes (loss mask)
- --masks_only=True - save only 2D masks
- If neither --lidar_data_only nor --masks_only is set, you will get a dataset of images where 2D segmentation ground truth and point cloud segmentation ground truth intersect.
Note: the val subset needs both lidar data and 2D masks, so do not set --lidar_data_only or --masks_only when generating the val subset.
For "Waymo with intersection" you should get 1852 images in the train set and 315 images in the val set with both 2D masks of road and reprojected points for road and other classes.
We cannot provide a filtered KITTI-360 dataset due to its size, but you can prepare it yourself.
First, download the original KITTI-360 dataset from the official website. To convert KITTI-360 to our data format, use the following repositories:
- https://github.com/autonomousvision/kitti360Scripts - official KITTI-360 repo for working with the dataset
- https://github.com/JulesSanchez/recoverKITTI360label - an unofficial tool to recover per-scan lidar labels for KITTI-360
Using these two tools, you can obtain all the needed labels and images.
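The projection of labeled lidar points into the camera image is the standard pinhole step. Below is a hedged sketch of that step; the calibration matrices, point format, and function name are assumptions for illustration and may differ from how the linked KITTI-360 tools handle calibration:

```python
import numpy as np

def project_points_to_image(points_xyz, T_cam_from_lidar, K, image_shape):
    """Project lidar points (N, 3) into pixel coordinates of a camera image.

    T_cam_from_lidar -- 4x4 rigid transform from the lidar to the camera frame
    K                -- 3x3 camera intrinsics
    image_shape      -- (height, width), used to keep only in-image points
    """
    pts_h = np.hstack([points_xyz, np.ones((len(points_xyz), 1))])  # homogeneous (N, 4)
    cam = (T_cam_from_lidar @ pts_h.T).T[:, :3]                     # points in camera frame
    cam = cam[cam[:, 2] > 0]                                        # drop points behind the camera
    uv = (K @ cam.T).T
    uv = uv[:, :2] / uv[:, 2:3]                                     # perspective division
    h, w = image_shape
    inside = (uv[:, 0] >= 0) & (uv[:, 0] < w) & (uv[:, 1] >= 0) & (uv[:, 1] < h)
    return uv[inside].astype(np.int32)                              # pixel coordinates for the .npy files
```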
Build a container:
DOCKER_BUILDKIT=1 docker build --network host -t lidar_segm --target base_image --build-arg UID=1000 --build-arg GID=1000 --build-arg USERNAME={your username} .
Run the container:
docker run --net=host --userns=host --pid=host -itd --gpus all --name=lidar_segm --volume={path_to_lidar_data_2d_road_segmentation}:/lidar_data_2d_road_segmentation --volume={path_to_dataset}:/data/ --shm-size 15G --cpuset-cpus 0-7 lidar_segm
Attach to the container:
docker exec -it lidar_segm bash
Stop and remove the container:
docker container stop lidar_segm
docker container rm lidar_segm
Alternatively, you can use conda on Ubuntu 20.04 with Python 3.8.
conda env create -f environment.yml
Specify the path to the dataset (DATASET.PATH) and other training parameters in lib/config/waymo.py and then run the script:
python3 scripts/train.py
Setting DATASET.MASKS_ONLY = True in the config file trains the model using only 2D ground truth masks. Setting DATASET.LIDAR_DATA_ONLY = True trains it using only road ground truth projected from the point cloud. If both parameters are set to False (default), the model trains on a mix of these two types of ground truth. In all cases, the validation dataset requires 2D ground truth masks for metric calculation.
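Conceptually, these flags decide which ground truth drives the loss. The following is only an illustration of that behaviour; the real flags live in lib/config/waymo.py as DATASET.MASKS_ONLY and DATASET.LIDAR_DATA_ONLY, the actual training loop is in scripts/train.py, and the names here are assumed:

```python
def select_loss(sample, logits, dense_loss_fn, masked_loss_fn,
                masks_only=False, lidar_data_only=False):
    """Choose which ground truth drives the loss for one sample."""
    if masks_only:
        return dense_loss_fn(logits, sample["mask_2d"])
    if lidar_data_only:
        return masked_loss_fn(logits, sample["lidar_gt"], sample["lidar_mask"])
    # Mixed mode (both flags False): use the dense 2D mask when the sample has one,
    # otherwise fall back to the sparse lidar-projected ground truth.
    if sample.get("mask_2d") is not None:
        return dense_loss_fn(logits, sample["mask_2d"])
    return masked_loss_fn(logits, sample["lidar_gt"], sample["lidar_mask"])
```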
To test the model, specify the path to the dataset (DATASET.PATH) and other inference parameters in lib/config/waymo_inference.py and then run the script:
python scripts/test.py --weights {path to the .pth weights} --save_video
Unfortunately, the Waymo Open Dataset license prohibits publishing the weights of a model trained on it. However, the training experiments can easily be reproduced using our repository and the data provided.
Before contributing to this repository, we ask that you set up pre-commit hooks on your local machine. This ensures that your submissions adhere to our code quality standards and formatting guidelines.
- Install the pre-commit package using pip:
pip install pre-commit
- Navigate to the root directory of this repository.
- Install the pre-commit hook scripts into your .git/ directory:
pre-commit install
If you want to manually run the hooks on all files in the repository, you can do so with the following command:
pre-commit run --all-files
The project is distributed under the MIT License.