This repository contains the implementation of the paper Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Last update: Dec 01, 2022

Related tags

Deep Learning contrastive_association

Overview

Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

This repository contains the implementation of the paper Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans.

The approach builds on top of an arbitrary single-scan Panoptic Segmentation network and extends it to the temporal domain by associating instances across time using our Contrastive Aggregation network that leverages the point-wise features from the panoptic network.

Requirements

Install this package: go to the root directory of this repo and run:

pip3 install -U -e .

Install packages in requirements.txt.
Install MinkowskiEngine.
Install spconv version 1.2.1.

Data preparation

Download the SemanticKITTI dataset inside the directory data/kitti/. The directory structure should look like this:

./
└── data/
    └── kitti
        └── sequences
            ├── 00/           
            │   ├── velodyne/	
            |   |	├── 000000.bin
            |   |	├── 000001.bin
            |   |	└── ...
            │   └── labels/ 
            |       ├── 000000.label
            |       ├── 000001.label
            |       └── ...
            ├── 08/ # for validation
            ├── 11/ # 11-21 for testing
            └── 21/
                └── ...

Pretrained models

Pretrained Panoptic Segmentation model.
Pretrained Contrastive Aggregation model.

Reproducing the results

Run the evaluation script, which will compute the metrics for the validation set:

python evaluate_4dpanoptic.py --ckpt_ps path/to/panoptic_weights --ckpt_ag path/to/aggregation_weights

Training

Create instances dataset

Since we use a frozen Panoptic Segmentation Network, to avoid running the forward pass during training, we save the instance predictions and the point features in advance running:

python save_panoptic_features.py --ckpt path/to/panoptic_weights

This will create a directory in cont_assoc/data/instance_features with the same structure as Kitti but containing, for each sequence of the train set, npy files containing the instance points, labels and features for each scan.

Save validation predictions

To get the 4D Panoptic Segmentation performance for the validation step during training, we save the full predictions for the validation set (sequence 08) running:

python save_panoptic_features.py --ckpt path/to/panoptic_weights --save_val_pred

This will create a directory in cont_assoc/data/validation_predictions with npy files for each scan of the validation sequence containing the semantic and instance predictions for each point.

Train Contrastive Aggregation Network

Once the instance dataset and the validation predictions are generated, we're ready to train the Contrastive Aggregation Network running:

python train_aggregation.py

All the configurations are in the config/contrastive_instances.yaml file.

Citation

If you use this repo, please cite as :

@article{marcuzzi2022ral,
  author = {Rodrigo Marcuzzi and Lucas Nunes and Louis Wiesmann and Ignacio Vizzo and Jens Behley and Cyrill Stachniss},
  title = {{Contrastive Instance Association for 4D Panoptic Segmentation \\ using Sequences of 3D LiDAR Scans}},
  journal = {IEEE Robotics and Automation Letters (RA-L)},
  year = 2022,
  volume={7},
  number={2},
  pages={1550-1557},
}

Acknowledgments

The Panoptic Segmentation Network used in this repo is DS-Net.

The loss function it's a modified version of SupContrast.

License

This project is free software made available under the MIT License. For details see the LICENSE file.

This repository contains the implementation of the paper Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Related tags

Overview

Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Requirements

Data preparation

Pretrained models

Reproducing the results

Training

Create instances dataset

Save validation predictions

Train Contrastive Aggregation Network

Citation

Acknowledgments

License

Owner

Photogrammetry & Robotics Bonn

Author Disambiguation using Knowledge Graph Embeddings with Literals

PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning

Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"

[CVPR 2021] "Multimodal Motion Prediction with Stacked Transformers": official code implementation and project page.

Config files for my GitHub profile.

Source code for "OmniPhotos: Casual 360° VR Photography"

A script helps the user to update Linux and Mac systems through the terminal

SciPy fixes and extensions

MemStream: Memory-Based Anomaly Detection in Multi-Aspect Streams with Concept Drift

The (Official) PyTorch Implementation of the paper "Deep Extraction of Manga Structural Lines"

Person Re-identification

Fuzzing tool (TFuzz): a fuzzing tool based on program transformation

Geneva is an artificial intelligence tool that defeats censorship by exploiting bugs in censors

FedJAX is a library for developing custom Federated Learning (FL) algorithms in JAX.

Implementation for the "Surface Reconstruction from 3D Line Segments" paper.

SalGAN: Visual Saliency Prediction with Generative Adversarial Networks

Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"

Romanian Automatic Speech Recognition from the ROBIN project

Reimplement of SimSwap training code

Conversion between units used in magnetism