MonoRCNN is a monocular 3D object detection method for autonomous driving

Last update: Dec 27, 2022

Overview

MonoRCNN

MonoRCNN is a monocular 3D object detection method for autonomous driving, published at ICCV 2021. This project is an implementation of MonoRCNN.

Visualization

Methodology

Installation

Python 3.6
PyTorch 1.5.0
Detectron2 0.1.3

Please use the Detectron2 included in this project rather than a separately installed copy. Its build.py, rpn.py, and roi_heads.py have been modified so that fully occluded objects are ignored during training.
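
As a quick sanity check of the environment, the versions listed above can be compared against what is actually installed. The following is a minimal sketch and not part of the project: the helper names `installed_versions` and `version_mismatches` are hypothetical, and it assumes that the `torch` and `detectron2` modules expose a `__version__` attribute.

```python
import importlib
import platform

# Versions listed in the Installation section.
REQUIRED = {"python": "3.6", "torch": "1.5.0", "detectron2": "0.1.3"}


def installed_versions():
    """Collect installed versions; a package that cannot be imported maps to None."""
    found = {"python": platform.python_version()}
    for name in ("torch", "detectron2"):
        try:
            module = importlib.import_module(name)
            # Assumption: the module exposes __version__; otherwise report None.
            found[name] = getattr(module, "__version__", None)
        except ImportError:
            found[name] = None
    return found


def version_mismatches(found, required=REQUIRED):
    """Return {name: (required, found)} for every entry that does not match.

    A found version matches if it equals the required one or extends it with
    a further component or a local build tag (for example "1.5.0+cu101").
    """
    mismatches = {}
    for name, want in required.items():
        have = found.get(name)
        ok = have is not None and (
            have == want
            or have.startswith(want + ".")
            or have.startswith(want + "+")
        )
        if not ok:
            mismatches[name] = (want, have)
    return mismatches


if __name__ == "__main__":
    for name, (want, have) in version_mismatches(installed_versions()).items():
        print(f"{name}: expected {want}, found {have}")
```

Run as a script, this prints one line per mismatch and nothing when the environment matches. It only reads version strings, so it cannot tell whether the imported Detectron2 is the modified copy shipped with this project.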

Dataset Preparation

KITTI

Model & Log

KITTI val1 split

Organize the downloaded files as follows:

├── projects
│   ├── MonoRCNN
│   │   ├── output
│   │   │   ├── model
│   │   │   ├── log.txt
│   │   │   ├── ...
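
Before running the test command, it can help to confirm that the downloaded files are where the layout above expects them. This is a minimal sketch under stated assumptions: `missing_entries` is a hypothetical helper, the paths are taken relative to the repository root, and only the two entries named explicitly in the tree are checked. It tests for existence only, since the layout does not say whether `model` is a file or a directory.

```python
from pathlib import Path

# Entries shown explicitly in the layout above, relative to the repository root.
EXPECTED = (
    "projects/MonoRCNN/output/model",
    "projects/MonoRCNN/output/log.txt",
)


def missing_entries(repo_root, expected=EXPECTED):
    """Return the expected relative paths that do not exist under repo_root."""
    root = Path(repo_root)
    return [rel for rel in expected if not (root / rel).exists()]


if __name__ == "__main__":
    missing = missing_entries(".")
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("Layout looks complete.")
```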

Test

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1 --resume --eval-only

Set VISUALIZE to True to visualize the 3D object detection results, which are saved in output/evaluation/test/visualization.

Training

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1
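
The Test and Training commands differ only in the `--resume --eval-only` flags. For scripted runs, that difference can be captured in a small wrapper. This is a sketch rather than part of the project: `build_command` and `run` are hypothetical helpers, and the wrapper assumes it is started from the repository root on a POSIX system where `main.py` is executable.

```python
import subprocess

CONFIG = "config/MonoRCNN_KITTI.yaml"


def build_command(eval_only=False, num_gpus=1, config=CONFIG):
    """Assemble the main.py invocation shown in the Test and Training sections."""
    cmd = ["./main.py", "--config-file", config, "--num-gpus", str(num_gpus)]
    if eval_only:
        # Testing adds the two flags from the Test section.
        cmd += ["--resume", "--eval-only"]
    return cmd


def run(eval_only=False, num_gpus=1, cwd="projects/MonoRCNN"):
    """Run the command from the project directory; raises if it exits non-zero."""
    return subprocess.run(build_command(eval_only, num_gpus), cwd=cwd, check=True)
```

Calling `run(eval_only=True)` corresponds to the Test command and `run()` to the Training command.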

Citation

If you find this project useful in your research, please cite:

@inproceedings{MonoRCNN_ICCV21,
    title = {Geometry-based Distance Decomposition for Monocular 3D Object Detection},
    author = {Xuepeng Shi and Qi Ye and 
              Xiaozhi Chen and Chuangrong Chen and 
              Zhixiang Chen and Tae-Kyun Kim},
    booktitle = {ICCV},
    year = {2021},
}

