Multiple-Object Tracking with Transformer

Last update: Jan 04, 2023

Overview

TransTrack: Multiple-Object Tracking with Transformer

Introduction

Paper: TransTrack: Multiple-Object Tracking with Transformer (arXiv:2012.15460)

Models

Training data	Training time	Validation MOTA	download
crowdhuman, mot_half	36h + 1h	65.4	model
crowdhuman	36h	53.8	model
mot_half	8h	61.6	model

Models are also available on Baidu Drive with the code m4iv.
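
The Validation MOTA column reports Multiple Object Tracking Accuracy. As a reminder of what that number measures, here is a minimal sketch of the standard CLEAR MOT definition. The function name and the example counts are hypothetical and are not TransTrack results; the repository's own evaluation scripts remain the reference for the values in the table.

```python
def mota(false_negatives: int, false_positives: int, id_switches: int, num_gt: int) -> float:
    """Return MOTA = 1 - (FN + FP + IDSW) / GT, with counts summed over all frames."""
    if num_gt <= 0:
        raise ValueError("num_gt must be positive")
    errors = false_negatives + false_positives + id_switches
    return 1.0 - errors / num_gt


# Hypothetical counts, for illustration only.
example = mota(false_negatives=250, false_positives=80, id_switches=20, num_gt=1000)
print(f"MOTA = {100 * example:.1f}")
```

Because all three error types share one denominator, MOTA can go negative when the tracker makes more errors than there are ground-truth boxes.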

Notes

The CrowdHuman-trained model and the MOT-trained model are evaluated with different command lines; see Steps.
We observe noise of about 1 MOTA in the results.
If the MOTA of your self-trained model is lower than expected, tuning --track_thresh sometimes gives better performance.
Training times are measured on 8 NVIDIA V100 GPUs with a batch size of 16.
We use models pre-trained on ImageNet.
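
To picture why --track_thresh can move MOTA, it helps to think of a score threshold as a filter on candidate boxes: raising it removes low-confidence boxes (fewer false positives) at the risk of dropping real objects (more misses). The sketch below assumes the flag acts as a minimum confidence score; the function, the box tuples, and the scores are hypothetical and do not reproduce how TransTrack applies the flag internally.

```python
from typing import List, Tuple

# (left, top, width, height, score); a hypothetical box layout for illustration.
Box = Tuple[float, float, float, float, float]


def keep_confident(boxes: List[Box], thresh: float) -> List[Box]:
    """Keep only boxes whose confidence score is at least `thresh`."""
    return [box for box in boxes if box[4] >= thresh]


# Made-up candidates with decreasing confidence.
candidates: List[Box] = [
    (10.0, 20.0, 50.0, 120.0, 0.92),
    (200.0, 35.0, 48.0, 115.0, 0.45),
    (330.0, 40.0, 52.0, 118.0, 0.35),
    (400.0, 60.0, 30.0, 70.0, 0.10),
]

for thresh in (0.3, 0.4, 0.5):
    kept = keep_confident(candidates, thresh)
    print(f"thresh={thresh}: kept {len(kept)} of {len(candidates)} boxes")
```

In practice, a sweep like this would be run by re-evaluating the model with several --track_thresh values and comparing the resulting MOTA.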

Demo

Installation

The codebase is built on top of Deformable DETR and CenterTrack.

Requirements

Linux, CUDA>=9.2, GCC>=5.4
Python>=3.7
PyTorch>=1.5 and a torchvision version that matches the PyTorch installation. Installing them together from pytorch.org ensures they match.
OpenCV is optional; it is needed only for the demo and visualization.
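
Before building the ops, it can save time to confirm that the Python-side requirements are met. The following is a minimal sketch, not part of the repository: the helper names are hypothetical, and it checks only the Python and PyTorch versions and whether torchvision and OpenCV import. It does not verify the CUDA or GCC versions.

```python
import importlib
import re
import sys
from typing import Dict, Optional, Tuple


def parse_major_minor(version: str) -> Tuple[int, int]:
    """Extract (major, minor) from strings such as '1.5.0+cu101'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def installed_version(module_name: str) -> Optional[str]:
    """Return the module's __version__, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except Exception:  # a broken install can raise more than ImportError
        return None
    return getattr(module, "__version__", None)


def check_environment() -> Dict[str, bool]:
    """Compare the current environment with the requirements listed above."""
    torch_version = installed_version("torch")
    return {
        "python>=3.7": sys.version_info >= (3, 7),
        "torch>=1.5": torch_version is not None
        and parse_major_minor(torch_version) >= (1, 5),
        "torchvision importable": installed_version("torchvision") is not None,
        "opencv importable (optional)": installed_version("cv2") is not None,
    }


if __name__ == "__main__":
    for requirement, ok in check_environment().items():
        print(f"{requirement}: {'yes' if ok else 'no'}")
```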

Steps

Install and build libs

git clone https://github.com/PeizeSun/TransTrack.git
cd TransTrack
cd models/ops
python setup.py build install
cd ../..
pip install -r requirements.txt

Prepare dataset

mkdir -p crowdhuman/annotations
cp -r /path_to_crowdhuman_dataset/annotations/CrowdHuman_val.json crowdhuman/annotations/CrowdHuman_val.json
cp -r /path_to_crowdhuman_dataset/annotations/CrowdHuman_train.json crowdhuman/annotations/CrowdHuman_train.json
cp -r /path_to_crowdhuman_dataset/CrowdHuman_train crowdhuman/CrowdHuman_train
cp -r /path_to_crowdhuman_dataset/CrowdHuman_val crowdhuman/CrowdHuman_val
mkdir mot
cp -r /path_to_mot_dataset/train mot/train
cp -r /path_to_mot_dataset/test mot/test
python track_tools/convert_mot_to_coco.py

The CrowdHuman dataset is available from CrowdHuman. We provide annotations in JSON format.

The MOT dataset is available from MOT.
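
A quick way to catch a mistyped copy before training is to check that the destination paths used in the commands above exist. This is a small sketch rather than a repository tool: the function name is hypothetical, and the path list only mirrors the copy targets shown in this section, not any files that convert_mot_to_coco.py may generate.

```python
from pathlib import Path
from typing import List

# Destination paths taken from the "Prepare dataset" commands.
EXPECTED_PATHS = [
    "crowdhuman/annotations/CrowdHuman_val.json",
    "crowdhuman/annotations/CrowdHuman_train.json",
    "crowdhuman/CrowdHuman_train",
    "crowdhuman/CrowdHuman_val",
    "mot/train",
    "mot/test",
]


def missing_paths(root: str = ".") -> List[str]:
    """Return the expected paths that do not exist under `root`."""
    base = Path(root)
    return [path for path in EXPECTED_PATHS if not (base / path).exists()]


if __name__ == "__main__":
    missing = missing_paths()
    if missing:
        print("Missing dataset paths:")
        for path in missing:
            print(f"  {path}")
    else:
        print("All expected dataset paths are present.")
```

Run it from the TransTrack directory, or pass the repository root to missing_paths explicitly.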

Pre-train on crowdhuman

sh track_exps/crowdhuman_train.sh
python track_tools/crowdhuman_model_to_mot.py

The pre-trained model is available as crowdhuman_final.pth.

Train TransTrack

sh track_exps/crowdhuman_mot_trainhalf.sh

Evaluate TransTrack

sh track_exps/mot_val.sh
sh track_exps/mot_eval.sh

Visualize TransTrack

python track_tools/txt2video.py
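
When inspecting tracking results by hand, it can help to group the text output by frame. The sketch below assumes the results use the common MOTChallenge comma-separated layout (frame, track id, left, top, width, height, followed by extra fields); that layout is an assumption here, so check the files your run produces. The function name and the sample rows are hypothetical.

```python
import csv
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# (track_id, left, top, width, height)
Track = Tuple[int, float, float, float, float]


def group_by_frame(lines: Iterable[str]) -> Dict[int, List[Track]]:
    """Parse MOTChallenge-style rows and collect the boxes of each frame."""
    frames: Dict[int, List[Track]] = defaultdict(list)
    for row in csv.reader(lines):
        if len(row) < 6:  # skip blank or truncated rows
            continue
        frame = int(float(row[0]))
        track_id = int(float(row[1]))
        left, top, width, height = (float(value) for value in row[2:6])
        frames[frame].append((track_id, left, top, width, height))
    return dict(frames)


# Made-up rows in the assumed layout, for illustration only.
sample = [
    "1,1,100.0,200.0,50.0,120.0,-1,-1,-1,-1",
    "1,2,300.0,180.0,45.0,110.0,-1,-1,-1,-1",
    "2,1,104.0,201.0,50.0,120.0,-1,-1,-1,-1",
]
by_frame = group_by_frame(sample)
for frame in sorted(by_frame):
    print(f"frame {frame}: track ids {[t[0] for t in by_frame[frame]]}")
```

To read a real file, pass an open file object instead of the sample list, since csv.reader accepts any iterable of lines.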

Notes

Evaluate pre-trained CrowdHuman model on MOT

sh track_exps/det_val.sh
sh track_exps/mot_eval.sh

License

TransTrack is released under the MIT License.

Citing

If you use TransTrack in your research or wish to refer to the baseline results published here, please use the following BibTeX entry:

@article{transtrack,
  title   =  {TransTrack: Multiple-Object Tracking with Transformer},
  author  =  {Peize Sun and Yi Jiang and Rufeng Zhang and Enze Xie and Jinkun Cao and Xinting Hu and Tao Kong and Zehuan Yuan and Changhu Wang and Ping Luo},
  journal =  {arXiv preprint arXiv:2012.15460},
  year    =  {2020}
}

Owner

Peize Sun
