Multispectral Object Detection with Yolov5

Last update: Jan 01, 2023

Overview

Multispectral-Object-Detection

Intro

Official Code for Cross-Modality Fusion Transformer for Multispectral Object Detection.

Multispectral Object Detection with Transformer and Yolov5

Citation

If you use this repo for your research, please cite our paper:

@article{fang2021cross,
  title={Cross-Modality Fusion Transformer for Multispectral Object Detection},
  author={Fang Qingyun and Han Dapeng and Wang Zhaokui},
  journal={arXiv preprint arXiv:2111.00273},
  year={2021}
}

Installation

Python>=3.6.0 is required with all requirements.txt installed including PyTorch>=1.7 (The same as yolov5 https://github.com/ultralytics/yolov5 ).

Clone the repo

git clone https://github.com/DocF/multispectral-object-detection

Install requirements

$ cd  multispectral-object-detection
$ pip install -r requirements.txt

Dataset

-[FLIR] download A new aligned version.

-[LLVIP] download

-[VEDAI] download

Run

Download the pretrained weights

yolov5 weights:

CFT weights:

Add the some file

create runs/train, runs/test and runs/detect three files for save the results.

Change the data cfg

some example in data/multispectral/

Train Test and Detect

train: python train.py

test: python test.py

detect: python detect_twostream.py

Results

Dataset	CFT	mAP50	mAP75	mAP
FLIR		73.0	32.0	37.4
FLIR	✔️	77.7 (Δ4.7)	34.8 (Δ2.8)	40.0 (Δ2.6)
LLVIP		95.8	71.4	62.3
LLVIP	✔️	97.5 (Δ1.7)	72.9 (Δ1.5)	63.6 (Δ1.3)
VEDAI		79.7	47.7	46.8
VEDAI	✔️	85.3 (Δ5.6)	65.9(Δ18.2)	56.0 (Δ9.2)

Multispectral Object Detection with Yolov5

Related tags

Overview

Multispectral-Object-Detection

Intro

Citation

Installation

Clone the repo

Install requirements

Dataset

Run

Download the pretrained weights

Add the some file

Change the data cfg

Train Test and Detect

Results

Owner

Richard Fang

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

This is a work in progress reimplementation of Instant Neural Graphics Primitives

Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer"

PyTorch implementation of Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

Pytorch code for "DPFM: Deep Partial Functional Maps" - 3DV 2021 (Oral)

Sequence-tagging using deep learning

Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking

Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation.

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

Cross-Document Coreference Resolution

Code for Greedy Gradient Ensemble for Visual Question Answering （ICCV 2021, Oral）

Official Repo of my work for SREC Nandyal Machine Learning Bootcamp

[ICCV 2021] Released code for Causal Attention for Unbiased Visual Recognition

Code for our paper "MG-GAN: A Multi-Generator Model Preventing Out-of-Distribution Samples in Pedestrian Trajectory Prediction" published at ICCV 2021.

Differential fuzzing for the masses!

Zero-Shot Text-to-Image Generation VQGAN+CLIP Dockerized

PyTorch implementation of Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation.

Active window border replacement for window managers.

Making Structure-from-Motion (COLMAP) more robust to symmetries and duplicated structures

Locally cache assets that are normally streamed in POPULATION: ONE