当前位置:网站首页>Target detection for long tail distribution -- balanced group softmax
Target detection for long tail distribution -- balanced group softmax
2022-07-02 07:57:00 【MezereonXP】
Deal with long tailed target detection – Balanced Group Softmax
List of articles
This time I will introduce an article CVPR2020 The article , Titled “Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax”, It mainly solves the problem of long tail data distribution in target detection , The solution is also very simple .
Long tailed data
First , Long tailed data exist widely , Here we use COCO and LVIS Take two data sets as examples , As shown in the figure below :

The abscissa is the index of the category , The ordinate is the number of samples in the corresponding category .
You can see , In these two data sets , There is an obvious long tail distribution .
Previous methods for dealing with long tail distribution
Here are some related works , Given by category :
- Resampling based on data (data re-sampling)
- Oversampling the tail data :Borderline-smote: a new over-sampling method in im- balanced data sets learning
- Delete the header data :class imbalance, and cost sensitivity: why under-sampling beats over sampling
- Sampling based on category balance :Exploring the limits of weakly supervised pretraining.
- Cost sensitive learning (cost- sensitive learning)
- Through to loss Adjustment , Give different weights to different categories
These methods are usually sensitive to hyperparameters , And poor performance when migrating to the detection framework ( The difference between classification task and detection task )
Balanced Group Softmax
Here is the specific framework of the algorithm :

As shown in the figure above , In the training phase , We will group the categories , Calculate separately in different groups Softmax, Then calculate the respective cross entropy error .
For grouping , The paper is given by 0,10,100,1000,+inf As a segmentation point
Here we need to add one for each group other Category , bring , When the target category is not in a group ,groundtruth Set to other.
The final error form is :
L k = − ∑ n = 0 N ∑ i ∈ G n y i n log ( p i n ) \mathcal{L}_k=-\sum_{n=0}^{N}\sum_{i\in \mathcal{G}_n}y_i^n\log (p_i^n) Lk=−n=0∑Ni∈Gn∑yinlog(pin)
among , N N N It's the number of groups , G n \mathcal{G}_n Gn It's No n n n Category collection of groups , p i n p_i^n pin Is the probability of model output , y i n y_i^n yin Is the label .
Effect evaluation
Here is a precision table for comprehensive comparison

AP The subscript of corresponds to the index of the divided group , You can see , Precision in the tail , That is to say A P 1 AP_1 AP1 and A C C 1 ACC_1 ACC1 It has reached SOTA Performance of .
边栏推荐
- How do vision transformer work? [interpretation of the paper]
- 联邦学习下的数据逆向攻击 -- GradInversion
- Latex formula normal and italic
- Eklavya -- infer the parameters of functions in binary files using neural network
- 用MLP代替掉Self-Attention
- Graph Pooling 简析
- 【Random Erasing】《Random Erasing Data Augmentation》
- Open3d learning note 5 [rgbd fusion]
- 【Sparse-to-Dense】《Sparse-to-Dense:Depth Prediction from Sparse Depth Samples and a Single Image》
- 应对长尾分布的目标检测 -- Balanced Group Softmax
猜你喜欢

【双目视觉】双目矫正

TimeCLR: A self-supervised contrastive learning framework for univariate time series representation
![[Sparse to Dense] Sparse to Dense: Depth Prediction from Sparse Depth samples and a Single Image](/img/05/bf131a9e2716c9147a5473db4d0a5b.png)
[Sparse to Dense] Sparse to Dense: Depth Prediction from Sparse Depth samples and a Single Image

【Mixup】《Mixup:Beyond Empirical Risk Minimization》

【Cutout】《Improved Regularization of Convolutional Neural Networks with Cutout》
![[in depth learning series (8)]: principles of transform and actual combat](/img/2e/89920de2273b6f1bc3b21a19c2ecbe.png)
[in depth learning series (8)]: principles of transform and actual combat
![[CVPR‘22 Oral2] TAN: Temporal Alignment Networks for Long-term Video](/img/bc/c54f1f12867dc22592cadd5a43df60.png)
[CVPR‘22 Oral2] TAN: Temporal Alignment Networks for Long-term Video

论文写作tip2

【FastDepth】《FastDepth:Fast Monocular Depth Estimation on Embedded Systems》

Embedding malware into neural networks
随机推荐
[learning notes] matlab self compiled image convolution function
Thesis tips
【Mixed Pooling】《Mixed Pooling for Convolutional Neural Networks》
Replace convolution with full connection layer -- repmlp
CPU register
程序的内存模型
Implementation of yolov5 single image detection based on onnxruntime
Remplacer l'auto - attention par MLP
用于类别增量学习的动态可扩展表征 -- DER
【TCDCN】《Facial landmark detection by deep multi-task learning》
将恶意软件嵌入到神经网络中
【Programming】
ABM thesis translation
Embedding malware into neural networks
[learning notes] numerical differentiation of back error propagation
How do vision transformer work? [interpretation of the paper]
Ppt skills
【学习笔记】反向误差传播之数值微分
【Hide-and-Seek】《Hide-and-Seek: A Data Augmentation Technique for Weakly-Supervised Localization xxx》
【Mixed Pooling】《Mixed Pooling for Convolutional Neural Networks》