当前位置:网站首页>Target detection for long tail distribution -- balanced group softmax
Target detection for long tail distribution -- balanced group softmax
2022-07-02 07:57:00 【MezereonXP】
Deal with long tailed target detection – Balanced Group Softmax
List of articles
This time I will introduce an article CVPR2020 The article , Titled “Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax”, It mainly solves the problem of long tail data distribution in target detection , The solution is also very simple .
Long tailed data
First , Long tailed data exist widely , Here we use COCO and LVIS Take two data sets as examples , As shown in the figure below :
The abscissa is the index of the category , The ordinate is the number of samples in the corresponding category .
You can see , In these two data sets , There is an obvious long tail distribution .
Previous methods for dealing with long tail distribution
Here are some related works , Given by category :
- Resampling based on data (data re-sampling)
- Oversampling the tail data :Borderline-smote: a new over-sampling method in im- balanced data sets learning
- Delete the header data :class imbalance, and cost sensitivity: why under-sampling beats over sampling
- Sampling based on category balance :Exploring the limits of weakly supervised pretraining.
- Cost sensitive learning (cost- sensitive learning)
- Through to loss Adjustment , Give different weights to different categories
These methods are usually sensitive to hyperparameters , And poor performance when migrating to the detection framework ( The difference between classification task and detection task )
Balanced Group Softmax
Here is the specific framework of the algorithm :
As shown in the figure above , In the training phase , We will group the categories , Calculate separately in different groups Softmax, Then calculate the respective cross entropy error .
For grouping , The paper is given by 0,10,100,1000,+inf As a segmentation point
Here we need to add one for each group other Category , bring , When the target category is not in a group ,groundtruth Set to other.
The final error form is :
L k = − ∑ n = 0 N ∑ i ∈ G n y i n log ( p i n ) \mathcal{L}_k=-\sum_{n=0}^{N}\sum_{i\in \mathcal{G}_n}y_i^n\log (p_i^n) Lk=−n=0∑Ni∈Gn∑yinlog(pin)
among , N N N It's the number of groups , G n \mathcal{G}_n Gn It's No n n n Category collection of groups , p i n p_i^n pin Is the probability of model output , y i n y_i^n yin Is the label .
Effect evaluation
Here is a precision table for comprehensive comparison
AP The subscript of corresponds to the index of the divided group , You can see , Precision in the tail , That is to say A P 1 AP_1 AP1 and A C C 1 ACC_1 ACC1 It has reached SOTA Performance of .
边栏推荐
- [learning notes] numerical differentiation of back error propagation
- CONDA common commands
- Execution of procedures
- Thesis writing tip2
- Summary of solving the Jetson nano installation onnx error (error: failed building wheel for onnx)
- Traditional target detection notes 1__ Viola Jones
- Mmdetection model fine tuning
- Proof and understanding of pointnet principle
- 论文tips
- Installation and use of image data crawling tool Image Downloader
猜你喜欢
用于类别增量学习的动态可扩展表征 -- DER
Remplacer l'auto - attention par MLP
Where do you find the materials for those articles that have read 10000?
Installation and use of image data crawling tool Image Downloader
How do vision transformer work?【论文解读】
Hystrix dashboard cannot find hystrix Stream solution
用全连接层替代掉卷积 -- RepMLP
半监督之mixmatch
Use Baidu network disk to upload data to the server
【双目视觉】双目立体匹配
随机推荐
[learning notes] matlab self compiled Gaussian smoother +sobel operator derivation
浅谈深度学习模型中的后门
What if a new window always pops up when opening a folder on a laptop
Latex formula normal and italic
【Mixup】《Mixup:Beyond Empirical Risk Minimization》
How gensim freezes some word vectors for incremental training
Open3D学习笔记一【初窥门径,文件读取】
【BiSeNet】《BiSeNet:Bilateral Segmentation Network for Real-time Semantic Segmentation》
TimeCLR: A self-supervised contrastive learning framework for univariate time series representation
【Sparse-to-Dense】《Sparse-to-Dense:Depth Prediction from Sparse Depth Samples and a Single Image》
Meta Learning 简述
半监督之mixmatch
Proof and understanding of pointnet principle
Semi supervised mixpatch
包图画法注意规范
【Cutout】《Improved Regularization of Convolutional Neural Networks with Cutout》
Faster-ILOD、maskrcnn_ Benchmark installation process and problems encountered
[binocular vision] binocular correction
超时停靠视频生成
Traditional target detection notes 1__ Viola Jones