Object detection for long-tailed distributions: Balanced Group Softmax
2022-07-02 07:57:00 【MezereonXP】
Handling long-tailed object detection: Balanced Group Softmax
This post introduces a CVPR 2020 paper titled "Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax". It addresses the problem of long-tailed data distributions in object detection, and its solution is quite simple.
Long-tailed data
Long-tailed data is widespread. Take the COCO and LVIS datasets as two examples, as shown in the figure below:
The horizontal axis is the category index, and the vertical axis is the number of samples in that category.
Both datasets show a clearly long-tailed distribution.
Previous methods for handling long-tailed distributions
Some related work, grouped by category:
- Data re-sampling
  - Over-sampling the tail data: Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning
  - Under-sampling (removing) the head data: class imbalance, and cost sensitivity: why under-sampling beats over-sampling
  - Class-balanced sampling: Exploring the limits of weakly supervised pretraining.
- Cost-sensitive learning
  - Adjusting the loss so that different categories receive different weights
These methods are usually sensitive to hyperparameters, and they perform poorly when transferred to detection frameworks (owing to the differences between classification and detection tasks).
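To make the cost-sensitive idea concrete, here is a minimal sketch of one common weighting scheme, inverse class frequency. It is not taken from the paper; the function names and the toy counts are assumptions made for illustration.

```python
import math


def inverse_frequency_weights(class_counts):
    """Weight each class by total / (num_classes * count).

    Hypothetical helper: rare classes get weights above 1,
    frequent classes get weights below 1.
    """
    total = sum(class_counts.values())
    num_classes = len(class_counts)
    return {c: total / (num_classes * n) for c, n in class_counts.items()}


def weighted_cross_entropy(probs, target, weights):
    """Cross-entropy for one sample, scaled by the weight of its true class."""
    return -weights[target] * math.log(probs[target])


# Toy example: class "b" is nine times rarer than class "a".
weights = inverse_frequency_weights({"a": 90, "b": 10})
loss_b = weighted_cross_entropy({"a": 0.5, "b": 0.5}, "b", weights)
```

The weights depend directly on the class counts, so a different weighting rule or a different dataset changes how hard the tail classes are pushed.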
Balanced Group Softmax
The overall framework of the algorithm is shown below:
As the figure above shows, during training the categories are divided into groups, Softmax is computed separately within each group, and a cross-entropy loss is then computed for each group.
For the grouping, the paper uses 0, 10, 100, 1000, and +inf as the split points on the number of training samples per category.
An extra "others" category is added to each group, so that when the target category does not belong to a group, that group's ground truth is set to others.
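The grouping and the per-group labels can be sketched as follows. This is an illustrative sketch, not the authors' code: the helper names and the example class counts are hypothetical, and the bins are assumed to be half-open intervals such as [10, 100), numbered from 1.

```python
import bisect

# Split points quoted above; the assumed bins are
# [0, 10), [10, 100), [100, 1000), [1000, +inf).
SPLIT_POINTS = [0, 10, 100, 1000, float("inf")]


def assign_group(instance_count, split_points=SPLIT_POINTS):
    """Return the 1-based index of the bin that contains instance_count."""
    return bisect.bisect_right(split_points, instance_count)


def build_groups(class_counts):
    """Map each group index to the list of class names that fall into it."""
    groups = {}
    for name, count in class_counts.items():
        groups.setdefault(assign_group(count), []).append(name)
    return groups


def group_targets(groups, true_class):
    """Per-group label: the true class if the group contains it, else 'others'."""
    return {
        g: (true_class if true_class in members else "others")
        for g, members in groups.items()
    }


# Hypothetical per-class training sample counts.
counts = {"person": 5000, "dog": 300, "kite": 42, "unicycle": 7}
groups = build_groups(counts)
targets = group_targets(groups, "dog")
```

With these counts each class lands in its own group, and for a "dog" sample every group except the one containing "dog" is labelled "others".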
The final loss has the form:
$$\mathcal{L}_k=-\sum_{n=0}^{N}\sum_{i\in \mathcal{G}_n}y_i^n\log (p_i^n)$$
where $N$ is the number of groups, $\mathcal{G}_n$ is the set of categories in the $n$-th group, $p_i^n$ is the probability output by the model, and $y_i^n$ is the label.
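As a rough sketch of how this loss could be evaluated for a single sample: softmax is applied inside each group, and the negative log-probability of each group's ground-truth entry is summed. With one-hot labels the inner sum over $i$ reduces to that single term. The function names and the layout of logits as one list per group are assumptions for illustration, not the paper's implementation.

```python
import math


def softmax(logits):
    """Numerically stable softmax over one group's logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def group_softmax_loss(group_logits, group_target_indices):
    """Sum of per-group cross-entropy terms for one sample.

    group_logits: one list of logits per group (assumed layout).
    group_target_indices: for each group, the position of its ground-truth
        entry, which is the 'others' slot when the true class is elsewhere.
    """
    loss = 0.0
    for logits, target in zip(group_logits, group_target_indices):
        probs = softmax(logits)
        loss -= math.log(probs[target])  # one-hot y picks a single term
    return loss


# Two hypothetical groups with uniform logits: the loss is log(2) + log(3).
example_loss = group_softmax_loss([[0.0, 0.0], [0.0, 0.0, 0.0]], [0, 2])
```

Because each group is normalised separately, the logits of head classes never compete directly with those of tail classes inside one softmax.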
Evaluation
The following table gives a comprehensive comparison of accuracy:
The subscript of AP is the index of the corresponding group. As the table shows, the method reaches SOTA performance on the tail categories, that is, on $AP_1$ and $ACC_1$.