BBox regression loss functions in object detection: L2, Smooth L1, IoU, GIoU, DIoU, CIoU, Focal-EIoU, Alpha-IoU, SIoU
2022-07-07 05:53:00 【cartes1us】
Object detection involves two tasks: classification and location regression. This post summarizes the classic location regression loss functions, in chronological order of publication.
L1, L2, and Smooth L1 loss
The paper that proposed Smooth L1 loss: Fast R-CNN (Girshick, 2015).


L1 loss is not differentiable at its minimum, so training may oscillate around the optimum instead of converging to it. L2 loss has a gradient that grows with the error, so outliers can easily cause exploding gradients. Smooth L1 combines the advantages of both: it is quadratic near zero and linear for large errors.
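To make the comparison concrete, here is a minimal pure-Python sketch of Smooth L1 on a single scalar error. The `beta` parameter (the point where the loss switches from quadratic to linear) is an assumption added for illustration; with `beta=1.0` it reduces to the usual piecewise definition.

```python
def smooth_l1_loss(x, beta=1.0):
    """Quadratic for |x| < beta, linear otherwise (beta is an illustrative knob)."""
    ax = abs(x)
    if ax < beta:
        return 0.5 * ax * ax / beta
    return ax - 0.5 * beta


def smooth_l1_grad(x, beta=1.0):
    """Derivative of smooth_l1_loss with respect to x."""
    if abs(x) < beta:
        return x / beta              # shrinks to 0 near the optimum, unlike L1
    return 1.0 if x > 0 else -1.0    # bounded for outliers, unlike L2


for err in (0.0, 0.1, 0.5, 1.0, 10.0):
    print(err, smooth_l1_loss(err), smooth_l1_grad(err))
```

The gradient goes smoothly to zero for small errors (no oscillation around the optimum) and is capped at magnitude 1 for large errors (no explosion at outliers).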
IoU loss
The paper that proposed IoU loss: UnitBox: An Advanced Object Detection Network (Yu et al., 2016).
Neither L2 nor Smooth L1 loss accounts for the correlation among the four coordinates of a box, and neither is scale invariant. This is a serious shortcoming. When two prediction/GT pairs have the same IoU, the pair at the larger scale gets the higher loss. Conversely, as in the figure below, when the loss is computed from the lower-left and upper-right corners, predictions can have the same L2 loss but different IoU.
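Both claims can be checked numerically. The sketch below assumes boxes in `(x1, y1, x2, y2)` corner format; the helper names and the example boxes are made up for illustration.

```python
def iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union


def l2(a, b):
    """Sum of squared coordinate differences over the two corners."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def scale(box, s):
    return tuple(s * v for v in box)


gt = (0.0, 0.0, 2.0, 2.0)

# Claim 1: same IoU, larger scale -> larger L2 loss.
pred = (1.0, 1.0, 3.0, 3.0)
print(iou(gt, pred), l2(gt, pred))
print(iou(scale(gt, 10), scale(pred, 10)), l2(scale(gt, 10), scale(pred, 10)))

# Claim 2: same L2 loss, different IoU.
shifted = (1.0, 0.0, 3.0, 2.0)   # moved right by 1
widened = (-1.0, 0.0, 3.0, 2.0)  # grown by 1 on both sides
print(l2(gt, shifted), iou(gt, shifted))
print(l2(gt, widened), iou(gt, widened))
```

Scaling both boxes by 10 leaves the IoU unchanged but multiplies the L2 loss by 100, while the shifted and widened boxes share an L2 loss of 2 with IoU of 1/3 and 1/2 respectively.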
IoU loss comes in two forms; the latter is more commonly used:
$$L_{IoU} = -\ln(IoU)$$
$$L_{IoU} = 1 - IoU$$
This way, the evaluation metric and the optimization objective of BBox regression are unified.
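A minimal sketch of both forms, assuming `(x1, y1, x2, y2)` boxes with positive area. The `form` switch and the small `eps` inside the logarithm (to avoid `log(0)` for disjoint boxes) are illustrative choices, not part of the formulas above.

```python
import math


def iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union


def iou_loss(pred, gt, form="linear", eps=1e-9):
    """form='log' gives -ln(IoU); form='linear' gives 1 - IoU."""
    value = iou(pred, gt)
    if form == "log":
        return -math.log(value + eps)
    return 1.0 - value


gt = (0.0, 0.0, 2.0, 2.0)
print(iou_loss((0.0, 0.0, 2.0, 2.0), gt))          # perfect match
print(iou_loss((1.0, 0.0, 3.0, 2.0), gt))          # partial overlap
print(iou_loss((1.0, 0.0, 3.0, 2.0), gt, "log"))   # same boxes, log form
```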
GIoU loss
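The paper that proposed GIoU loss: Generalized Intersection over Union: A Metric and a Loss for Bounding Box Regression (Rezatofighi et al., 2019).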
The biggest drawback of IoU loss is that when the two boxes do not intersect, IoU is always 0 and the loss is constantly 1, so it provides no gradient to optimize.
In the figure below (from CHEN), the case on the right should have a smaller loss, yet the IoU loss is the same for both.
GIoU introduces the concept of a minimum enclosing region, i.e. the smallest rectangle that contains both the prediction box and the ground-truth box: $GIoU = IoU - \frac{A_c - u}{A_c}$ and $L_{GIoU} = 1 - GIoU$, where $A_c$ is the area of the minimum enclosing rectangle and $u$ is the area of the union of the prediction box and the ground-truth box. The numerator of the second term is the white area in the figure above; the larger the share of the white area in the minimum enclosing rectangle, the higher the loss.
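The following sketch shows how the enclosing-box term separates two non-overlapping predictions that plain IoU loss cannot tell apart. It assumes `(x1, y1, x2, y2)` boxes; the example coordinates are invented for illustration.

```python
def giou(a, b):
    """GIoU = IoU - (A_c - u) / A_c for boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    # Smallest axis-aligned rectangle that encloses both boxes.
    cw = max(a[2], b[2]) - min(a[0], b[0])
    ch = max(a[3], b[3]) - min(a[1], b[1])
    area_c = cw * ch
    return inter / union - (area_c - union) / area_c


def giou_loss(pred, gt):
    return 1.0 - giou(pred, gt)


gt = (0.0, 0.0, 1.0, 1.0)
near = (2.0, 0.0, 3.0, 1.0)  # disjoint, one unit away
far = (5.0, 0.0, 6.0, 1.0)   # disjoint, four units away

# IoU loss is 1 for both; GIoU loss is lower for the nearer box.
print(giou_loss(near, gt), giou_loss(far, gt))
```

Because the loss keeps changing as the disjoint box moves, it still provides a gradient in the case where IoU loss is flat.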
DIoU loss

In the three cases in the figure above, the IoU and GIoU losses are all 0.75, but the third case is clearly the better prediction. DIoU loss distinguishes these cases more accurately. Compared with IoU loss, it adds one penalty term to $1 - IoU$:
$$\left[\frac{\text{Euclidean distance between the centers of the two boxes}}{\text{diagonal length of the minimum enclosing rectangle}}\right]^2$$
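Here is a small sketch of that penalty, assuming `(x1, y1, x2, y2)` boxes. The example places a 1x1 prediction inside a 2x2 ground-truth box, which gives IoU = 0.25 (so IoU loss and GIoU loss are both 0.75) regardless of where the prediction sits; the coordinates are chosen for illustration.

```python
def diou_loss(pred, gt):
    """1 - IoU + (center distance / enclosing-box diagonal)^2."""
    iw = max(0.0, min(pred[2], gt[2]) - max(pred[0], gt[0]))
    ih = max(0.0, min(pred[3], gt[3]) - max(pred[1], gt[1]))
    inter = iw * ih
    union = ((pred[2] - pred[0]) * (pred[3] - pred[1])
             + (gt[2] - gt[0]) * (gt[3] - gt[1]) - inter)
    iou = inter / union
    # Squared Euclidean distance between the two box centers.
    dx = (pred[0] + pred[2]) / 2.0 - (gt[0] + gt[2]) / 2.0
    dy = (pred[1] + pred[3]) / 2.0 - (gt[1] + gt[3]) / 2.0
    rho2 = dx * dx + dy * dy
    # Squared diagonal of the minimum enclosing rectangle.
    cw = max(pred[2], gt[2]) - min(pred[0], gt[0])
    ch = max(pred[3], gt[3]) - min(pred[1], gt[1])
    c2 = cw * cw + ch * ch
    return 1.0 - iou + rho2 / c2


gt = (0.0, 0.0, 2.0, 2.0)
corner = (0.0, 0.0, 1.0, 1.0)   # prediction in a corner of the GT box
center = (0.5, 0.5, 1.5, 1.5)   # prediction centered in the GT box

print(diou_loss(corner, gt), diou_loss(center, gt))
```

The centered prediction keeps the loss at 0.75, while the corner prediction is penalized to 0.8125, so the better-aligned box is preferred.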

DIoU also has the following advantages:
- Because DIoU directly minimizes the distance between the two boxes, it converges much faster than GIoU, as shown in the figure below, especially when the two boxes are aligned horizontally or vertically.
- Used as the criterion in NMS, it gives better results.

CIoU loss (Complete IoU loss)
CIoU loss comes from the same paper as DIoU loss.
The authors argue that a good IoU loss should consider three factors:
- Overlap area
- Center-point distance
- Aspect ratio
IoU and GIoU loss consider only the first factor; DIoU loss additionally considers the second.
The authors therefore also propose CIoU loss, which measures the overlap and similarity of two boxes more accurately. Compared with DIoU, it adds an aspect-ratio penalty term $v$, weighted by a balancing coefficient $\alpha$.

The authors show experimentally that CIoU achieves better results than the other IoU losses.
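As a rough sketch of how the three factors combine, the code below uses the commonly cited CIoU definitions $v = \frac{4}{\pi^2}\left(\arctan\frac{w^{gt}}{h^{gt}} - \arctan\frac{w}{h}\right)^2$ and $\alpha = \frac{v}{(1 - IoU) + v}$, with loss $1 - IoU + \rho^2/c^2 + \alpha v$. Boxes are assumed to be `(x1, y1, x2, y2)` with positive width and height; `eps` is an added guard against division by zero for a perfect match.

```python
import math


def ciou_loss(pred, gt, eps=1e-9):
    """1 - IoU + center-distance penalty + alpha * aspect-ratio penalty."""
    pw, ph = pred[2] - pred[0], pred[3] - pred[1]
    gw, gh = gt[2] - gt[0], gt[3] - gt[1]
    # Factor 1: overlap area.
    iw = max(0.0, min(pred[2], gt[2]) - max(pred[0], gt[0]))
    ih = max(0.0, min(pred[3], gt[3]) - max(pred[1], gt[1]))
    inter = iw * ih
    iou = inter / (pw * ph + gw * gh - inter)
    # Factor 2: center distance, normalized by the enclosing-box diagonal.
    dx = (pred[0] + pred[2]) / 2.0 - (gt[0] + gt[2]) / 2.0
    dy = (pred[1] + pred[3]) / 2.0 - (gt[1] + gt[3]) / 2.0
    cw = max(pred[2], gt[2]) - min(pred[0], gt[0])
    ch = max(pred[3], gt[3]) - min(pred[1], gt[1])
    distance = (dx * dx + dy * dy) / (cw * cw + ch * ch)
    # Factor 3: aspect-ratio consistency v and its balancing coefficient alpha.
    v = (4.0 / math.pi ** 2) * (math.atan(gw / gh) - math.atan(pw / ph)) ** 2
    alpha = v / ((1.0 - iou) + v + eps)
    return 1.0 - iou + distance + alpha * v


gt = (0.0, 0.0, 2.0, 2.0)
square = (0.5, 0.5, 1.5, 1.5)    # same center, same aspect ratio as GT
wide = (0.0, 0.75, 2.0, 1.25)    # same center and area, but 4:1 aspect ratio

print(ciou_loss(square, gt), ciou_loss(wide, gt))
```

Both predictions have the same IoU and the same center as the ground truth, so IoU, GIoU, and DIoU losses all give 0.75; only the aspect-ratio term makes CIoU prefer the square one.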
Focal-EIoU loss

From what I have seen, the reception of this loss has not been great. I have not read it carefully yet, so this section is a placeholder to fill in later.
Alpha IoU

Simply put, it applies a power operation to the IoU loss family; the formulas are shown in the figure below.


As shown above, Alpha-IoU adaptively increases the loss and gradient of high-IoU objects to improve BBox regression accuracy, and it is more robust on small datasets and noisy boxes.
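For the plain IoU member of the family, the power operation can be sketched as $L_{\alpha\text{-}IoU} = 1 - IoU^{\alpha}$. The snippet below works directly on an IoU value; the default `alpha=3.0` is an assumption chosen for illustration, and `gradient_weight` is a hypothetical helper that compares the gradient magnitude with the `alpha = 1` case.

```python
def alpha_iou_loss(iou_value, alpha=3.0):
    """Power form of IoU loss; alpha = 1 recovers the usual 1 - IoU."""
    return 1.0 - iou_value ** alpha


def gradient_weight(iou_value, alpha=3.0):
    """|dL/dIoU| = alpha * IoU^(alpha - 1); equals 1 everywhere when alpha = 1."""
    return alpha * iou_value ** (alpha - 1.0)


for value in (0.3, 0.6, 0.9):
    print(value, alpha_iou_loss(value), gradient_weight(value))
```

With `alpha > 1` the gradient weight grows with IoU, which is one way to read the claim that well-localized boxes receive relatively more gradient than under plain IoU loss.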
SIoU loss

This is a new loss from a preprint freshly released in May, but it reportedly improves considerably over CIoU. This section is a placeholder to fill in later.