当前位置:网站首页>Bbox regression loss function in target detection -l2, smooth L1, IOU, giou, Diou, ciou, focal eiou, alpha IOU, Siou
Bbox regression loss function in target detection -l2, smooth L1, IOU, giou, Diou, ciou, focal eiou, alpha IOU, Siou
2022-07-07 05:53:00 【cartes1us】
Two tasks of target detection , Classification and location regression , This post summarizes the classic position regression loss function as follows , In chronological order of publication .
L1、L2、smooth L1 loss
Put forward smooth L1 loss The paper of :
L1 The lowest point is not differentiable , So it will not converge to the lowest point , May oscillate near the optimal solution . and L2 Loss is easy to cause gradient explosion at outliers .smooth L1 It combines the advantages of both .
IoU loss
Put forward IoU loss The paper of :
Whether it's L2 still smooth L1 loss Did not consider The relevance of the four points and Scale invariance , This is a fatal shortcoming , When two pairs of prediction boxes and GT Framed IoU Phase at the same time , The pair with larger scale loss It will be higher , Or as shown in the figure below , Use the lower left corner and upper right corner to calculate the loss ,L2 loss identical , but IoU But not the same .
IoU There are two forms of loss , The latter is more commonly used :
L I o U = − l n I o U L_{IoU} = -lnIoU LIoU=−lnIoU
L I o U = 1 − I o U L_{IoU} = 1-IoU LIoU=1−IoU
such ,BBox The evaluation index and optimization index of regression problem have been overlapped and unified .
GIoU loss
Put forward GIoU loss The paper of :
IoU loss The biggest drawback is when the two boxes do not intersect IoU Horizontal 0, Loss constant 1, There is no way to provide optimized gradients .
Here's the picture ( The picture is from CHEN), On the right loss It should be smaller , but IoU loss It's the same .
GIoU A concept of minimum closure region is introduced , That is, the smallest rectangular box that can wrap the prediction box and the real box , among , A c A_c Ac Is the minimum closure area , u u u It is the union of prediction box and real box , that GIoU The molecule of the second term is the white area in the above figure , The higher the value of the white area than the minimum closure area ,loss The higher the .
DIoU loss
and
In the above figure, there are three cases IoU and GIoU Of loss Are all 0.75, But obviously, the third case should be a better prediction , and DIoU loss These situations can be expressed more accurately , The calculation formula is as follows , comparison IoU Loss of one more penalty , yes
[ two individual box in heart spot Of o type distance leave most Small close package Moment shape Yes horn Line Long degree ] 2 [{\frac{ The Euclidean distance between the center points of the two frames }{ Minimum closure rectangle diagonal length }}]^2 [ most Small close package Moment shape Yes horn Line Long degree two individual box in heart spot Of o type distance leave ]2
DIoU There are also the following advantages :
- because DIoU Directly minimize the distance between the two boxes , So convergence is better than GIoU Much faster , As shown in the figure below . Especially when the relative direction of the two boxes is vertical or horizontal .
- As NMS Better results can be obtained when evaluating indicators .
CIoU loss(Complete IoU Loss)
And DIoU loss From the same article
The author thinks that , well IoU Loss should consider three factors :
- The area of intersection
- Distance from the center
- Aspect ratio
and IoU and GIoU loss Only the first factor is considered ,DIoU loss Consider the second factor more .
The author also puts forward CIoU loss, It can measure the coincidence degree and similarity of two boxes more accurately , Than DIoU There is an additional penalty item of aspect ratio v v v, α \alpha α It's the equilibrium coefficient .
The author shows through experiments that ,CIoU Compared to other IoU Loss achieved better experimental results .
Focal-EIoU loss
I can see that the comments on this loss are not very good , I haven't looked carefully yet , Dig a hole .
Alpha IoU
Simply put, yes IoU loss The family did a power operation , The formula shown in the figure below .
Pictured above ,alpha-IoU It can be improved adaptively IoU Object loss and gradient to improve BBox Regression accuracy , And for small data sets and noise BBox Provides better robustness .
SIoU loss
5 The hot month is still the new loss of preprint , But the effect is better than CIoU A lot of improvement , Dig a hole .
边栏推荐
- Industrial Finance 3.0: financial technology of "dredging blood vessels"
- Nodejs get client IP
- 目标检测中的损失函数与正负样本分配:RetinaNet与Focal loss
- 980. 不同路径 III DFS
- EMMC print cqhci: timeout for tag 10 prompt analysis and solution
- 架构设计的五个核心要素
- [reading of the paper] a multi branch hybrid transformer network for channel terminal cell segmentation
- Five core elements of architecture design
- What is message queuing?
- Digital IC interview summary (interview experience sharing of large manufacturers)
猜你喜欢
[云原生]微服务架构是什么?
Question 102: sequence traversal of binary tree
Hcip eighth operation
如何提高网站权重
SAP ABAP BDC(批量数据通信)-018
Harmonyos practice - Introduction to development, analysis of atomized services
Message queue: how to deal with message backlog?
毕业之后才知道的——知网查重原理以及降重举例
数据中心为什么需要一套基础设施可视化管理系统
《2022中国低/无代码市场研究及选型评估报告》发布
随机推荐
Codeforces Round #416 (Div. 2) D. Vladik and Favorite Game
《ClickHouse原理解析与应用实践》读书笔记(6)
2pc of distributed transaction solution
5. Data access - entityframework integration
Nvisual network visualization
yarn入门(一篇就够了)
【日常训练--腾讯精选50】292. Nim 游戏
STM32 key state machine 2 - state simplification and long press function addition
Hcip seventh operation
Flink SQL realizes reading and writing redis and dynamically generates hset key
原生小程序 之 input切換 text與password類型
原生小程序 之 input切换 text与password类型
爬虫练习题(三)
Interview questions and salary and welfare of Shanghai byte
Reading the paper [sensor enlarged egocentric video captioning with dynamic modal attention]
async / await
make makefile cmake qmake都是什么,有什么区别?
Go 语言的 Context 详解
JVM the truth you need to know
How to improve website weight