当前位置:网站首页>Bbox regression loss function in target detection -l2, smooth L1, IOU, giou, Diou, ciou, focal eiou, alpha IOU, Siou
Bbox regression loss function in target detection -l2, smooth L1, IOU, giou, Diou, ciou, focal eiou, alpha IOU, Siou
2022-07-07 05:53:00 【cartes1us】
Two tasks of target detection , Classification and location regression , This post summarizes the classic position regression loss function as follows , In chronological order of publication .
L1、L2、smooth L1 loss
Put forward smooth L1 loss The paper of :
L1 The lowest point is not differentiable , So it will not converge to the lowest point , May oscillate near the optimal solution . and L2 Loss is easy to cause gradient explosion at outliers .smooth L1 It combines the advantages of both .
IoU loss
Put forward IoU loss The paper of :
Whether it's L2 still smooth L1 loss Did not consider The relevance of the four points and Scale invariance , This is a fatal shortcoming , When two pairs of prediction boxes and GT Framed IoU Phase at the same time , The pair with larger scale loss It will be higher , Or as shown in the figure below , Use the lower left corner and upper right corner to calculate the loss ,L2 loss identical , but IoU But not the same .
IoU There are two forms of loss , The latter is more commonly used :
L I o U = − l n I o U L_{IoU} = -lnIoU LIoU=−lnIoU
L I o U = 1 − I o U L_{IoU} = 1-IoU LIoU=1−IoU
such ,BBox The evaluation index and optimization index of regression problem have been overlapped and unified .
GIoU loss
Put forward GIoU loss The paper of :
IoU loss The biggest drawback is when the two boxes do not intersect IoU Horizontal 0, Loss constant 1, There is no way to provide optimized gradients .
Here's the picture ( The picture is from CHEN), On the right loss It should be smaller , but IoU loss It's the same .
GIoU A concept of minimum closure region is introduced , That is, the smallest rectangular box that can wrap the prediction box and the real box , among , A c A_c Ac Is the minimum closure area , u u u It is the union of prediction box and real box , that GIoU The molecule of the second term is the white area in the above figure , The higher the value of the white area than the minimum closure area ,loss The higher the .
DIoU loss
and
In the above figure, there are three cases IoU and GIoU Of loss Are all 0.75, But obviously, the third case should be a better prediction , and DIoU loss These situations can be expressed more accurately , The calculation formula is as follows , comparison IoU Loss of one more penalty , yes
[ two individual box in heart spot Of o type distance leave most Small close package Moment shape Yes horn Line Long degree ] 2 [{\frac{ The Euclidean distance between the center points of the two frames }{ Minimum closure rectangle diagonal length }}]^2 [ most Small close package Moment shape Yes horn Line Long degree two individual box in heart spot Of o type distance leave ]2
DIoU There are also the following advantages :
- because DIoU Directly minimize the distance between the two boxes , So convergence is better than GIoU Much faster , As shown in the figure below . Especially when the relative direction of the two boxes is vertical or horizontal .
- As NMS Better results can be obtained when evaluating indicators .
CIoU loss(Complete IoU Loss)
And DIoU loss From the same article
The author thinks that , well IoU Loss should consider three factors :
- The area of intersection
- Distance from the center
- Aspect ratio
and IoU and GIoU loss Only the first factor is considered ,DIoU loss Consider the second factor more .
The author also puts forward CIoU loss, It can measure the coincidence degree and similarity of two boxes more accurately , Than DIoU There is an additional penalty item of aspect ratio v v v, α \alpha α It's the equilibrium coefficient .
The author shows through experiments that ,CIoU Compared to other IoU Loss achieved better experimental results .
Focal-EIoU loss
I can see that the comments on this loss are not very good , I haven't looked carefully yet , Dig a hole .
Alpha IoU
Simply put, yes IoU loss The family did a power operation , The formula shown in the figure below .
Pictured above ,alpha-IoU It can be improved adaptively IoU Object loss and gradient to improve BBox Regression accuracy , And for small data sets and noise BBox Provides better robustness .
SIoU loss
5 The hot month is still the new loss of preprint , But the effect is better than CIoU A lot of improvement , Dig a hole .
边栏推荐
- AI face editor makes Lena smile
- TCC of distributed transaction solutions
- Digital IC interview summary (interview experience sharing of large manufacturers)
- PTA 天梯赛练习题集 L2-002 链表去重
- SAP ABAP BDC(批量数据通信)-018
- Forkjoin is the most comprehensive and detailed explanation (from principle design to use diagram)
- Nvisual network visualization
- Add salt and pepper noise or Gaussian noise to the picture
- 驱动开发中platform设备驱动架构详解
- Pinduoduo product details interface, pinduoduo product basic information, pinduoduo product attribute interface
猜你喜欢
AI face editor makes Lena smile
毕业之后才知道的——知网查重原理以及降重举例
《2022中国低/无代码市场研究及选型评估报告》发布
分布式事务介绍
Industrial Finance 3.0: financial technology of "dredging blood vessels"
WEB架构设计过程
Modes of optical fiber - single mode and multimode
[云原生]微服务架构是什么?
Web authentication API compatible version information
Web Authentication API兼容版本信息
随机推荐
消息队列:如何确保消息不会丢失
集群、分布式、微服務的區別和介紹
Pinduoduo product details interface, pinduoduo product basic information, pinduoduo product attribute interface
AI face editor makes Lena smile
Reading the paper [sensor enlarged egocentric video captioning with dynamic modal attention]
ForkJoin最全详解(从原理设计到使用图解)
Message queue: how to deal with message backlog?
WEB架构设计过程
[solved] record an error in easyexcel [when reading the XLS file, no error will be reported when reading the whole table, and an error will be reported when reading the specified sheet name]
C#可空类型
Get the way to optimize the one-stop worktable of customer service
5. Data access - entityframework integration
zabbix_get测试数据库失败
C nullable type
PowerPivot——DAX(函数)
"Multimodal" concept
Interview skills of software testing
JD commodity details page API interface, JD commodity sales API interface, JD commodity list API interface, JD app details API interface, JD details API interface, JD SKU information interface
Introduction to distributed transactions
Modes of optical fiber - single mode and multimode