当前位置:网站首页>Loss function and positive and negative sample allocation in target detection: retinanet and focal loss
Loss function and positive and negative sample allocation in target detection: retinanet and focal loss
2022-07-07 05:53:00 【cartes1us】
RetinaNet
In the field of target detection , For the first time, the accuracy of single-stage algorithm exceeds that of two-stage algorithm , Namely RetinaNet.
Network structure :
The network structure designed by the author is not very innovative , This is what the article says :
The design of our RetinaNet detector shares many similarities with
previous dense detectors, in particular the concept of ‘anchors’
introduced by RPN [3] and use of features pyramids as in SSD [9] and
FPN [4]. We emphasize that our simple detector achieves top results
not based on innovations in network design but due to our novel loss.
Detection head is Classification and BBox Regression decoupling Of , And it is based on anchor frame , after FPN After that, five characteristic maps with different scales are output , Each layer corresponds to 32~512 Anchor frame of scale , And each layer is based on scale and ratios There are different combinations of 9 Seed anchor frame , Finally, the anchor frame size of the whole network is 32 ~ 813 Between . Use the offset predicted by the network relative to the anchor box to calculate BBox Methods and Faster R-CNN identical . The following figure is the picture of thunderbolt .
The structure diagram in the paper is as follows , Only by FPN The characteristic diagrams of three scales are drawn ,W,H,K,A Respectively represent the width of the feature map , high , Number of categories ( Does not contain background classes ), Number of anchor frames (9).
Positive and negative samples match
Positive sample : Predicted BBox And gt IoU>=0.5,
Negative sample : Predicted BBox And gt IoU<0.4,
Other samples are discarded
prospects , Background quantity imbalance
CE Variants of loss
The biggest innovation of this work :Focal loss, Rewrite the classic cross entropy loss , Apply to class subnet Branch , The weight of the loss of easily classified samples is greatly reduced , Beautiful form . In the paper γ \gamma γ Recommended 2, if γ \gamma γ take 0, be FL It degenerates into CE.
Loss :
The first category loss is to calculate all samples ( Including positive and negative ) Of Focal loss, Then remove the number of positive samples N p o s N_{pos} Npos.BBox The return loss is Fast R-CNN Proposed in smooth L1 loss.
To be continued
边栏推荐
- 毕业之后才知道的——知网查重原理以及降重举例
- Reading the paper [sensor enlarged egocentric video captioning with dynamic modal attention]
- 消息队列:如何确保消息不会丢失
- 集群、分布式、微服務的區別和介紹
- Flinksql read / write PgSQL
- Message queuing: how to ensure that messages are not lost
- STM32按键状态机2——状态简化与增加长按功能
- [paper reading] semi supervised left atrium segmentation with mutual consistency training
- async / await
- [reading of the paper] a multi branch hybrid transformer network for channel terminal cell segmentation
猜你喜欢
Différenciation et introduction des services groupés, distribués et microservices
Paper reading [open book video captioning with retrieve copy generate network]
判断文件是否为DICOM文件
Realize GDB remote debugging function between different network segments
消息队列:如何确保消息不会丢失
bat 批示处理详解
Paper reading [MM21 pre training for video understanding challenge:video captioning with pre training techniqu]
毕业之后才知道的——知网查重原理以及降重举例
Hcip seventh operation
集群、分布式、微服務的區別和介紹
随机推荐
Paper reading [open book video captioning with retrieve copy generate network]
《2022中国低/无代码市场研究及选型评估报告》发布
Web architecture design process
Forkjoin is the most comprehensive and detailed explanation (from principle design to use diagram)
Win configuration PM2 boot auto start node project
I didn't know it until I graduated -- the principle of HowNet duplication check and examples of weight reduction
如何提高网站权重
[daily training -- Tencent selected 50] 292 Nim games
Educational Codeforces Round 22 B. The Golden Age
Flask1.1.4 werkzeug1.0.1 source code analysis: start the process
pytorch_ 01 automatic derivation mechanism
Nvisual network visualization
常用消息队列有哪些?
Explication contextuelle du langage Go
Differences and introduction of cluster, distributed and microservice
Reading notes of Clickhouse principle analysis and Application Practice (6)
sql查询:将下一行减去上一行,并做相应的计算
Unity keeps the camera behind and above the player
Web Authentication API兼容版本信息
Data storage 3