当前位置:网站首页>[target detection] |dive detector into box for object detection new training method based on fcos
[target detection] |dive detector into box for object detection new training method based on fcos
2022-06-12 21:26:00 【rrr2】
ECCV2020 The paper
Address of thesis :https://arxiv.org/abs/2007.14350
Problem description
In the field of target detection , Despite the success of the no anchor box , But the positioning accuracy is insufficient .
Problem cause analysis
1 The semantics of the central key point and the target are inconsistent . In the current anchor-free In the method , The central key point is very important , But as shown in the picture 1 Shown , The central key point area corresponding to the target is more irrelevant background , This will inevitably Noise pixels are taken as positive samples . If Use this simple strategy to define positive sample pixels , Certainly Leading to obvious semantic inconsistencies , This leads to a decline in the accuracy of regression .
2 The regression of local features has limitations . Because of the limited size of convolution kernel , Every The effective perception domain corresponding to the central key point Probably Only part of the target information is covered , Using only the key points bbox Regression can cause performance degradation . Pictured 2 Shown , The dotted line prediction box is the result of the center point prediction , Each box is not perfectly aligned to the target .

This method
This paper proposes a method based on FCOS A new target detection algorithm DDBNet, The main innovation lies in box Decomposing and reorganizing modules (D&R, decomposition and recombination) and Semantic consistency module (semantic consistency), It is used to solve the problem of inaccuracy of the central key point and the semantic inconsistency between the central key point and the target , The result is shown in Fig. 2 Solid wireframe in .
D&R modular , Decompose multiple prediction boxes into multiple boundaries , And then combine it into a new prediction box , These boundaries are connected behind the regression branch . Combine the original prediction box for accurate training , This module removes from the prediction . In the training phase , Once the bounding box prediction is regressed at each pixel ,D&R The module decomposes each bounding box into four directions . then , Sort the boundaries of the same kind according to the actual boundary deviation from the ground true value . therefore , By regrouping ranking boundaries , More accurate box predictions can be expected , And then through IoU Loss optimization box forecast 【30】.
Semantic consistency module , According to the classification score and intrinsic importance of pixels , It is adaptively classified into positive sample pixel and sub sample pixel . The framework introduces a new branch , That is, to estimate semantic consistency rather than centrality , And it is optimized under the supervision of the semantic consistency module . The module uses an adaptive filtering strategy based on classification and regression branch output .
Main contributions :
be based on anchor-free Architecture proposes a new target detection algorithm DDBNet, can Well solve the regression problem of the central key point as well as Semantic consistency of central key points .
Verify the central key point and GT Semantic consistency of , It can help to improve the convergence of the target detection network .
DDBNet Can achieve SOTA precision (45.5%), And it can be efficiently extended to other anchor-free In the detector .

Box Decomposition and Recombination

semantic consistency module

REF
https://jishuin.proginn.com/p/763bfbd2df4a
边栏推荐
- 二分查找
- Market trend report, technical innovation and market forecast of hydraulic torque wrench in China
- #981 Time Based Key-Value Store
- pytorch transforms. Use of lambda
- 有向图深拷贝
- ATOI super resolution
- Lake shore PT-100 platinum resistance temperature sensor
- How to improve communication efficiency during home office | community essay solicitation
- 重排数列练习题
- Yanghui triangle code implementation
猜你喜欢

makefile 的ifeq,filter,strip 简单使用

Vs2017 environmental issues

Lake shore PT-100 platinum resistance temperature sensor

结构体知识点all in

Mxnet record IO details

#113 Path Sum II

nn. PReLU(planes)

The Post-00 financial woman with a monthly salary of 2W conquered the boss with this set of report template

Leetcode: 210. Programme II

The year of the outbreak of financial innovation! All dtinsight products of kangaroo cloud data stack have passed the special test of Xinchuang of ICT Institute
随机推荐
Data visualization - broken line area chart
ZGC concurrent identity and multi view address mapping in concurrent transition phase
Experiment 7-2-6 print Yanghui triangle (20 points)
金融信创爆发年!袋鼠云数栈DTinsight全线产品通过信通院信创专项测试
torch. Finfo function
Solution of multi machine room dynamic loop status network touch screen monitoring
Gather function in pytorch_
drf 接收嵌套数据并创建对象, 解决:drf NOT NULL constraint failed
shell语言
leetcode:210. Schedule II
Draw according to weight
Insert sort
Research Report on market supply and demand and strategy of China's hydraulic hammer industry
JdbcTemplate插入并返回主键
remote: Support for password authentication was removed on August 13, 2021
Bluetooth for Delphi xe7
最简单ALV模板
Binary search
A blog written clearly by vit
Delphi XE7的蓝牙 Bluetooth