当前位置:网站首页>[target detection] |dive detector into box for object detection new training method based on fcos
[target detection] |dive detector into box for object detection new training method based on fcos
2022-06-12 21:26:00 【rrr2】
ECCV2020 The paper
Address of thesis :https://arxiv.org/abs/2007.14350
Problem description
In the field of target detection , Despite the success of the no anchor box , But the positioning accuracy is insufficient .
Problem cause analysis
1 The semantics of the central key point and the target are inconsistent . In the current anchor-free In the method , The central key point is very important , But as shown in the picture 1 Shown , The central key point area corresponding to the target is more irrelevant background , This will inevitably Noise pixels are taken as positive samples . If Use this simple strategy to define positive sample pixels , Certainly Leading to obvious semantic inconsistencies , This leads to a decline in the accuracy of regression .
2 The regression of local features has limitations . Because of the limited size of convolution kernel , Every The effective perception domain corresponding to the central key point Probably Only part of the target information is covered , Using only the key points bbox Regression can cause performance degradation . Pictured 2 Shown , The dotted line prediction box is the result of the center point prediction , Each box is not perfectly aligned to the target .

This method
This paper proposes a method based on FCOS A new target detection algorithm DDBNet, The main innovation lies in box Decomposing and reorganizing modules (D&R, decomposition and recombination) and Semantic consistency module (semantic consistency), It is used to solve the problem of inaccuracy of the central key point and the semantic inconsistency between the central key point and the target , The result is shown in Fig. 2 Solid wireframe in .
D&R modular , Decompose multiple prediction boxes into multiple boundaries , And then combine it into a new prediction box , These boundaries are connected behind the regression branch . Combine the original prediction box for accurate training , This module removes from the prediction . In the training phase , Once the bounding box prediction is regressed at each pixel ,D&R The module decomposes each bounding box into four directions . then , Sort the boundaries of the same kind according to the actual boundary deviation from the ground true value . therefore , By regrouping ranking boundaries , More accurate box predictions can be expected , And then through IoU Loss optimization box forecast 【30】.
Semantic consistency module , According to the classification score and intrinsic importance of pixels , It is adaptively classified into positive sample pixel and sub sample pixel . The framework introduces a new branch , That is, to estimate semantic consistency rather than centrality , And it is optimized under the supervision of the semantic consistency module . The module uses an adaptive filtering strategy based on classification and regression branch output .
Main contributions :
be based on anchor-free Architecture proposes a new target detection algorithm DDBNet, can Well solve the regression problem of the central key point as well as Semantic consistency of central key points .
Verify the central key point and GT Semantic consistency of , It can help to improve the convergence of the target detection network .
DDBNet Can achieve SOTA precision (45.5%), And it can be efficiently extended to other anchor-free In the detector .

Box Decomposition and Recombination

semantic consistency module

REF
https://jishuin.proginn.com/p/763bfbd2df4a
边栏推荐
- Data visualization - broken line area chart
- #141 Linked List Cycle
- GPU giant NVIDIA suffered a "devastating" network attack, and the number one malware shut down its botnet infrastructure | global network security hotspot on February 28
- 同花顺能开户吗,在同花顺开户安全么
- Compréhension préliminaire des expressions régulières cognitives (regex)
- Solve one-dimensional array prefix sum
- What are the disadvantages of bone conduction earphones? Analysis of advantages and disadvantages of bone conduction earphones
- Teamwork collaboration application experience sharing | community essay solicitation
- NIO使用指南
- Draw according to weight
猜你喜欢

Access control system based on RFID

风控建模十:传统建模方法存在的问题探讨及改进方法探索

Pixel level reconstruction and restoration technology to solve severe image blur

The Post-00 financial woman with a monthly salary of 2W conquered the boss with this set of report template

Leetcode: 210. Programme II

Lake shore PT-100 platinum resistance temperature sensor

GNS installation and configuration

Image processing 12- image linear blending

Teambition 协作应用心得分享|社区征文

Lombok package is successfully installed, but the runtime prompts that get, set method and constructor solution cannot be found
随机推荐
Vs2017 environmental issues
torch. unique()
Teamwork collaboration application experience sharing | community essay solicitation
Allegro Xile technology, a developer of distributed cloud services, received millions of dollars of angel round financing and was independently invested by Yaotu capital
Junda technology is applicable to "kestar" intelligent precision air conditioning network monitoring
Ubuntu 16.04 installing mysql5.6
Gather function in pytorch_
Scatter in pytorch_ () function
torch. clamp_ min_ method
Cv2.lut() (populates the output array with values from the lookup table)
GPU giant NVIDIA suffered a "devastating" network attack, and the number one malware shut down its botnet infrastructure | global network security hotspot on February 28
Image processing 12- image linear blending
Risk control modeling X: Discussion on problems existing in traditional modeling methods and Exploration on improvement methods
What did new do
JdbcTemplate插入并返回主键
CUDA out of memory
风控建模十:传统建模方法存在的问题探讨及改进方法探索
zgc的垃圾收集的主要阶段
Shell language
Lua pattern matching