当前位置:网站首页>Anchor free detector: centernet
Anchor free detector: centernet
2022-07-27 10:01:00 【yfy2022yfy】
2019/07/15, Reprint please indicate
Address of thesis :https://arxiv.org/abs/1904.07850
Source code :https://github.com/xingyizhou/CenterNet
One 、 Abstract
At present, more successful detectors , They all list a lot of potential target locations , Then classify each , After that, we need to add post-processing , The author believes that this practice is still inefficient .
In this paper , The author put A goal is modeled as a point -- The center point of the target detection frame . The detector estimates through key points , Find the center point , And return to other attributes , such as size,3D Location , Direction and even posture .
CenterNet stay MS COCO The best speed accuracy performance is achieved on the data set :
- 28.1%AP @142fps
- 37.4%AP @52fps
- 45.1%AP @1.4fps( Multi scale testing ).
Two 、 primary coverage
1、 Methods to introduce
In this paper , The author modeled a goal as a point -- The center point of the target detection frame . The detector estimates through key points , Find the center point , And return to other attributes , such as size,3D Location , Direction and even posture .
object detection It has become a standard key point prediction problem :
- Input the picture into the full convolution network to generate heatmap, The peak value of the heat map corresponds to the target center ;
- Predict the width and height of the target detection frame according to the picture features at the peak ;
in addition , Output additional predictions at the center , Can be done 3d object detection , And multi person posture estimation :
- about 3D Box estimates , We return to the absolute depth of the object 、 3、 ... and D Box size and object orientation
- For human posture estimation , We regard the two-dimensional joint point as the central position + Offset , Direct regression offset
2、 As before based on anchor Comparison of methods
This method and is based on anchor Of one stage The method is similar to . The center point itself can also be regarded as a box with unknown shape anchor.
The difference is :
- Center point anchor Allocation is only based on location , Don't consider IoU, There is no threshold and classification ;
- A goal has only one positive example center , no need NMS, But simply from heatmap Take the peak .
- CenterNet Higher output resolution , Output stride=4, Compared with the traditional stride=16.
3、 Output thermal diagram
The network output result is down sampling stride=4. The author uses several full convolutional codes - Decode the network prediction heat map : Stacked Hourglass The Internet , On the sampling ResNet, And deep polymerization DLA.
First calculate GT Key points , Using Gauss check GT Point and adjacent pixel calculation . If there is overlap after Gaussian calculation , Then the maximum value is used for each pixel . Use during training focal loss.
To compensate for downsampling (stride=4) Resulting quantization error , We additionally predicted the center point offset , For training L1 loss.
4、 The central point is the goal
Suppose the coordinates of the target box are
. Center point
Output from the thermodynamic diagram network . in addition , You can return to the target frame width and height
. Considering the amount of calculation , Here, only the output thermal map is used to predict the size of the target frame , Use L1 loss.
The author uses a network to predict key points at the same time 、 Center point offset 、 And scale , common C+4 results ( For key points C A thermodynamic diagram shows ) By the way , The value of the center point is 0~1 between , Is confidence . All outputs share a backbone , Then each output will use one 3x3 Convolution ,ReLU and 1x1 Convolution .

- Heat map C The thermodynamic diagrams of this category share C individual .
- It is written that every thermodynamic diagram has n A peak , The author sets the maximum 100, I understand it is multi-target detection .
- Each target has a corresponding center point and frame size
5、 Detection extension
In addition to output target detection , This method can be in the same form , increase 3D testing 、 The branch of human posture estimation .


6、 experimental result
(1) object detection


Most of the one-stage detector , Single scale / Multiscale testing .
(2)3D object detection

(3) Human posture estimation

There is an appendix after the quotation of the paper , If you are interested, you can have a look .
边栏推荐
- npm常用命令
- [cloud native • Devops] master the container management tool rancher
- 注解与反射
- Qt 学习(二) —— .pro文件详解
- swagger-editor
- Understand chisel language. 26. Chisel advanced input signal processing (II) -- majority voter filtering, function abstraction and asynchronous reset
- LeetCode.1260. 二维网格迁移____原地暴力 / 降维+循环数组直接定位
- Simple use of tflite
- XML overview
- Looking for a job for 4 months, interviewing 15 companies and getting 3 offers
猜你喜欢

Why is redis so fast? Redis threading model and redis multithreading

A ride into Qinchuan -- a brief talk on how beego Autorouter works

视觉SLAM十四讲笔记(一):第一讲+第二讲
![[scm] source code management - lock of perforce branch](/img/c6/daead474a64a9a3c86dd140c097be0.jpg)
[scm] source code management - lock of perforce branch

拜托!面试请不要再问我 Ribbon 的架构原理

PCL各模块概述(1.6)

并发之park与unpark说明

LeetCode.814. 二叉树剪枝____DFS

Talk about 10 scenarios of index failure. It's too stupid
![Shell函数、系统函数、basename [string / pathname] [suffix] 可以理解为取路径里的文件名称 、dirname 文件绝对路径、自定义函数](/img/3d/d7276d2010f1d77a3bd572cc66eced.png)
Shell函数、系统函数、basename [string / pathname] [suffix] 可以理解为取路径里的文件名称 、dirname 文件绝对路径、自定义函数
随机推荐
安装了HAL库如何恢复原来的版本
Shell的正则表达式入门、常规匹配、特殊字符:^、$、.、*、字符区间(中括号):[ ]、特殊字符:\、匹配手机号
3D人脸重建:Joint 3D Face Reconstruction and Dense Alignment with position Map Regression Network
Fundamentals of Materials Engineering - key points
Understand chisel language. 27. Chisel advanced finite state machine (I) -- basic finite state machine (Moore machine)
Leetcode.565. array nesting____ Violent dfs- > pruning dfs- > in situ modification
达梦 PARTGROUPDEF是自定义的对象吗?
NFT系统开发-教程
Come on, chengxujun
Expose a technology boss from a poor family
Write yourself a year-end summary. Happy New Year!
July training (day 24) - segment tree
会议OA项目之会议排座功能&&会议送审的实现
Annotation and reflection
习题 --- 快排、归并、浮点数二分
Explain knative cloud function framework in simple terms!
一骑入秦川——浅聊Beego AutoRouter是如何工作
Leetcode.1260. 2D grid migration____ In situ violence / dimensionality reduction + direct positioning of circular array
圆环工件毛刺(凸起)缺口(凹陷)检测案例
Redis 为什么这么快?Redis 的线程模型与 Redis 多线程