Improved Pillar with Fine-grained Feature for 3D Object Detection: paper notes
2022-07-29 07:03:00 【byzy】
Link to the original paper: https://arxiv.org/pdf/2110.06049.pdf
Introduction
Current 3D detection methods fall into three groups according to how they represent the point cloud: point-based, voxel-based, and 2D-grid-based. Point-based methods extract the most fine-grained features but are slow. Voxel-based methods rely on sparse convolution, which makes them time-consuming and unstable. 2D-grid-based methods (such as BEV or RV) are the fastest, but the projection may lose information, so their accuracy may fall short of the other two.
Building on PointPillars, this paper introduces the height-aware sub-pillar (HS-Pillar), which uses a height-aware positional encoding to obtain fine-grained features in the vertical direction, and the sparsity-based tiny-pillar (ST-Pillar), which uses a sparsity-based CNN backbone (a stack of dense feature and sparse attention, or DFSA, modules) to obtain fine-grained features in the horizontal direction.
Method
As shown in the figure below, the method consists of three parts. First, the point cloud is projected into tiny pillars and sub-pillars to obtain a fine-grained 2D pseudo-image. Next, a CNN backbone containing DFSA modules extracts features: the large-scale feature maps carry the position information of objects, and the small-scale feature maps carry their shape information. Finally, the features are fed to the detection head, which predicts the size and position of the bounding boxes.

Height-aware sub-pillar
Sub-pillars: each pillar is divided into several sub-pillars along the height axis. Each point is augmented with the center of the points in its sub-pillar and with its offset from the center of the sub-pillar. Two VFE layers then serve as the pillar feature encoder (PFE) and extract features from each sub-pillar. The features of all sub-pillars are concatenated to form the feature at the corresponding position of the 2D pseudo-image.

Because the height distribution of the points is concentrated, dividing pillars into sub-pillars adds only a small amount of computation time.
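The sub-pillar assignment and point augmentation described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function name, the 9-value layout (xyz, mean of the points in the same sub-pillar, offset from the sub-pillar's geometric center), and the height range are assumptions made for the example.

```python
import numpy as np

def augment_sub_pillar_points(points, pillar_center_xy, z_min, z_max, num_sub):
    """Split one pillar's points into sub-pillars along z and augment them.

    points: (N, 3) array of x, y, z for the points inside a single pillar.
    Returns (sub_idx, augmented) where augmented has shape (N, 9):
    xyz, mean of the points in the same sub-pillar, offset from the
    sub-pillar's geometric center (assumed layout).
    """
    points = np.asarray(points, dtype=float)
    height = (z_max - z_min) / num_sub
    # Index of the sub-pillar each point falls into, clipped to the valid range.
    sub_idx = np.clip(((points[:, 2] - z_min) // height).astype(int), 0, num_sub - 1)
    augmented = np.zeros((len(points), 9))
    for s in range(num_sub):
        mask = sub_idx == s
        if not mask.any():
            continue  # empty sub-pillar, nothing to augment
        mean = points[mask].mean(axis=0)
        center = np.array([pillar_center_xy[0], pillar_center_xy[1],
                           z_min + (s + 0.5) * height])
        augmented[mask, 0:3] = points[mask]
        augmented[mask, 3:6] = mean
        augmented[mask, 6:9] = points[mask] - center
    return sub_idx, augmented
```

In a full pipeline the augmented points of each sub-pillar would then go through the VFE layers; that step is omitted here.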
Height-aware positional encoding: directly concatenating the sub-pillar features loses the height information of each sub-pillar. A height positional encoding is therefore introduced and concatenated with the feature of each sub-pillar, and the result is used as the feature at the corresponding position of the 2D pseudo-image.
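The concatenation step can be illustrated with a small sketch. The exact form of the height positional encoding is not reproduced in these notes, so the sinusoidal encoding below is purely an assumption chosen for illustration, as are the function names and dimensions.

```python
import numpy as np

def height_encoding(num_sub, dim):
    """One code per sub-pillar index (assumed sinusoidal form, dim must be even)."""
    pos = np.arange(num_sub)[:, None]
    freq = 1.0 / (10000 ** (np.arange(0, dim, 2) / dim))
    enc = np.zeros((num_sub, dim))
    enc[:, 0::2] = np.sin(pos * freq)
    enc[:, 1::2] = np.cos(pos * freq)
    return enc

def pseudo_image_cell(sub_features, enc):
    """Concatenate each sub-pillar feature with its height code, then flatten.

    sub_features: (num_sub, C), enc: (num_sub, D) -> vector of num_sub * (C + D).
    """
    return np.concatenate([sub_features, enc], axis=1).reshape(-1)
```

Because every sub-pillar gets a different code, two cells whose sub-pillar features are identical but ordered differently along the height no longer produce the same pseudo-image feature.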
Sparsity-based tiny-pillar
Tiny pillars: halve the 2D grid size to obtain finer-grained features.
Sparsity-based CNN backbone: directly reducing the 2D grid size sharply increases runtime and shrinks the receptive field.
The paper proposes a sparsity-based CNN backbone (SCB), built by stacking dense feature and sparse attention (DFSA) modules. Because most tiny pillars are empty, applying a CNN directly is unnecessary and inefficient. Sparse large-scale features can express the distribution of objects and so predict object centers more accurately, while dense small-scale features extract fine-grained object features and predict more accurate object boundaries.
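To see why halving the grid size is costly and why most tiny pillars are empty, the following sketch counts total and occupied cells at two resolutions. The 80 m extent, the 0.32 m and 0.16 m cell sizes, and the uniformly random points are hypothetical values picked for the example, not settings from the paper.

```python
import numpy as np

def grid_stats(points_xy, extent, cell):
    """Return (total cells, non-empty cells) for a square BEV grid."""
    n = int(round(extent / cell))
    idx = np.floor(points_xy / cell).astype(int)
    # Keep only points that fall inside the grid.
    idx = idx[(idx >= 0).all(axis=1) & (idx < n).all(axis=1)]
    occupied = len(np.unique(idx[:, 0] * n + idx[:, 1]))
    return n * n, occupied

rng = np.random.default_rng(0)
points_xy = rng.uniform(0.0, 80.0, size=(2000, 2))  # hypothetical point cloud
coarse_total, coarse_occupied = grid_stats(points_xy, 80.0, 0.32)
fine_total, fine_occupied = grid_stats(points_xy, 80.0, 0.16)
print(coarse_total, fine_total)  # 62500 250000
```

Halving the cell size quadruples the number of cells a dense CNN must process, while the number of occupied cells is bounded by the number of points, so the occupied fraction drops.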
The DFSA module is structured as follows:

The input sparse large-scale feature passes through a strided convolution block, and then average pooling and max pooling are applied along the channel dimension and the results are concatenated. The result is fed to a convolution layer followed by a sigmoid function to generate a spatial attention map. Meanwhile, several branches downsample the features to different scales and pass them through several convolution blocks to obtain dense small-scale feature maps. The smaller the scale of the feature map, the more convolution blocks are used. The dense small-scale feature maps are guided by the spatial attention map and upsampled to the output size. Finally, the feature maps of all branches are concatenated and passed through a convolution block.
The output of the SCB is the concatenation of all DFSA module outputs after each has been upsampled to the input size.
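The spatial attention path of a DFSA module can be sketched in NumPy as below. Several simplifications are assumptions made for the example: the strided convolution block before pooling is skipped, a 1x1 convolution with two scalar weights stands in for the convolution layer, upsampling is nearest-neighbour, and the attention map is applied after upsampling (the notes do not fix that order).

```python
import numpy as np

def spatial_attention(feat, w_avg, w_max, bias=0.0):
    """feat: (C, H, W). Returns an (H, W) attention map with values in (0, 1)."""
    avg_map = feat.mean(axis=0)  # average pooling along the channel dimension
    max_map = feat.max(axis=0)   # max pooling along the channel dimension
    # A 1x1 convolution over the two stacked maps reduces to a weighted sum.
    logits = w_avg * avg_map + w_max * max_map + bias
    return 1.0 / (1.0 + np.exp(-logits))

def guide_and_upsample(dense_small, attention):
    """dense_small: (C, h, w); attention: (H, W), H and W multiples of h and w."""
    fh = attention.shape[0] // dense_small.shape[1]
    fw = attention.shape[1] // dense_small.shape[2]
    up = dense_small.repeat(fh, axis=1).repeat(fw, axis=2)  # nearest-neighbour
    return up * attention[None, :, :]
```

The attention map is computed at the large scale, so it can suppress the upsampled dense features at locations where the sparse input has little evidence of an object.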
Experiments
Implementation details
Detection head: similar to CenterPoint, it uses a center heatmap head and regression heads (center location refinement, height above ground, 3D size, yaw angle, and IoU with the ground-truth bounding box). Training uses focal loss, supervised by the centers of the ground-truth objects. At inference, the outputs of the dense regression heads at the heatmap peaks are taken, and the confidence is rectified with the predicted IoU (IoU-aware confidence rectification).
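The inference step can be sketched as follows. Peak extraction uses a 3x3 local-maximum test, as is common for center-based heads, and the rectification uses the form score^(1 - alpha) * IoU^alpha. Both the formula and the alpha value are assumptions for illustration, since the notes do not state them.

```python
import numpy as np

def heatmap_peaks(heatmap, threshold):
    """Return (row, col) indices of 3x3 local maxima above the threshold."""
    h, w = heatmap.shape
    padded = np.pad(heatmap, 1, mode="constant", constant_values=-np.inf)
    # Maximum over each cell's 3x3 neighbourhood, built from nine shifted views.
    neighbourhood_max = np.max(
        [padded[dy:dy + h, dx:dx + w] for dy in range(3) for dx in range(3)],
        axis=0)
    mask = (heatmap >= neighbourhood_max) & (heatmap > threshold)
    return np.argwhere(mask)

def rectify_score(score, iou_pred, alpha=0.5):
    """IoU-aware confidence rectification (assumed form and alpha)."""
    iou = np.clip(iou_pred, 0.0, 1.0)
    return score ** (1.0 - alpha) * iou ** alpha
```

For each returned peak `(r, c)`, the box parameters would be read from the dense regression maps at the same location, for example `size_map[:, r, c]`, where `size_map` is a hypothetical name for one regression output.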
Ablation study
Impact of the main contributions: sub-pillars, positional encoding, tiny pillars, the DF branch, and the SA branch each improve the results. Detection accuracy on small objects improves substantially.
Effect of the number of sub-pillars: detection accuracy increases with the number of sub-pillars, but beyond a certain point each sub-pillar contains fewer points, feature extraction becomes harder, and accuracy on the car class drops.
Effect of DFSA module settings: the results are fairly robust to the hyperparameters of the DFSA module. More convolution blocks give a larger receptive field and better performance; a higher downsampling rate gives faster inference but lower performance.