当前位置:网站首页>Improved pillar with fine grained feature for 3D object detection paper notes
Improved pillar with fine grained feature for 3D object detection paper notes
2022-07-29 07:03:00 【byzy】
Link to the original text :https://arxiv.org/pdf/2110.06049.pdf
introduction
current 3D According to the expression of point cloud, the detection methods are mainly divided into point based 、 Voxel based and 2D Gridded . The point based method can extract the most fine-grained features , But it takes a long time ; Voxel based method due to sparse convolution , Time consuming and unstable ; be based on 2D Gridded ( Such as BEV or RV) The fastest , But the projection may lose information , Therefore, the effect may not be as good as the first two .
In this paper PointPillars On the basis of , Introduce height sensing sub cylinder (HS Cylinder ), Use highly aware location coding to get fine-grained features in the vertical direction ; Introduce a small cylinder based on sparsity (ST Cylinder ), Use sparsity based CNN The trunk ( Sparse attention by dense features /DFSA Stacked modules ) Get fine-grained features in the horizontal direction .
Method
As shown in the figure below , It consists of three parts . First, the point cloud is projected into small cylinders and sub cylinders , Get fine-grained 2D Pseudoimage . Then use include DFSA Of CNN Trunk feature extraction , The large-scale feature map contains the position information of the object , The small-scale feature map contains the shape information of the object . Last , The feature is input to the detection head to predict the size and position of the bounding box .

High perception sub cylinder
Sub cylinder : Divide each column into
Sub cylinder , Use the center of each point of the sub cylinder
And with the center of the sub cylinder
The migration
Strengthen each point , Then use two layers VFE As column feature code (PFE), Extract features from each sub cylinder . Then the features of all sub cylinders are spliced as 2D The feature of the corresponding position in the pseudo image .

Due to the concentration of high distribution , Dividing into sub cylinders will only bring small calculation time increments .
Highly aware location coding : Directly splicing each sub cylinder feature will lose the height information of the sub cylinder . Introduce height position code

And it is spliced with the characteristics of each sub cylinder , As 2D Characteristics of corresponding positions of pseudo images .
Small cylinder based on sparsity
Small cylinder : take 2D Halve the mesh size , Get finer grained features .
Based on sparsity CNN The trunk : Direct reduction 2D Grid size brings serious time-consuming increase , And the receptive field decreases .
In this paper, based on sparsity CNN The trunk (SCB), Sparse attention module by dense features (DFSA) Stack up . Because most small cylinders are empty , Use it directly CNN Is unnecessary and inefficient ; Sparse large-scale features can be used to express the distribution of objects , To predict the center of the object more accurately , At the same time, dense small-scale features are used to extract fine-grained object features , Predict more accurate object boundaries .
DFSA The modules are as follows :

The input sparse large-scale feature passes through the convolution block with step size , Then, average pooling and maximum pooling are carried out along the channel dimension and spliced . Then input to the convolution layer +sigmoid function , Generate a spatial attention map . meanwhile
Characteristics of underground sampling at different scales of branches , And through several convolution blocks , Get dense small-scale feature map . The smaller the scale of the feature map , The more volumes or blocks are used . Dense small-scale feature map is guided by spatial attention map , Upsample to output size . Finally, the characteristic graphs of all branches are spliced , adopt
Convolution block .
SCB The output of is all DFSA The result of splicing after sampling the input size on the module output .
experiment
Implementation details
Detection head : And CenterPoint similar , Use the central heat map header and regression header ( The center position is refined 、 Height above ground 、3D Size 、 Yaw angle 、 With the real bounding box IoU). Use during training focal Loss , Supervised by the center of the real object ; When inferring, find the output position of the dense regression head corresponding to the peak of the heat map and use IoU Perceived confidence correction .
Melting research
The impact of major contributions : Sub cylinder 、 Location code 、 Small cylinder 、DF Branches and SA Branches can improve the results . The detection accuracy of small objects has been greatly improved .
The influence of the number of sub columns : The detection accuracy increases with the number of sub cylinders
Increase and increase , But to a certain extent , Because the points of each sub cylinder are reduced , Feature extraction becomes difficult , The detection accuracy of automobile categories has decreased .
DFSA Influence of module settings : The experimental results are right DFSA The hyperparameters in the module are more robust . The more convolution blocks there are , The bigger the feeling field , Improved performance ; The higher the degree of down sampling , The faster the speed. , But the performance has declined .
边栏推荐
- Not so simple singleton mode
- Flink实时仓库-DWD层(kafka-关联mysql的lookup join)模板代码
- MySql基础知识(高频面试题)
- Is online legend software testing training really so black hearted? Are they all scams?
- 模拟卷Leetcode【普通】172. 阶乘后的零
- Excerpts from good essays
- 10道面试常问JVM题
- Teacher Cui Xueting's course notes on optimization theory and methods 00 are written in the front
- MySQL queries are case sensitive
- MySQL: what happens in the bufferpool when you crud? Ten pictures can make it clear
猜你喜欢

MVFuseNet:Improving End-to-End Object Detection and Motion Forecasting through Multi-View Fusion of

竣达技术 | 适用于”日月元”品牌UPS微信云监控卡

图像加噪声与矩阵求逆

Actual combat! Talk about how to solve the deep paging problem of MySQL

IDEA找不到Database解决方法

CVPR2022Oral专题系列(一):低光增强

线程 - 线程安全 - 线程优化

记 - 踩坑-实时数仓开发 - doris/pg/flink

Junda technology | applicable to "riyueyuan" brand ups wechat cloud monitoring card

二次元卡通渲染——进阶技巧
随机推荐
10 frequently asked JVM questions in interviews
【论文阅读 | cryoET】Gum-Net:快速准确的3D Subtomo图像对齐和平均的无监督几何匹配
Sword finger offer II 115: reconstruction sequence
分享一些你代码更好的小建议,流畅编码提搞效率
The core of openresty and cosocket
【冷冻电镜】Relion4.0——subtomogram教程
Unity探索地块通路设计分析 & 流程+代码具体实现
Unity免费元素特效推荐
IO流 - File - properties
王树尧老师运筹学课程笔记 02 高等数学基础
Talk about tcp/ip protocol? And the role of each layer?
ECCV 2022丨轻量级模型架ParC-Net 力压苹果MobileViT代码和论文下载
Leetcode-1331: array ordinal conversion
Excerpts from good essays
城市花样精~侬好!DESIGN#可视化电台即将开播
Apisik health check test
Simulation volume leetcode [ordinary] 172. Zero after factorial
Teacher wangshuyao's notes on operations research 05 linear programming and simplex method (concept, modeling, standard type)
JVM之垃圾回收机制(GC)
Jetpack Compose 中的键盘处理