当前位置:网站首页>Deep Hough voting for 3D object detection in point clouds
Deep Hough voting for 3D object detection in point clouds
2022-06-12 21:26:00 【Wield the sword to break the clouds-】
Deep Hough Voting for 3D Object Detection in Point Clouds
In the clouds Three dimensional target detection The depth of the Hough laws and regulations governing balloting
PS:
pointnet: Because of its maxpooling Global features from the operation , Make the classification task effective ;
For split tasks , The global features are spliced with the local features of each point cloud learned before , Re pass mlp Get the classification result of each point .
Pointnet++: Yes, before pointnet A supplementary and upgraded version of ,pointnet The ability of local feature extraction is poor , This makes it difficult to analyze complex scenes .
pointnet++ Learn from it CNN The idea of multi-layer receptive field , First, the point cloud is sampled and divided into regions , In each small area pointnet Network feature extraction , Continuous iteration .

The network structure is as follows :
1、Sample layer: It mainly samples the input points , Select several center points from these points , Utilization of FPS Farthest point sampling , Ensure that the sampling points are evenly distributed on the whole point cloud .
2、grouping layer: The point set is divided into several regions by using the center point obtained from the upper layer .
3、PointNet layer: For these points MLP Extract features and maximize pool aggregation to sample point coordinates .

stay set abstraction Inside , It uses multi-scale feature extraction to do an optimization , Combine small features with large ones ( Different radii ), Improve the generalization ability .

Optimization of split tasks , What we need to do is to make a semantic segmentation label for each point , In the network , Let's do an up sampling first , How to do it? ? This is achieved by doing an interpolation , A hierarchical propagation strategy based on distance interpolation and cross level hopping links is adopted , Among many interpolation options , We use based on k Inverse distance weighted average of the nearest neighbors ( As formula 2 , By default, we use p = 2,k = 3). It will be based on K Make a weighted average of the distance between points and the characteristics of points , After interpolation, the global feature is restored , We also need to splice these features with the previous local features , Then continue to do some feature propagation in the future , Repeat the process , Until we propagate the feature to the original point set , Then do the semantic segmentation task , The effect will be better .
VoteNet:
What do you want to do? :
It builds a as general as possible for point cloud data 3D Detection structure
Put forward the background :
3D The target of object detection is to locate and recognize 3D Objects in the scene , More specifically , In this work , Our goal is to estimate orientation 3D Bounding boxes and semantic categories of objects from point clouds .
However , Current 3D Target detection methods are subject to 2D The effect of the detector is great , Some of the 2D The detection framework extends to 3D, For example, will Faster or Mask R-CNN etc. 2D The detection framework extends to 3D, Convert irregular point cloud voxels into regular ones 3D Grid and apply 3D CNN detector , This cannot take advantage of the sparsity in the data , And because of the expensive 3D Convolution is affected by high computing cost .
Or project the point cloud data into regular 2D Aerial view image , Then apply 2D The detector locates the object . However , This sacrifices geometric details that may be crucial in a cluttered indoor environment , Image visual conversion requires additional computational overhead .
This paper introduces a point cloud centered 3D Detection framework , The framework Direct processing of raw data , And don't rely on anything in the architecture or object proposal 2D detector . Our detection network VoteNet Based on point cloud 3D The latest development of deep learning model , And it is generalized for object detection Hough Inspired by the voting process
Problems encountered :
However , Because of the sparsity of the data , There is a major challenge in predicting the bounding box parameters directly from the scene : One 3D The center of mass of an object may be far away from any surface point , Therefore, it is difficult to accurately regress in one step .
Solution :
Use Hoff to vote , First, a number of samples are taken on the input point cloud seed Point Union vote The central point of its target , In this way, you can get a lot of vote spot , And then in vote Point up bounding box The advice of , The defect of inaccuracy when the target center point is far from the surface point is well solved
Network architecture :

First , adopt pointnet++ Extract an information of point cloud in the original scene , We need to find the target object bondingbox Words , To determine the center point of an object , Because our point cloud is a representation of object surface information , The center must be additionally defined , We use the Hoff voting mechanism to pick out these candidates , Get the proposal of some central points that did not exist in the point cloud data ( It's called proposal), With these points , Just keep using pointnet++ Inside sampling and grouping Go to the farthest point to sample K Cluster centers , Divide the spherical space , utilize mlp The feature vectors representing these clusters are extracted , Then we predict a category label for these vectors , Include bondingbox Where should the box be .
To be improved ......
边栏推荐
- jsonUtils
- Simplest ALV template
- shell语言
- Composer version degradation
- Can tonghuashun open an account? Is it safe to open an account in tonghuashun
- Research Report on market supply and demand and strategy of China's hydraulic hammer industry
- 初步了解认识正则表达式(Regex)
- 初步了解認識正則錶達式(Regex)
- 金融信创爆发年!袋鼠云数栈DTinsight全线产品通过信通院信创专项测试
- 最简单ALV模板
猜你喜欢

Lombok package is successfully installed, but the runtime prompts that get, set method and constructor solution cannot be found

leetcode:210. 課程錶 II

Sorting out the knowledge points of primary and secondary indicators

风控建模十:传统建模方法存在的问题探讨及改进方法探索

Graphics2D类基本使用

My way of programming

Shell language

字符串基础知识

一级指针&二级指针知识点梳理

Vs2017 environmental issues
随机推荐
结构体知识点all in
Bluetooth for Delphi xe7
数据批量写入
测试基础之:单元测试
Can flush open an account? Can you directly open the security of securities companies on the app
一级指针&二级指针知识点梳理
Lombok package is successfully installed, but the runtime prompts that get, set method and constructor solution cannot be found
torch. clamp_ min_ method
在同花顺开户安全么,买股票怎么网上开户
【目标检测】|Dive Deeper Into Box for Object Detection 基于FCOS新训练方法
#141 Linked List Cycle
zgc的垃圾收集的主要阶段
模块八:设计消息队列存储消息数据的MySQL表
ASCII 码对照表
同花顺能开户吗,在APP上可以直接开通券商安全吗 ,证券开户怎么开户流程
pytorch transforms. Use of lambda
makefile 的ifeq,filter,strip 简单使用
JdbcTemplate插入并返回主键
Lintcode:127. Topology sorting
JS deep and shallow copy