[Point Cloud] M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
2022-07-29 22:16:00 【BIT can reach duck】
【WACV 2022】M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
Introduction to the paper:
This paper proposes a new 3D object detection architecture, M3DETR, which combines different point cloud representations (raw, voxel, bird's-eye view) with different feature scales based on a multi-scale feature pyramid. M3DETR is the first method to use Transformers to simultaneously unify multiple point cloud representations and feature scales and to model the mutual relationships between point clouds.
The authors conduct extensive ablation experiments that highlight the benefits of fusing different representations and scales and of modeling mutual relationships. The method achieves state-of-the-art performance on the KITTI 3D object detection dataset and the Waymo Open Dataset. The results show that M3DETR improves mAP by 1.48% over the baseline for all classes on the Waymo Open Dataset. It ranks first on the KITTI 3D detection benchmark for the car and cyclist classes, and first on the Waymo Open Dataset with single-frame point cloud input.
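To make the three point cloud representations named above (raw, voxel, bird's-eye view) more concrete, here is a minimal pure-Python sketch. It is not the paper's implementation: the helper names `voxelize` and `bird_eye_view`, the grid size of 1.0, and the toy coordinates are all assumptions chosen for illustration.

```python
from collections import defaultdict


def voxelize(points, voxel_size):
    """Group raw (x, y, z) points by the integer index of the voxel they fall in."""
    voxels = defaultdict(list)
    for x, y, z in points:
        key = (int(x // voxel_size), int(y // voxel_size), int(z // voxel_size))
        voxels[key].append((x, y, z))
    return dict(voxels)


def bird_eye_view(points, cell_size):
    """Project points onto the ground plane, keeping the max height per (x, y) cell."""
    bev = {}
    for x, y, z in points:
        key = (int(x // cell_size), int(y // cell_size))
        bev[key] = max(bev.get(key, z), z)
    return bev


# Raw representation: an unordered list of (x, y, z) coordinates (toy values).
raw_points = [(0.2, 0.3, 0.1), (0.4, 0.1, 0.9), (1.5, 0.2, 0.3), (1.6, 1.7, 2.2)]

# Voxel representation: points bucketed into a regular 3D grid (hypothetical size).
voxels = voxelize(raw_points, voxel_size=1.0)

# Bird's-eye-view representation: a 2D height map over the ground plane.
bev = bird_eye_view(raw_points, cell_size=1.0)
```

The same four points end up in three different data structures (a flat list, a 3D grid, a 2D map), which hints at why networks built for each representation differ in structure and why fusing them is not straightforward.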
Basic idea:
There are two key limitations of 3D object detection methods based on different networks:
- Ineffective point cloud representation: The three main techniques used to process point clouds are voxel-based, raw point cloud, and bird's-eye view. Each representation has a unique advantage, and it has been shown that combining these representations can improve detection accuracy. However, fusing these representations is not straightforward. First, the corresponding structures of the three types of neural networks are different. In addition, when