当前位置:网站首页>[Point Cloud] M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
[Point Cloud] M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
2022-07-29 22:16:00 【BIT can reach duck】
【WACV 2022】M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
Introduction to the paper:
This paper proposes a new 3D object detection architecture, M3DETR, which combines different point cloud representations (raw, voxel, bird's-eye view) with different feature scales based on a multi-scale feature pyramid.M3DETR is the first method to use Transformers to simultaneously unify multiple point cloud representations, feature scales, and model the interrelationships between point clouds.
The authors conduct extensive ablation experiments, emphasizing the benefits of fusing different representations and scales, and modeling the relationship.The method achieves state-of-the-art performance on the KITTI 3D object detection dataset and the Waymo open dataset.The results show that M3DETR significantly improves mAP by 1.48% over the baseline for all classes on the Waymo Open Dataset, and ranks first on the KITTI 3D detection benchmark for the car and bicycle classes, on the Waymo Open Dataset with a single-frame point cloud inputranked first.
Basic idea:
There are two key limitations of 3D object detection methods based on different networks:
- Invalid point cloud representation: The three main techniques used to process point clouds are voxel based, raw point cloud and bird's eye view.Each representation has a unique advantage, and it has been shown that combining these representations can improve detection accuracy.However, fusing these representations is not straightforward.First, the corresponding structures of the three types of neural networks are different.In addition, when
边栏推荐
- VR直播营销需求增加,数据模块为我们铺路
- Dry goods!Cooperative Balance in Federated Learning
- 刚重装的win7系统不能上网(深度系统安装步骤)
- SwiftUI 手势大全之可用的手势类型有哪些(教程含源码)
- Bug fix: Clipping input data to the valid range for imshow with RGB data ([0..1] for floats or [0..255]
- GET_ENTITYSET Method Implementation Guide for SAP ABAP OData Service Data Provider Class
- 02-SDRAM:自动刷新
- 转:idea中language level设置
- 官宣!苏州吴江开发区上线电子劳动合同平台
- VSCode 插件大全
猜你喜欢
随机推荐
OneNote 教程,如何在 OneNote 中做笔记?
【Verilog 设计】Verilog 实现偶数、奇数分频和任意小数分频
打破原则!MongoDB 引入 SQL?
容器网络硬核技术内幕 (小结-中)
PyQt5学习一(环境搭建)
GET_ENTITYSET Method Implementation Guide for SAP ABAP OData Service Data Provider Class
【HDLBits 刷题】Verilog Language(4)Procedures 和 More Verilog Features 部分
SwiftUI 手势大全之可用的手势类型有哪些(教程含源码)
golang文件行号探索
c#开发知识点总结
VSCode配置终端为系统命令行
容器网络硬核技术内幕 (25) 知微知彰,知柔知刚 (中)
Writing Elegant Kotlin Code: Talk About What I Think "Kotlinic"
防火墙——SNAT和DNAT策略的原理及应用、防火墙规则的备份和还原
笔记:fgets函数详解
【无标题】
The implementation of the flood control project and safety construction project in the flood storage and detention areas in Luluze and Ningjinbo was launched
华为畅享50 Pro评测:HarmonyOS加持 更流畅更安全
容器网络硬核技术内幕 (26) 知微知彰,知柔知刚 (下)
4. Implementation Guide for GET_ENTITYSET Method of SAP ABAP OData Service Data Provider Class






![[ACTF2020 Freshman Competition]Exec 1](/img/1e/a3c19d514207e6965d09c66b86e519.png)

