当前位置:网站首页>[Point Cloud] M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
[Point Cloud] M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
2022-07-29 22:16:00 【BIT can reach duck】
【WACV 2022】M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
Introduction to the paper:
This paper proposes a new 3D object detection architecture, M3DETR, which combines different point cloud representations (raw, voxel, bird's-eye view) with different feature scales based on a multi-scale feature pyramid.M3DETR is the first method to use Transformers to simultaneously unify multiple point cloud representations, feature scales, and model the interrelationships between point clouds.
The authors conduct extensive ablation experiments, emphasizing the benefits of fusing different representations and scales, and modeling the relationship.The method achieves state-of-the-art performance on the KITTI 3D object detection dataset and the Waymo open dataset.The results show that M3DETR significantly improves mAP by 1.48% over the baseline for all classes on the Waymo Open Dataset, and ranks first on the KITTI 3D detection benchmark for the car and bicycle classes, on the Waymo Open Dataset with a single-frame point cloud inputranked first.
Basic idea:
There are two key limitations of 3D object detection methods based on different networks:
- Invalid point cloud representation: The three main techniques used to process point clouds are voxel based, raw point cloud and bird's eye view.Each representation has a unique advantage, and it has been shown that combining these representations can improve detection accuracy.However, fusing these representations is not straightforward.First, the corresponding structures of the three types of neural networks are different.In addition, when
边栏推荐
- 大陆泽、宁晋泊蓄滞洪区防洪工程与安全建设项目启动实施
- 网络通信编程基础,BIO,NIO
- The implementation of the flood control project and safety construction project in the flood storage and detention areas in Luluze and Ningjinbo was launched
- 【AD】【持续更新ing】关于AD设计过程中一些小细节
- APM电机输出逻辑(Motors类详解)
- 转:idea中language level设置
- VR直播营销需求增加,数据模块为我们铺路
- bright day
- The demand for VR live broadcast marketing is increasing, and the data module is paving the way for us
- 7 行代码搞崩溃 B 站,原因令人唏嘘!
猜你喜欢

WeChat Mini Program 30 Customizing Templates and Obtaining User Login Credentials

笔记:fgets函数详解

人社部公布“数据库运行管理员”成新职业,OceanBase参与制定职业标准

全球都热炸了,谷歌服务器已经崩掉了

南华早报 | 助力亚洲最具公信力报章实现AD域自动化管理

In 2022, you still can't "low code"?Data science can also play with Low-Code!

高通WLAN框架学习(31)-- Power save

数字孪生万物可视 | 联接现实世界与数字空间

网安学习-内网渗透2

《nlp入门+实战:第七章:pytorch中数据集加载和自带数据集的使用》
随机推荐
容器网络硬核技术内幕 (23) 权利,知识与责任
SwiftUI CoreData 教程之如何加速搜索速度
[ACTF2020 新生赛]Exec 1
How to implement your personal knowledge base?
高通WLAN框架学习(31)-- Power save
银河麒麟V10 SP2 x86编译安装 PHP7.4
【593. 有效的正方形】
关于云计算的海量数据存储模型[通俗易懂]
笔记:fgets函数详解
5 V booster charge 8.4 V chip
Writing Elegant Kotlin Code: Talk About What I Think "Kotlinic"
方法的传递
Liu Genghong, boys and girls, come here!Sports data analysis and mining!(with a full set of code and data sets)
三品牌下半年将带来多款新品,东风日产将迎来“产品大潮”
解决reudx中的异步问题 applyMiddleware thunk
网络通信编程基础,BIO,NIO
The cornerstone of distributed: reliability - What a tangled web we weave
iNFTnews | 福布斯的Web3探索
APM电机输出逻辑(Motors类详解)
仿Modbus消息帧进行通信