[DETR for 3D Object Detection] DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries
2022-07-25 19:09:00 【Bit reachable duck】
DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries
Brief introduction to the paper:
This paper introduces a framework for multi-camera 3D object detection. Existing work either estimates 3D bounding boxes directly from monocular images or uses a depth prediction network to generate the input for 3D object detection from 2D information. In contrast, the method in this paper makes its predictions directly in 3D space.
DETR3D extracts 2D features from multiple camera images, then uses a sparse set of 3D object queries to index into these 2D features, linking 3D positions to the multi-view images through the camera transformation matrices. It then predicts a bounding box for each object query and uses a set-to-set loss to measure the discrepancy between the ground truth and the predictions.
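The 3D-to-2D lookup described above can be illustrated with a small sketch. It is not the authors' implementation: it assumes a single-channel feature map per camera, a 3x4 projection matrix per camera that maps a 3D point directly to pixel coordinates, and it averages the samples over the cameras that actually see the point. The names `project_point`, `bilinear_sample`, and `sample_query_feature` are hypothetical and chosen only for this example.

```python
from typing import List, Optional, Sequence, Tuple

Matrix = Sequence[Sequence[float]]  # assumed 3x4 projection matrix (intrinsics @ extrinsics)
FeatureMap = List[List[float]]      # assumed H x W single-channel feature map


def project_point(point: Sequence[float], proj: Matrix) -> Optional[Tuple[float, float]]:
    """Project a 3D point to pixel coordinates; return None if it lies behind the camera."""
    x, y, z = point
    homo = (x, y, z, 1.0)
    u, v, depth = (sum(row[i] * homo[i] for i in range(4)) for row in proj)
    if depth <= 1e-6:
        return None
    return u / depth, v / depth


def bilinear_sample(feat: FeatureMap, u: float, v: float) -> Optional[float]:
    """Bilinearly interpolate the feature map at (u, v); return None outside the map."""
    height, width = len(feat), len(feat[0])
    if not (0.0 <= u <= width - 1 and 0.0 <= v <= height - 1):
        return None
    x0, y0 = int(u), int(v)
    x1, y1 = min(x0 + 1, width - 1), min(y0 + 1, height - 1)
    ax, ay = u - x0, v - y0
    top = feat[y0][x0] * (1 - ax) + feat[y0][x1] * ax
    bottom = feat[y1][x0] * (1 - ax) + feat[y1][x1] * ax
    return top * (1 - ay) + bottom * ay


def sample_query_feature(
    point: Sequence[float], projs: Sequence[Matrix], feats: Sequence[FeatureMap]
) -> float:
    """Average the sampled feature over every camera in which the point is visible."""
    samples = []
    for proj, feat in zip(projs, feats):
        uv = project_point(point, proj)
        if uv is None:
            continue  # point is behind this camera
        value = bilinear_sample(feat, *uv)
        if value is not None:
            samples.append(value)
    # Falling back to 0.0 when no camera sees the point is an assumption of this sketch.
    return sum(samples) / len(samples) if samples else 0.0
```

In this sketch each object query carries one 3D reference point, and the value returned by `sample_query_feature` is the image evidence that would be used to refine that query. A point seen by no camera simply contributes nothing.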
This top-down approach outperforms the bottom-up alternative, in which bounding box prediction follows per-pixel depth estimation, because it does not suffer from the compounding errors introduced by a depth prediction model. In addition, the method needs no post-processing such as non-maximum suppression (NMS), which significantly improves inference speed, and it achieves state-of-the-art performance on the nuScenes autonomous driving benchmark.
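NMS can be dropped because a set-to-set loss pairs each ground-truth box with exactly one prediction, so duplicate predictions are penalized during training. The sketch below illustrates that one-to-one matching under simplifying assumptions: the cost is only an L1 distance between box centers (a real loss would also include a classification term), and the optimal assignment is found by brute force over permutations rather than by the Hungarian algorithm normally used for this step. The names `l1_cost` and `match_predictions` are hypothetical.

```python
from itertools import permutations
from typing import List, Sequence, Tuple


def l1_cost(pred: Sequence[float], gt: Sequence[float]) -> float:
    """L1 distance between a predicted and a ground-truth box center."""
    return sum(abs(p - g) for p, g in zip(pred, gt))


def match_predictions(
    preds: Sequence[Sequence[float]], gts: Sequence[Sequence[float]]
) -> Tuple[List[Tuple[int, int]], float]:
    """Return the one-to-one (prediction, ground truth) pairs with minimal total cost.

    Brute force for illustration only; assumes len(preds) >= len(gts).
    """
    best_pairs: List[Tuple[int, int]] = []
    best_cost = float("inf")
    # Try every way of assigning a distinct prediction to each ground truth.
    for pred_ids in permutations(range(len(preds)), len(gts)):
        cost = sum(l1_cost(preds[p], gts[g]) for g, p in enumerate(pred_ids))
        if cost < best_cost:
            best_cost = cost
            best_pairs = [(p, g) for g, p in enumerate(pred_ids)]
    return best_pairs, best_cost


if __name__ == "__main__":
    predictions = [(0.0, 0.0, 0.0), (5.0, 5.0, 0.0), (9.0, 9.0, 0.0)]
    ground_truths = [(5.0, 5.0, 1.0), (0.0, 1.0, 0.0)]
    pairs, total = match_predictions(predictions, ground_truths)
    print(pairs, total)  # prediction 2 stays unmatched and would be trained as "no object"
```

Predictions left unmatched are treated as background, which is what pushes the model to emit one box per object instead of many overlapping ones.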
Contributions of the paper:
- Proposes a 3D object detection model based on RGB images. Unlike existing work, DETR3D at the last stage