当前位置:网站首页>Detr introduction
Detr introduction
2022-07-07 13:27:00 【Name of algorithm】
DETR yes facebook Published in ECCV2020 Use Transformers A framework for end-to-end target detection .

DETR Just use CNN Extraction of image features , And then use it alone Transformer You can predict the target bounding box and classification . It does not require non maximum suppression , Don't need to, Anchor Mechanism .

Above, DETR The network architecture of ,DETR Use CNN Extraction of image features , And then use it alone Transformer Get the predicted target bounding box , Bounding box and ground truth As a geometric prediction problem . It's a binary match (bipartite matching), There is no matching object homing no object This kind of .

The above figure is a more detailed description DETR Network structure , The image passes by CNN Get the feature , Plus the location code (poositioonal encoding), Then flatten and feed into transformer encoder,encoder The output of is sent to transformer decoder, stay decoder There is also object queries The input of ,decoder The output of is sent to the prediction head (prediction heads), There is a feedforward neural network in the prediction head FFN Predict object categories and bounding boxes .

Above, DETR in Transformer Specific architecture , It has Encoder and Decoder Two parts ,Encoder The input is CNN Extracted image features plus position coding , Send it to the multi head self attention module , Then it is sent to the feedforward neural network module . In this way Encoder There can be multiple layers , Then send it to Decoder,Decoder Yes Object queries, Is a learnable location embedded as input , After the multi head self attention module , after Encoder and Decoder Multi head mutual attention module , Then it is sent to the feedforward neural network for processing .Decoder Layers can also stack multiple , Finally, it is sent to the feedforward neural network FFN Carry out object category prediction and boundary box prediction .
边栏推荐
- Shell batch file name (excluding extension) lowercase to uppercase
- Signal strength (RSSI) knowledge sorting
- PAcP learning note 3: pcap method description
- High end for 8 years, how is Yadi now?
- RecyclerView的数据刷新
- 记一次 .NET 某新能源系统 线程疯涨 分析
- [learning notes] segment tree selection
- 10 张图打开 CPU 缓存一致性的大门
- 详细介绍六种开源协议(程序员须知)
- Initialization script
猜你喜欢

LIS 最长上升子序列问题(动态规划、贪心+二分)

飞桨EasyDL实操范例:工业零件划痕自动识别

数字ic设计——SPI

Isprs2021/ remote sensing image cloud detection: a geographic information driven method and a new large-scale remote sensing cloud / snow detection data set

cmake 学习使用笔记(一)

Milkdown 控件图标

【Presto Profile系列】Timeline使用

MongoDB内部的存储原理

靠卖概念上市,认养一头牛能走多远?
![[dark horse morning post] Huawei refutes rumors about](/img/d7/4671b5a74317a8f87ffd36be2b34e1.jpg)
[dark horse morning post] Huawei refutes rumors about "military master" Chen Chunhua; Hengchi 5 has a pre-sale price of 179000 yuan; Jay Chou's new album MV has played more than 100 million in 3 hours
随机推荐
【黑马早报】华为辟谣“军师”陈春花;恒驰5预售价17.9万元;周杰伦新专辑MV 3小时播放量破亿;法华寺回应万元月薪招人...
Practical example of propeller easydl: automatic scratch recognition of industrial parts
Users, groups, and permissions
About how appium closes apps (resolved)
PACP学习笔记三:PCAP方法说明
QQ medicine, Tencent ticket
The difference between cache and buffer
Vscade editor esp32 header file wavy line does not jump completely solved
Final review notes of single chip microcomputer principle
Scripy tutorial classic practice [New Concept English]
Simple and easy-to-use code specification
[Presto profile series] timeline use
php——laravel缓存cache
PACP学习笔记一:使用 PCAP 编程
Isprs2021/ remote sensing image cloud detection: a geographic information driven method and a new large-scale remote sensing cloud / snow detection data set
Mongodb command summary
Write it down once Net a new energy system thread surge analysis
JNA learning notes 1: Concepts
JS function 返回多个值
日本政企员工喝醉丢失46万信息U盘,公开道歉又透露密码规则