当前位置:网站首页>Detr introduction
Detr introduction
2022-07-07 13:27:00 【Name of algorithm】
DETR yes facebook Published in ECCV2020 Use Transformers A framework for end-to-end target detection .
DETR Just use CNN Extraction of image features , And then use it alone Transformer You can predict the target bounding box and classification . It does not require non maximum suppression , Don't need to, Anchor Mechanism .
Above, DETR The network architecture of ,DETR Use CNN Extraction of image features , And then use it alone Transformer Get the predicted target bounding box , Bounding box and ground truth As a geometric prediction problem . It's a binary match (bipartite matching), There is no matching object homing no object This kind of .
The above figure is a more detailed description DETR Network structure , The image passes by CNN Get the feature , Plus the location code (poositioonal encoding), Then flatten and feed into transformer encoder,encoder The output of is sent to transformer decoder, stay decoder There is also object queries The input of ,decoder The output of is sent to the prediction head (prediction heads), There is a feedforward neural network in the prediction head FFN Predict object categories and bounding boxes .
Above, DETR in Transformer Specific architecture , It has Encoder and Decoder Two parts ,Encoder The input is CNN Extracted image features plus position coding , Send it to the multi head self attention module , Then it is sent to the feedforward neural network module . In this way Encoder There can be multiple layers , Then send it to Decoder,Decoder Yes Object queries, Is a learnable location embedded as input , After the multi head self attention module , after Encoder and Decoder Multi head mutual attention module , Then it is sent to the feedforward neural network for processing .Decoder Layers can also stack multiple , Finally, it is sent to the feedforward neural network FFN Carry out object category prediction and boundary box prediction .
边栏推荐
- Coscon'22 community convening order is coming! Open the world, invite all communities to embrace open source and open a new world~
- Simple and easy-to-use code specification
- PACP学习笔记一:使用 PCAP 编程
- ESP32系列专栏
- 一文读懂数仓中的pg_stat
- 滑轨步进电机调试(全国海洋航行器大赛)(STM32主控)
- [QNX Hypervisor 2.2用户手册]6.3.4 虚拟寄存器(guest_shm.h)
- Storage principle inside mongodb
- Go language learning notes - structure
- Introduction and basic use of stored procedures
猜你喜欢
日本政企员工喝醉丢失46万信息U盘,公开道歉又透露密码规则
QQ medicine, Tencent ticket
自定义线程池拒绝策略
Esp32 ① compilation environment
Go language learning notes - structure
Cloud detection 2020: self attention generation countermeasure network for cloud detection in high-resolution remote sensing images
Scripy tutorial classic practice [New Concept English]
Differences between MySQL storage engine MyISAM and InnoDB
MySQL master-slave replication
Japanese government and enterprise employees got drunk and lost 460000 information USB flash drives. They publicly apologized and disclosed password rules
随机推荐
单片机原理期末复习笔记
MongoDB复制(副本集)总结
PAcP learning note 1: programming with pcap
如何让electorn打开的新窗口在window任务栏上面
Shell batch file name (excluding extension) lowercase to uppercase
[untitled]
Isprs2021/ remote sensing image cloud detection: a geographic information driven method and a new large-scale remote sensing cloud / snow detection data set
【学习笔记】zkw 线段树
为租客提供帮助
滑轨步进电机调试(全国海洋航行器大赛)(STM32主控)
Practical case: using MYCAT to realize read-write separation of MySQL
[untitled]
Clion mingw64 Chinese garbled code
飞桨EasyDL实操范例:工业零件划痕自动识别
How to make the new window opened by electorn on the window taskbar
[dark horse morning post] Huawei refutes rumors about "military master" Chen Chunhua; Hengchi 5 has a pre-sale price of 179000 yuan; Jay Chou's new album MV has played more than 100 million in 3 hours
LeetCode_ Binary search_ Medium_ 153. Find the minimum value in the rotation sort array
Per capita Swiss number series, Swiss number 4 generation JS reverse analysis
Why can basic data types call methods in JS
LIS longest ascending subsequence problem (dynamic programming, greed + dichotomy)