DETR introduction
2022-07-07 13:27:00 【Name of algorithm】
DETR is an end-to-end object detection framework built on Transformers, published by Facebook at ECCV 2020.

DETR uses a CNN only to extract image features; a Transformer alone then predicts the bounding box and class of each object. It requires neither non-maximum suppression (NMS) nor an anchor mechanism.
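To illustrate why no NMS step is needed, here is a minimal sketch of reading detections straight from the per-query outputs. It assumes the last class index stands for "no object"; the function name `read_out`, the threshold, and the toy logits are hypothetical and not taken from the DETR code.

```python
from math import exp

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def read_out(class_logits, boxes, threshold=0.5):
    """Keep a query's prediction unless its best class is 'no object'."""
    detections = []
    for logits, box in zip(class_logits, boxes):
        probs = softmax(logits)
        no_object = len(probs) - 1  # assumption: last index is the "no object" class
        label = max(range(len(probs)), key=probs.__getitem__)
        if label != no_object and probs[label] >= threshold:
            detections.append((label, probs[label], box))
    return detections

# Toy outputs for three object queries: two real classes plus "no object".
class_logits = [
    [4.0, 0.0, 0.0],  # query 0: class 0
    [0.0, 0.0, 5.0],  # query 1: no object
    [0.0, 3.0, 0.0],  # query 2: class 1
]
boxes = [(0.5, 0.5, 0.2, 0.2), (0.1, 0.1, 0.1, 0.1), (0.7, 0.3, 0.4, 0.2)]
detections = read_out(class_logits, boxes)
```

Each query contributes at most one detection, so there is no pile of overlapping candidate boxes to prune afterwards.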

The figure above shows the DETR network architecture. DETR extracts image features with a CNN and then uses the Transformer alone to produce the predicted bounding boxes. Relating the predicted boxes to the ground truth is treated as a set prediction problem and solved by bipartite matching; predictions that match no object are assigned to the "no object" class.
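The matching step can be sketched as follows. This is a simplified illustration, not the DETR implementation: the cost here is only the negative class probability plus an L1 box distance, and the optimal assignment is found by brute force over permutations, which is only practical for tiny inputs (an efficient assignment solver such as the Hungarian algorithm would be used at real scale). All names and numbers are hypothetical.

```python
from itertools import permutations

NO_OBJECT = None  # marker for predictions left without a ground-truth partner

def l1(box_a, box_b):
    """L1 distance between two boxes given as equal-length tuples."""
    return sum(abs(a - b) for a, b in zip(box_a, box_b))

def match(pred_probs, pred_boxes, gt_labels, gt_boxes):
    """For each prediction, return the index of its ground-truth object or NO_OBJECT.

    Assumes there are at least as many predictions as ground-truth objects.
    """
    n_pred, n_gt = len(pred_boxes), len(gt_boxes)
    best_cost, best_assign = float("inf"), None
    # Try every way of giving each ground-truth object a distinct prediction.
    for chosen in permutations(range(n_pred), n_gt):
        cost = sum(
            -pred_probs[p][gt_labels[g]] + l1(pred_boxes[p], gt_boxes[g])
            for g, p in enumerate(chosen)
        )
        if cost < best_cost:
            best_cost, best_assign = cost, chosen
    assignment = [NO_OBJECT] * n_pred
    for g, p in enumerate(best_assign):
        assignment[p] = g
    return assignment

# Three predictions, two ground-truth objects (classes 0 and 1).
pred_probs = [[0.1, 0.8, 0.1], [0.7, 0.2, 0.1], [0.2, 0.2, 0.6]]
pred_boxes = [(0.7, 0.3, 0.4, 0.2), (0.5, 0.5, 0.2, 0.2), (0.1, 0.1, 0.1, 0.1)]
gt_labels = [0, 1]
gt_boxes = [(0.5, 0.5, 0.2, 0.2), (0.7, 0.3, 0.4, 0.2)]
assignment = match(pred_probs, pred_boxes, gt_labels, gt_boxes)
```

In this toy case prediction 0 is paired with ground-truth object 1, prediction 1 with object 0, and prediction 2 is left as "no object".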

The figure above describes the DETR network structure in more detail. The image passes through the CNN to obtain features, the positional encoding is added, and the result is flattened and fed into the transformer encoder. The encoder output is sent to the transformer decoder, which also takes the object queries as input. The decoder output is sent to the prediction heads, where a feed-forward network (FFN) predicts the object classes and bounding boxes.
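As a rough sketch of the "add positional encoding, then flatten" step, the snippet below turns an H x W feature map into a sequence of H*W tokens. It assumes a sinusoidal encoding in which half of the channels encode the row and half the column; the exact channel layout used by DETR may differ, and the function names are hypothetical.

```python
from math import sin, cos

def positional_encoding_1d(pos, dim):
    """Sinusoidal encoding of one coordinate into `dim` values (dim must be even)."""
    enc = []
    for i in range(0, dim, 2):
        freq = 1.0 / (10000 ** (i / dim))
        enc += [sin(pos * freq), cos(pos * freq)]
    return enc

def flatten_with_positions(feature_map):
    """feature_map[y][x] is a list of C channel values (C divisible by 4).

    Returns a list of H*W tokens, each a C-vector with position information added.
    """
    channels = len(feature_map[0][0])
    half = channels // 2
    sequence = []
    for y, row in enumerate(feature_map):
        for x, feat in enumerate(row):
            # First half of the channels encodes the row, second half the column.
            pos = positional_encoding_1d(y, half) + positional_encoding_1d(x, half)
            sequence.append([f + p for f, p in zip(feat, pos)])
    return sequence

# A 2 x 3 feature map with 4 channels, all zeros so only the encoding is visible.
feature_map = [[[0.0] * 4 for _ in range(3)] for _ in range(2)]
tokens = flatten_with_positions(feature_map)
```

The result is a sequence of 6 tokens with 4 channels each, which is the shape a transformer encoder expects: one vector per spatial location.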

The figure above shows the specific architecture of the Transformer in DETR, which has two parts: an encoder and a decoder. The encoder input is the image features extracted by the CNN plus the positional encoding; it passes through a multi-head self-attention module and then a feed-forward network module. The encoder can stack several such layers, and its output is sent to the decoder. The decoder takes the object queries, which are learnable positional embeddings, as input. They pass through a multi-head self-attention module, then a multi-head encoder-decoder (cross) attention module, and then a feed-forward network. Decoder layers can also be stacked. Finally, the output is sent to the feed-forward network (FFN) for object class prediction and bounding box prediction.
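The encoder-decoder attention step can be illustrated with a minimal single-head sketch in which each object query attends over the encoder output ("memory"). This is a simplification under stated assumptions: it omits the learned query/key/value projections, the multiple heads, residual connections, and normalization that a real Transformer layer has, and the names are hypothetical.

```python
from math import exp, sqrt

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention(queries, memory):
    """Single-head scaled dot-product attention without learned projections.

    queries: list of d-vectors (one per object query)
    memory:  list of d-vectors (encoder output, one per image location)
    """
    d = len(memory[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every memory location, scaled by sqrt(d).
        scores = [sum(a * b for a, b in zip(q, k)) / sqrt(d) for k in memory]
        weights = softmax(scores)
        # Weighted average of the memory vectors.
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, memory)) for j in range(d)]
        )
    return outputs

memory = [[1.0, 0.0], [0.0, 1.0]]                 # two image locations
queries = [[10.0, 0.0], [0.0, 10.0], [0.0, 0.0]]  # three object queries
attended = cross_attention(queries, memory)
```

The first two queries each pull almost entirely from the memory location they align with, while the all-zero query averages both locations equally. Each query produces exactly one output vector, which is what the prediction FFN later turns into one class and one box.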