当前位置:网站首页>Detr introduction
Detr introduction
2022-07-07 13:27:00 【Name of algorithm】
DETR yes facebook Published in ECCV2020 Use Transformers A framework for end-to-end target detection .
DETR Just use CNN Extraction of image features , And then use it alone Transformer You can predict the target bounding box and classification . It does not require non maximum suppression , Don't need to, Anchor Mechanism .
Above, DETR The network architecture of ,DETR Use CNN Extraction of image features , And then use it alone Transformer Get the predicted target bounding box , Bounding box and ground truth As a geometric prediction problem . It's a binary match (bipartite matching), There is no matching object homing no object This kind of .
The above figure is a more detailed description DETR Network structure , The image passes by CNN Get the feature , Plus the location code (poositioonal encoding), Then flatten and feed into transformer encoder,encoder The output of is sent to transformer decoder, stay decoder There is also object queries The input of ,decoder The output of is sent to the prediction head (prediction heads), There is a feedforward neural network in the prediction head FFN Predict object categories and bounding boxes .
Above, DETR in Transformer Specific architecture , It has Encoder and Decoder Two parts ,Encoder The input is CNN Extracted image features plus position coding , Send it to the multi head self attention module , Then it is sent to the feedforward neural network module . In this way Encoder There can be multiple layers , Then send it to Decoder,Decoder Yes Object queries, Is a learnable location embedded as input , After the multi head self attention module , after Encoder and Decoder Multi head mutual attention module , Then it is sent to the feedforward neural network for processing .Decoder Layers can also stack multiple , Finally, it is sent to the feedforward neural network FFN Carry out object category prediction and boundary box prediction .
边栏推荐
- 【等保】云计算安全扩展要求关注的安全目标和实现方式区分原则有哪些?
- JS判断一个对象是否为空
- LeetCode_二分搜索_中等_153.寻找旋转排序数组中的最小值
- PCAP学习笔记二:pcap4j源码笔记
- PAcP learning note 1: programming with pcap
- 记一次 .NET 某新能源系统 线程疯涨 分析
- Per capita Swiss number series, Swiss number 4 generation JS reverse analysis
- Cloud detection 2020: self attention generation countermeasure network for cloud detection in high-resolution remote sensing images
- JS function returns multiple values
- 提升树莓派性能的方法
猜你喜欢
【Presto Profile系列】Timeline使用
Milkdown 控件图标
基于鲲鹏原生安全,打造安全可信的计算平台
Cloud detection 2020: self attention generation countermeasure network for cloud detection in high-resolution remote sensing images
我那“不好惹”的00后下属:不差钱,怼领导,抵制加班
Isprs2021/ remote sensing image cloud detection: a geographic information driven method and a new large-scale remote sensing cloud / snow detection data set
Cmake learning and use notes (1)
TPG x AIDU|AI领军人才招募计划进行中!
About the problem of APP flash back after appium starts the app - (solved)
About how appium closes apps (resolved)
随机推荐
Vscode编辑器ESP32头文件波浪线不跳转彻底解决
DrawerLayout禁止侧滑显示
OSI seven layer model
1、深拷贝 2、call apply bind 3、for of for in 区别
Flink | 多流转换
日本政企员工喝醉丢失46万信息U盘,公开道歉又透露密码规则
Milkdown 控件图标
Pay close attention to the work of safety production and make every effort to ensure the safety of people's lives and property
高端了8年,雅迪如今怎么样?
Common text processing tools
centso7 openssl 报错Verify return code: 20 (unable to get local issuer certificate)
共创软硬件协同生态:Graphcore IPU与百度飞桨的“联合提交”亮相MLPerf
滑轨步进电机调试(全国海洋航行器大赛)(STM32主控)
Final review notes of single chip microcomputer principle
Initialization script
Coscon'22 community convening order is coming! Open the world, invite all communities to embrace open source and open a new world~
JS function 返回多个值
Isprs2021/ remote sensing image cloud detection: a geographic information driven method and a new large-scale remote sensing cloud / snow detection data set
记一次 .NET 某新能源系统 线程疯涨 分析
一文读懂数仓中的pg_stat