14. Transformer: ViT, TNT, DETR
2022-07-05 20:18:00 【C--G】
ViT: Vision Transformer


ViT architecture diagram



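ViT targets image classification and uses only the Transformer encoder. In the diagram, the image is split into nine patches; each patch is flattened into a one-dimensional vector, linearly projected, and given a position embedding before it enters the encoder. A learnable class token (token 0) is prepended, so the encoder receives 10 input tokens: token 0 plus the nine patch tokens. Through self-attention, token 0 interacts with the nine patch tokens and aggregates their feature information, so only the output of token 0 is needed. It is passed to the MLP Head, which produces the classification.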
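The token construction described above can be sketched in plain Python. This is a toy illustration, not the code from the linked repository: the 6x6 single-channel image, the embedding width of 4, the random weights, and the helper names `patchify`, `linear`, and `build_tokens` are all assumptions made for the example.

```python
import random

def patchify(image, patch):
    """Split an H x W grid (list of rows) into flattened patches, row-major."""
    h, w = len(image), len(image[0])
    patches = []
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            patches.append([image[r][c]
                            for r in range(top, top + patch)
                            for c in range(left, left + patch)])
    return patches

def linear(vec, weight):
    """Project a vector of length d_in with a d_in x d_out weight matrix."""
    return [sum(v * weight[i][j] for i, v in enumerate(vec))
            for j in range(len(weight[0]))]

def build_tokens(image, patch, dim, seed=0):
    """Return the encoder input: class token + projected patches, plus positions."""
    rng = random.Random(seed)
    patches = patchify(image, patch)
    d_in = patch * patch
    # Random stand-ins for parameters that would be learned in a real model.
    weight = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(d_in)]
    cls_token = [rng.uniform(-1, 1) for _ in range(dim)]
    tokens = [cls_token] + [linear(p, weight) for p in patches]
    pos = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in tokens]
    # Position embeddings are added element-wise to every token, including token 0.
    return [[t + p for t, p in zip(tok, pe)] for tok, pe in zip(tokens, pos)]

# 6x6 toy image with 2x2 patches gives a 3x3 grid, i.e. nine patches.
image = [[r * 6 + c for c in range(6)] for r in range(6)]
tokens = build_tokens(image, patch=2, dim=4)
print(len(tokens), len(tokens[0]))  # 10 4
```

The sequence length is 10 rather than 9 because of the prepended class token; after the encoder, only `tokens[0]` would be fed to the classification head.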
- Problems with CNNs

- Advantages of the transformer

- Formulas

- ViT's scope

- Position encoding

- Results analysis

- Code link
https://github.com/WZMIAOMIAO/deep-learning-for-image-processing/tree/master/pytorch_classification/vision_transformer
TNT: Transformer in Transformer

- Basic components

- Sequence construction

- Basic computation

- Position encoding

- PatchEmbedding visualization

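DETR
Object detection

Basic idea

DETR predicts 100 bounding boxes in parallel; a box that contains no object is classified as background.
Network architecture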

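A CNN backbone extracts a feature map, which is flattened into a one-dimensional sequence, and a positional encoding supplies the position information. Unlike ViT, DETR has no token 0 (class token). Unlike a conventional Transformer decoder, which generates outputs one at a time, the DETR decoder takes a fixed set of object queries and produces all bounding boxes at once: each query attends to the encoder output in parallel, and the prediction heads then decide whether each resulting box is an object or background.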
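As a rough sketch of the last step, the snippet below filters a fixed set of per-query predictions and keeps only those whose most likely class is not "no object". It is a simplification rather than DETR's actual post-processing: the label set `CLASSES`, the threshold of 0.5, the helper `keep_detections`, and the use of four query slots instead of 100 are all assumptions made for readability.

```python
import math

# Hypothetical label set; the last entry plays the role of background.
CLASSES = ["cat", "dog", "no object"]

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def keep_detections(pred_logits, pred_boxes, threshold=0.5):
    """Return (label, score, box) for queries whose best class is a real object."""
    kept = []
    for logits, box in zip(pred_logits, pred_boxes):
        probs = softmax(logits)
        best = max(range(len(probs)), key=probs.__getitem__)
        if CLASSES[best] != "no object" and probs[best] >= threshold:
            kept.append((CLASSES[best], probs[best], box))
    return kept

# One row of class logits and one (cx, cy, w, h) box per object query.
pred_logits = [[4.0, 0.0, 0.0], [0.0, 0.0, 5.0], [0.0, 3.0, 0.5], [0.1, 0.0, 2.0]]
pred_boxes = [(0.5, 0.5, 0.2, 0.2), (0.1, 0.1, 0.1, 0.1),
              (0.3, 0.7, 0.4, 0.2), (0.9, 0.9, 0.1, 0.1)]

detections = keep_detections(pred_logits, pred_boxes)
print([label for label, _, _ in detections])  # ['cat', 'dog']
```

Every query always yields a prediction, so the number of outputs is fixed; the "no object" class is what lets the model report fewer detections than it has queries.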
The encoder's role

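The encoder's attention output highlights objects better than the CNN feature map does, which helps the decoder identify objects quickly. As the figure shows, the encoder still recognizes objects well even when they are occluded.
Network architecture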

Output matching
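During training, DETR assigns each ground-truth box to a distinct prediction by bipartite matching, and predictions left unmatched are trained toward "no object". The sketch below illustrates that one-to-one assignment on a toy example. It swaps in a brute-force search over permutations for the Hungarian algorithm and uses a simplified cost (L1 box distance minus the predicted probability of the true class); the function names and all numbers are assumptions for illustration.

```python
from itertools import permutations

def l1(box_a, box_b):
    """L1 distance between two (cx, cy, w, h) boxes."""
    return sum(abs(a - b) for a, b in zip(box_a, box_b))

def match(pred_boxes, pred_probs, gt_boxes, gt_labels):
    """Map each ground-truth index to a distinct prediction index at minimum total cost."""
    def cost(p, g):
        # A low box distance and a high probability for the true class give a low cost.
        return l1(pred_boxes[p], gt_boxes[g]) - pred_probs[p][gt_labels[g]]

    best_total, best = None, None
    # Brute force: try every way of picking one distinct prediction per ground truth.
    for perm in permutations(range(len(pred_boxes)), len(gt_boxes)):
        total = sum(cost(p, g) for g, p in enumerate(perm))
        if best_total is None or total < best_total:
            best_total, best = total, perm
    return {g: p for g, p in enumerate(best)}

pred_boxes = [(0.5, 0.5, 0.2, 0.2), (0.1, 0.1, 0.1, 0.1), (0.3, 0.7, 0.4, 0.2)]
pred_probs = [[0.9, 0.05, 0.05], [0.1, 0.1, 0.8], [0.1, 0.85, 0.05]]
gt_boxes = [(0.32, 0.68, 0.4, 0.25), (0.5, 0.52, 0.2, 0.2)]
gt_labels = [1, 0]

assignment = match(pred_boxes, pred_probs, gt_boxes, gt_labels)
print(assignment)  # {0: 2, 1: 0}
```

Prediction 1 is matched to nothing here, so its training target would be the background class. Brute force is only practical for tiny examples; a polynomial-time assignment solver is needed once there are 100 queries.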

The role of attention


Source material: an expert's blog post
https://blog.csdn.net/qq_37774399/article/details/121748163