当前位置:网站首页>14、Transformer--VIT TNT BETR
14、Transformer--VIT TNT BETR
2022-07-05 20:18:00 【C--G】
VIT–Vision Transformer
VIT架构图
VIT为图像分类任务,这里使用了transformer的编码器,将图片分为九块,加上位置编码后并转化为一维再放入编码器,编码器此时有9个输入token,其中0号token与其他9位token进行了交互计算,融合了其他9位token的特征信息,因此只需要0号token即可,后面就是MLP Head和分类即可
- CNN的问题
- transformer优势
- 公式
- VIT格局
- 位置编码
- 效果分析
- 代码链接
https://github.com/WZMIAOMIAO/deep-learning-for-image-processing/tree/master/pytorch_classification/vision_transformer
TNT-Transformer in Transformer
- 基本组成
- 序列构建
- 基本计算
- 位置编码
- PatchEmbedding可视化
BETR
目标检测
基本思想
并行预测100个坐标框,没有物体,那就是背景网络架构
cnn获取一维特征图,positional encoding获取位置编码,与VIT不同,BETR没有0号token,与传统Transformer Decoder不同,BETR是由object queries一次产生多少坐标框,每个框并行与encoder输出进行匹配,再通过prediction heads判断是否是目标框
Encoder的任务
encoder提供目标的注意力结果优于cnn的特征图结果,有利于解码器快速识别目标,如图所示,encoder在有遮挡情况下也能很好的识别物体网络架构
输出匹配
注意力的作用
数据资源–大佬的博客
https://blog.csdn.net/qq_37774399/article/details/121748163
边栏推荐
- Cocos2d-x项目总结中的一些遇到的问题
- Leetcode brush questions: binary tree 18 (largest binary tree)
- 银河证券在网上开户安全吗?
- A solution to PHP's inability to convert strings into JSON
- Leetcode brush question: binary tree 14 (sum of left leaves)
- Zero cloud new UI design
- Minimum commission for stock trading account opening, where to open an account with low commission? Is it safe to open an account on your mobile phone
- Based on vs2017 and cmake GUI configuration, zxing and opencv are used in win10 x64 environment, and simple detection of data matrix code is realized
- 2023年深圳市绿色低碳产业扶持计划申报指南
- Leetcode(695)——岛屿的最大面积
猜你喜欢
Autumn byte interviewer asked you any questions? In fact, you have stepped on thunder
PyTorch 1.12发布,正式支持苹果M1芯片GPU加速,修复众多Bug
【数字IC验证快速入门】6、Questasim 快速上手使用(以全加器设计与验证为例)
Convolution free backbone network: Pyramid transformer to improve the accuracy of target detection / segmentation and other tasks (with source code)
kubernetes资源对象介绍及常用命令(五)-(ConfigMap&Secret)
.Net分布式事務及落地解决方案
CADD课程学习(7)-- 模拟靶点和小分子相互作用 (半柔性对接 AutoDock)
计算lnx的一种方式
. Net distributed transaction and landing solution
leetcode刷题:二叉树14(左叶子之和)
随机推荐
618 "low key" curtain call, how can baiqiushangmei join hands with the brand to cross the "uncertain era"?
After 95, Alibaba P7 published the payroll: it's really fragrant to make up this
Leetcode: binary tree 15 (find the value in the lower left corner of the tree)
【数字IC验证快速入门】3、数字IC设计全流程介绍
C - sequential structure
本季度干货导航 | 2022年Q2
mongodb文档间关系
[quick start to digital IC Verification] 8. Typical circuits in digital ICs and their corresponding Verilog description methods
Wechat applet regular expression extraction link
Codeforces Round #804 (Div. 2) - A, B, C
Debezium series: modify the source code to support drop foreign key if exists FK
sort和投影
解决php无法将string转换为json的办法
中金财富在网上开户安全吗?
Database logic processing function
淺淺的談一下ThreadLocalInsecureRandom
Four methods of random number generation | random | math | threadlocalrandom | securityrandom
Unity编辑器扩展 UI控件篇
IC科普文:ECO的那些事儿
Schema和Model