当前位置:网站首页>Vit (vision transformer) principle and code elaboration
Vit (vision transformer) principle and code elaboration
2022-07-06 07:39:00 【bai666】
Course link : https://edu.51cto.com/course/30169.html
Transformer In many NLP( natural language processing ) In the mission SOTA The results of . ViT (Vision Transformer) yes Transformer be applied to CV( Computer vision ) Milestone work in the field , Later, more variants have been developed , Such as Swin Transformer.
ViT (Vision Transformer) Model published in paper An Image is Worth 16X16 Words: Transformer For Image Recognition At Scale, Use pure Transformer Image classification .ViT stay JFT-300M After pre training on the dataset , It can exceed convolutional neural network ResNet Performance of , And the training computing resources used can be less .
This course is right ViT Principle and PyTorch The implementation code is refined , To help you master its detailed principle and specific implementation . The code implementation includes two code implementation methods , One is to adopt timm library , The other is to adopt einops/einsum.
The principle part includes :Transformer An overview of the architecture 、Transformer Of Encoder 、Transformer Of Decoder、ViT Architecture Overview 、ViT The model, 、ViT Performance and analysis .
The refined part of the code uses Jupyter Notebook Yes ViT Of PyTorch Read the code line by line , Include : install PyTorch、ViT Of timm Library implementation code interpretation 、 einops/einsum 、ViT Of einops/einsum Implement code interpretation .
边栏推荐
- [computer skills]
- 洛谷P1836 数页码 题解
- Typescript variable scope
- Methods for JS object to obtain attributes (. And [] methods)
- After the hot update of uniapp, "mismatched versions may cause application exceptions" causes and Solutions
- Solution: système de surveillance vidéo intelligent de patrouille sur le chantier
- Typescript function definition
- 杰理之普通透传测试---做数传搭配 APP 通信【篇】
- Opencv learning notes 9 -- background modeling + optical flow estimation
- C # create database connection object SQLite database
猜你喜欢

Do you really think binary search is easy

Linked list interview questions (Graphic explanation)

Significance and measures of encryption protection for intelligent terminal equipment

Simulation of Michelson interferometer based on MATLAB

Redis builds clusters
![[computer skills]](/img/30/2a4506adf72eb4cb188dd64cce417d.jpg)
[computer skills]
![[MySQL learning notes 30] lock (non tutorial)](/img/9b/1e27575d83ff40bebde118b925f609.png)
[MySQL learning notes 30] lock (non tutorial)

Mise en œuvre du langage leecode - C - 15. Somme des trois chiffres - - - - - idées à améliorer

杰理之如若需要大包发送,需要手机端修改 MTU【篇】

opencv学习笔记八--答题卡识别
随机推荐
P3047 [USACO12FEB]Nearby Cows G(树形dp)
Le chemin du navigateur Edge obtient
Ble of Jerry [chapter]
TypeScript接口与泛型的使用
Solution: système de surveillance vidéo intelligent de patrouille sur le chantier
JMeter performance test steps practical tutorial
杰理之需要修改 gatt 的 profile 定义【篇】
Full Score composition generator: living on code
超级浏览器是指纹浏览器吗?怎样选择一款好的超级浏览器?
word中把帶有某個符號的行全部選中,更改為標題
剪映的相关介绍
Position() function in XPath uses
Basics of reptile - Scratch reptile
TypeScript 可索引类型
How MySQL merges data
The ECU of 21 Audi q5l 45tfsi brushes is upgraded to master special adjustment, and the horsepower is safely and stably increased to 305 horsepower
Related operations of Excel
word怎么只删除英语保留汉语或删除汉语保留英文
Wonderful use of TS type gymnastics string
杰理之BLE【篇】