Transformer principle and code elaboration
2022-07-06 07:39:00 【bai666】
Course link: https://edu.51cto.com/course/30124.html
The Transformer originated in NLP (natural language processing) and has since crossed over into CV (computer vision). It has become a new paradigm in deep learning, with considerable influence and broad application prospects.
This course explains the principles of the Transformer in detail and walks through its PyTorch and TensorFlow 2 code, to help you master both the underlying theory and the concrete implementation.
The principles part covers: the attention and self-attention mechanisms, an overview of the Transformer architecture, multi-head attention in the Encoder (Multi-Head Attention), positional encoding in the Encoder (Positional Encoding), residual connections (Residual Connection), layer normalization (Layer Normalization), the FFN (Feed Forward Network), Transformer training and performance, and the Transformer machine translation workflow.
The code part uses Jupyter Notebook to go through the PyTorch and TensorFlow 2 implementations of the Transformer line by line, including: installing PyTorch/TensorFlow, dataset loading and preprocessing, positional encoding and multi-head attention, the Transformer class, the optimizer and loss function, the training code, and the inference and weight-saving code.
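As a rough illustration of the self-attention mechanism listed among the course topics, the following is a minimal sketch of scaled dot-product attention in plain Python. It is not taken from the course notebooks: the function names `softmax` and `scaled_dot_product_attention` are hypothetical, and for simplicity the queries, keys, and values are all set to the raw input, without the learned projection matrices or multiple heads that a full Transformer would use.

```python
import math

def softmax(xs):
    # Subtract the maximum before exponentiating for numerical stability.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def scaled_dot_product_attention(queries, keys, values):
    # Hypothetical helper: each argument is a list of equal-length vectors.
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values))
               for j in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Simplifying assumption: self-attention with Q = K = V = x (no learned weights).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
out, attn = scaled_dot_product_attention(x, x, x)
for row in attn:
    print([round(w, 3) for w in row])
```

Each row of `attn` sums to 1, so every output vector is a weighted average of the input vectors. In a real implementation the same computation is done with batched matrix operations in PyTorch or TensorFlow 2.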