当前位置:网站首页>One architecture to complete all tasks - transformer architecture is unifying the AI Jianghu on its own
One architecture to complete all tasks - transformer architecture is unifying the AI Jianghu on its own
2022-07-04 14:21:00 【A Virgo procedural ape】
Catalog
An architecture to accomplish all tasks —Transformer The architecture is being unified on its own AI Rivers and lakes
Language model , Images 、 Video has been Transformer The architecture refreshes the model scale and performance benchmark at the same time . I still want to talk about Transformer All kinds of variants of have been brilliant in this year , At the same time NLP and CV The field frequently brushes the list .
In recent years, ,transformer Architecture gradually extends its influence to various new fields . first ,Transformers It is developed for natural language processing , Now it is becoming a Swiss Army knife for in-depth learning . 2021 year , They are used to find drugs 、 Recognize voice, painting and other tasks .
transformers Has proven to be good at visual tasks 、 Predicting earthquakes and classifying and generating proteins . In the past year , Researchers have pushed them into broad new fields .
TransGAN:TransGAN It's a generative confrontation network , It is a combination of transformer To ensure that each generated pixel is consistent with its previously generated pixel . This work has achieved the most advanced results in measuring the similarity between the generated image and the training data .
TimeSformer:Facebook Of TimeSformer This architecture is used to identify actions in video clips . It explains the sequence of video frames , Instead of the usual sequence of words in the text . Its performance is better than convolutional neural network , You can analyze longer clips in a shorter time , And use less power .
GPT-2:Facebook、Google And researchers at the University of California, Berkeley trained on the text GPT-2, Then it freezes its self attention and feedforward layer . They can fine tune in a variety of areas , Including mathematics 、 Logic problems and computer vision .
AlphaFold 2:DeepMind Released AlphaFold 2 Open source version of , It USES transformer Find the protein according to the amino acid sequence 3D shape . The model has aroused the interest of the medical community , Because it has the potential to promote drug discovery and reveal biological insights .
Vision Transformer(ViT) as well as Video ViT:
Transformer On 2017 Made its debut in , And quickly changed the language modeling . Its self attention mechanism tracks the relationship between each element in the sequence and each other element , Not only suitable for analyzing word sequences , It is also suitable for analyzing pixels 、 Video frame 、 Amino acids, 、 Seismic wave sequence . be based on transformer The large language model of has become an example of the emerging basic model variety —— A model of pre training on a large unlabeled corpus , Special tasks can be fine tuned for a limited number of markup examples .transformer The fact that they can work well in various fields , It may indicate the basis beyond language transformer The basic model of .
The history of deep learning has witnessed some rapidly popular ideas :ReLU Activation function 、Adam Optimizer 、 Attention mechanism and current transformer. Developments over the past year have shown that , This architecture is still working .
Reference article :https://read.deeplearning.ai/the-batch/issue-123/
边栏推荐
- [matlab] summary of conv, filter, conv2, Filter2 and imfilter convolution functions
- How to package QT and share exe
- 第十七章 进程内存
- redis 日常笔记
- Vscode common plug-ins summary
- Install MySQL
- Use of tiledlayout function in MATLAB
- What is the real meaning and purpose of doing things, and what do you really want
- ViewModel 初体验
- Use of arouter
猜你喜欢
sql优化之explain
Hardware Basics - diode Basics
Intelligence d'affaires bi analyse financière, analyse financière au sens étroit et analyse financière au sens large sont - ils différents?
Unity Shader学习(三)试着绘制一个圆
【MySQL从入门到精通】【高级篇】(五)MySQL的SQL语句执行流程
[MySQL from introduction to proficiency] [advanced chapter] (V) SQL statement execution process of MySQL
第十七章 进程内存
Vscode common plug-ins summary
安装Mysql
Excel快速合并多行数据
随机推荐
数据仓库面试问题准备
Excel快速合并多行数据
GCC【6】- 编译的4个阶段
IP lab monthly resumption · issue 5
Unity Shader学习(三)试着绘制一个圆
opencv3.2 和opencv2.4安装
Excel quickly merges multiple rows of data
What is the real meaning and purpose of doing things, and what do you really want
TestSuite and testrunner in unittest
C # WPF realizes the real-time screen capture function of screen capture box
Common content type correspondence table
Map of mL: Based on Boston house price regression prediction data set, an interpretable case of xgboost model using map value
R语言使用lattice包中的bwplot函数可视化箱图(box plot)、par.settings参数自定义主题模式
数据埋点的一些问题和想法
R语言使用epiDisplay包的dotplot函数通过点图的形式可视化不同区间数据点的频率、使用by参数指定分组参数可视化不同分组的点图分布
Understand chisel language thoroughly 12. Chisel project construction, operation and testing (IV) -- chisel test of chisel test
vscode 常用插件汇总
Understand chisel language thoroughly 09. Chisel project construction, operation and testing (I) -- build and run chisel project with SBT
MySQL之详解索引
Use of tiledlayout function in MATLAB