当前位置:网站首页>One architecture to complete all tasks - transformer architecture is unifying the AI Jianghu on its own
One architecture to complete all tasks - transformer architecture is unifying the AI Jianghu on its own
2022-07-04 14:21:00 【A Virgo procedural ape】
Catalog
An architecture to accomplish all tasks —Transformer The architecture is being unified on its own AI Rivers and lakes
Language model , Images 、 Video has been Transformer The architecture refreshes the model scale and performance benchmark at the same time . I still want to talk about Transformer All kinds of variants of have been brilliant in this year , At the same time NLP and CV The field frequently brushes the list .
In recent years, ,transformer Architecture gradually extends its influence to various new fields . first ,Transformers It is developed for natural language processing , Now it is becoming a Swiss Army knife for in-depth learning . 2021 year , They are used to find drugs 、 Recognize voice, painting and other tasks .
transformers Has proven to be good at visual tasks 、 Predicting earthquakes and classifying and generating proteins . In the past year , Researchers have pushed them into broad new fields .
TransGAN:TransGAN It's a generative confrontation network , It is a combination of transformer To ensure that each generated pixel is consistent with its previously generated pixel . This work has achieved the most advanced results in measuring the similarity between the generated image and the training data .
TimeSformer:Facebook Of TimeSformer This architecture is used to identify actions in video clips . It explains the sequence of video frames , Instead of the usual sequence of words in the text . Its performance is better than convolutional neural network , You can analyze longer clips in a shorter time , And use less power .
GPT-2:Facebook、Google And researchers at the University of California, Berkeley trained on the text GPT-2, Then it freezes its self attention and feedforward layer . They can fine tune in a variety of areas , Including mathematics 、 Logic problems and computer vision .
AlphaFold 2:DeepMind Released AlphaFold 2 Open source version of , It USES transformer Find the protein according to the amino acid sequence 3D shape . The model has aroused the interest of the medical community , Because it has the potential to promote drug discovery and reveal biological insights .
Vision Transformer(ViT) as well as Video ViT:
Transformer On 2017 Made its debut in , And quickly changed the language modeling . Its self attention mechanism tracks the relationship between each element in the sequence and each other element , Not only suitable for analyzing word sequences , It is also suitable for analyzing pixels 、 Video frame 、 Amino acids, 、 Seismic wave sequence . be based on transformer The large language model of has become an example of the emerging basic model variety —— A model of pre training on a large unlabeled corpus , Special tasks can be fine tuned for a limited number of markup examples .transformer The fact that they can work well in various fields , It may indicate the basis beyond language transformer The basic model of .
The history of deep learning has witnessed some rapidly popular ideas :ReLU Activation function 、Adam Optimizer 、 Attention mechanism and current transformer. Developments over the past year have shown that , This architecture is still working .
Reference article :https://read.deeplearning.ai/the-batch/issue-123/
边栏推荐
- R语言使用dplyr包的mutate函数对指定数据列进行标准化处理(使用mean函数和sd函数)并基于分组变量计算标准化后的目标变量的分组均值
- MySQL的触发器
- Understand chisel language thoroughly 05. Chisel Foundation (II) -- combinational circuits and operators
- R language uses dplyr package group_ The by function and the summarize function calculate the mean and standard deviation of the target variables based on the grouped variables
- Error in find command: paths must precede expression (turn)
- Test evaluation of software testing
- R语言使用epiDisplay包的followup.plot函数可视化多个ID(病例)监测指标的纵向随访图、使用stress.col参数指定强调线的id子集的颜色(色彩)
- Learning projects are self-made, and growth opportunities are self created
- 数据中台概念
- docker-compose公网部署redis哨兵模式
猜你喜欢

数据仓库面试问题准备

Understand chisel language thoroughly 11. Chisel project construction, operation and test (III) -- scalatest of chisel test

Hardware Basics - diode Basics

【FAQ】華為帳號服務報錯 907135701的常見原因總結和解决方法

MySQL之详解索引

Understand chisel language thoroughly 09. Chisel project construction, operation and testing (I) -- build and run chisel project with SBT

C # WPF realizes the real-time screen capture function of screen capture box

Understand chisel language thoroughly 12. Chisel project construction, operation and testing (IV) -- chisel test of chisel test

Excel quickly merges multiple rows of data

The font of markdown grammar is marked in red
随机推荐
【信息检索】分类和聚类的实验
测试流程整理(2)
Code hoof collection of wonderful secret place
R语言ggplot2可视化:gganimate包创建动画图(gif)、使用anim_save函数保存gif可视化动图
Understand chisel language thoroughly 09. Chisel project construction, operation and testing (I) -- build and run chisel project with SBT
gin集成支付宝支付
Remove duplicate letters [greedy + monotonic stack (maintain monotonic sequence with array +len)]
如何游戏出海代运营、游戏出海代投
R language uses the DOTPLOT function of epidisplay package to visualize the frequency of data points in different intervals in the form of point graph, and uses the by parameter to specify the groupin
How to package QT and share exe
Understand chisel language thoroughly 03. Write to the developer of Verilog to chisel (you can also see it without Verilog Foundation)
File creation, writing, reading, deletion (transfer) in go language
Mongodb commonly used 28 query statements (forward)
Why should Base64 encoding be used for image transmission
ViewModel 初体验
Common content type correspondence table
Unity shader learning (3) try to draw a circle
Gorm data insertion (transfer)
docker-compose公网部署redis哨兵模式
2022 game going to sea practical release strategy