当前位置:网站首页>One architecture to complete all tasks - transformer architecture is unifying the AI Jianghu on its own
One architecture to complete all tasks - transformer architecture is unifying the AI Jianghu on its own
2022-07-04 14:21:00 【A Virgo procedural ape】
Catalog
An architecture to accomplish all tasks —Transformer The architecture is being unified on its own AI Rivers and lakes
Language model , Images 、 Video has been Transformer The architecture refreshes the model scale and performance benchmark at the same time . I still want to talk about Transformer All kinds of variants of have been brilliant in this year , At the same time NLP and CV The field frequently brushes the list .
In recent years, ,transformer Architecture gradually extends its influence to various new fields . first ,Transformers It is developed for natural language processing , Now it is becoming a Swiss Army knife for in-depth learning . 2021 year , They are used to find drugs 、 Recognize voice, painting and other tasks .
transformers Has proven to be good at visual tasks 、 Predicting earthquakes and classifying and generating proteins . In the past year , Researchers have pushed them into broad new fields .
TransGAN:TransGAN It's a generative confrontation network , It is a combination of transformer To ensure that each generated pixel is consistent with its previously generated pixel . This work has achieved the most advanced results in measuring the similarity between the generated image and the training data .
TimeSformer:Facebook Of TimeSformer This architecture is used to identify actions in video clips . It explains the sequence of video frames , Instead of the usual sequence of words in the text . Its performance is better than convolutional neural network , You can analyze longer clips in a shorter time , And use less power .
GPT-2:Facebook、Google And researchers at the University of California, Berkeley trained on the text GPT-2, Then it freezes its self attention and feedforward layer . They can fine tune in a variety of areas , Including mathematics 、 Logic problems and computer vision .
AlphaFold 2:DeepMind Released AlphaFold 2 Open source version of , It USES transformer Find the protein according to the amino acid sequence 3D shape . The model has aroused the interest of the medical community , Because it has the potential to promote drug discovery and reveal biological insights .
Vision Transformer(ViT) as well as Video ViT:
Transformer On 2017 Made its debut in , And quickly changed the language modeling . Its self attention mechanism tracks the relationship between each element in the sequence and each other element , Not only suitable for analyzing word sequences , It is also suitable for analyzing pixels 、 Video frame 、 Amino acids, 、 Seismic wave sequence . be based on transformer The large language model of has become an example of the emerging basic model variety —— A model of pre training on a large unlabeled corpus , Special tasks can be fine tuned for a limited number of markup examples .transformer The fact that they can work well in various fields , It may indicate the basis beyond language transformer The basic model of .
The history of deep learning has witnessed some rapidly popular ideas :ReLU Activation function 、Adam Optimizer 、 Attention mechanism and current transformer. Developments over the past year have shown that , This architecture is still working .
Reference article :https://read.deeplearning.ai/the-batch/issue-123/
边栏推荐
- Yingshi Ruida rushes to the scientific and Technological Innovation Board: the annual revenue is 450million and the proposed fund-raising is 979million
- vscode 常用插件汇总
- Deming Lee listed on Shenzhen Stock Exchange: the market value is 3.1 billion, which is the husband and wife of Li Hu and Tian Hua
- sql优化之查询优化器
- 瑞吉外卖笔记
- 架构方面的进步
- Ws2818m is packaged in cpc8. It is a special circuit for three channel LED drive control. External IC full-color double signal 5v32 lamp programmable LED lamp with outdoor engineering
- R语言dplyr包summarise_if函数计算dataframe数据中所有数值数据列的均值和中位数、基于条件进行数据汇总分析(Summarize all Numeric Variables)
- Migration from go vendor project to mod project
- 按照功能对Boost库进行分类
猜你喜欢
随机推荐
The mouse wheel of xshell/bash/zsh and other terminals is garbled (turn)
Detailed index of MySQL
失败率高达80%,企业数字化转型路上有哪些挑战?
Common content type correspondence table
R language uses bwplot function in lattice package to visualize box plot and par Settings parameter custom theme mode
C# wpf 实现截屏框实时截屏功能
Vscode common plug-ins summary
如何游戏出海代运营、游戏出海代投
10.(地图数据篇)离线地形数据处理(供Cesium使用)
按照功能对Boost库进行分类
LiveData
Understand chisel language thoroughly 12. Chisel project construction, operation and testing (IV) -- chisel test of chisel test
Data warehouse interview question preparation
Deming Lee listed on Shenzhen Stock Exchange: the market value is 3.1 billion, which is the husband and wife of Li Hu and Tian Hua
sql优化之查询优化器
架构方面的进步
Haobo medical sprint technology innovation board: annual revenue of 260million Yonggang and Shen Zhiqun are the actual controllers
R语言使用dplyr包的group_by函数和summarise函数基于分组变量计算目标变量的均值、标准差
Matters needing attention in overseas game Investment Agency
LifeCycle