One architecture for every task: the Transformer architecture is single-handedly unifying the AI world
2022-07-04 14:21:00 【A Virgo procedural ape】
In language modeling, images, and video, the Transformer architecture has been pushing up model scale and performance benchmarks at the same time. Transformer variants of all kinds have shone this year, repeatedly topping leaderboards in both NLP and CV.
In recent years, the transformer architecture has steadily extended its reach into new fields. Originally developed for natural language processing, transformers are becoming a Swiss Army knife for deep learning. In 2021, they were used to discover drugs, recognize speech, and generate paintings, among other tasks.
Transformers had already proven good at vision tasks, predicting earthquakes, and classifying and generating proteins. Over the past year, researchers pushed them into broad new fields.
TransGAN: TransGAN is a generative adversarial network that incorporates transformers to ensure that each generated pixel is consistent with the pixels generated before it. The work achieved state-of-the-art results on measures of how closely generated images resemble the training data.
TimeSformer: Facebook's TimeSformer uses the architecture to recognize actions in video clips. It interprets a sequence of video frames rather than the usual sequence of words in text. It outperformed convolutional neural networks, analyzing longer clips in less time while using less power.
GPT-2: Researchers at Facebook, Google, and the University of California, Berkeley trained GPT-2 on text, then froze its self-attention and feed-forward layers. They were able to fine-tune the model for a variety of domains, including mathematics, logic problems, and computer vision.
AlphaFold 2: DeepMind released an open-source version of AlphaFold 2, which uses transformers to find the 3D shape of a protein from its amino acid sequence. The model has excited the medical community because it has the potential to advance drug discovery and reveal biological insights.
Vision Transformer (ViT) and Video ViT:
The Transformer debuted in 2017 and quickly changed language modeling. Its self-attention mechanism, which tracks how each element in a sequence relates to every other element, is suited to analyzing not only sequences of words but also sequences of pixels, video frames, amino acids, and seismic waves. Large transformer-based language models have become examples of an emerging class of foundation models: models pretrained on a large unlabeled corpus that can then be fine-tuned for specialized tasks using a limited number of labeled examples. The fact that transformers work well across so many domains may point to transformer-based foundation models beyond language.
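To make the self-attention idea concrete, here is a minimal sketch in plain Python, not taken from the referenced article. It assumes the simplest possible setup: the queries, keys, and values are the input vectors themselves, whereas a real transformer first applies learned linear projections and uses multiple heads. The function names `softmax` and `self_attention` and the toy `tokens` data are illustrative only.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(sequence):
    """Scaled dot-product self-attention with identity projections.

    Assumption: queries, keys and values are the raw input vectors;
    a real transformer applies learned projections first.
    Returns (outputs, weights), where weights[i][j] is how strongly
    element i attends to element j.
    """
    dim = len(sequence[0])
    scale = math.sqrt(dim)
    outputs, weights = [], []
    for query in sequence:
        # Compare this element with every element in the sequence.
        scores = [
            sum(q * k for q, k in zip(query, key)) / scale
            for key in sequence
        ]
        row = softmax(scores)
        weights.append(row)
        # Each output is a weighted average of all value vectors.
        outputs.append([
            sum(w * value[d] for w, value in zip(row, sequence))
            for d in range(dim)
        ])
    return outputs, weights


# A toy sequence of three 2-dimensional elements; these could stand in
# for embedded words, image patches, video frames, or amino acids.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = self_attention(tokens)
for row in weights:
    print([round(w, 3) for w in row])
```

Nothing in the computation depends on what the elements represent, which is one way to see why the same mechanism carries over from words to pixels or amino acids: only the step that turns raw data into vectors changes.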
The history of deep learning has seen a handful of ideas that rapidly became pervasive: the ReLU activation function, the Adam optimizer, the attention mechanism, and now the transformer. The past year's developments suggest that this architecture is still coming into its own.
Reference: https://read.deeplearning.ai/the-batch/issue-123/