One architecture for all tasks: the Transformer architecture is single-handedly unifying the AI world
2022-07-04 14:21:00 【A Virgo procedural ape】
Across language models, images, and video, the Transformer architecture has been raising both model scale and performance benchmarks. It is still worth noting that Transformer variants of all kinds have shone this year, repeatedly topping the leaderboards in both NLP and CV.
In recent years, the transformer architecture has steadily extended its influence into new fields. Transformers were originally developed for natural language processing and are now becoming a Swiss Army knife for deep learning. In 2021, they were used for tasks such as drug discovery, speech recognition, and painting.
Transformers had already proven good at vision tasks, earthquake prediction, and classifying and generating proteins. Over the past year, researchers pushed them into broad new fields.
TransGAN: TransGAN is a generative adversarial network that incorporates transformers to ensure that each generated pixel is consistent with the pixels generated before it. The work achieved state-of-the-art results on measures of how closely generated images resemble the training data.
TimeSformer: Facebook's TimeSformer uses the architecture to recognize actions in video clips. It interprets a sequence of video frames rather than the usual sequence of words in text. It outperforms convolutional neural networks, analyzing longer clips in less time and with less power.
GPT-2: Researchers at Facebook, Google, and the University of California, Berkeley trained a GPT-2 on text and then froze its self-attention and feed-forward layers. They were able to fine-tune it for a variety of domains, including mathematics, logic problems, and computer vision.
AlphaFold 2: DeepMind released an open-source version of AlphaFold 2, which uses transformers to find the 3D shape of a protein from its amino acid sequence. The model has excited the medical community because of its potential to advance drug discovery and reveal biological insights.
Vision Transformer (ViT) and Video ViT:
The transformer debuted in 2017 and quickly changed language modeling. Its self-attention mechanism tracks how each element in a sequence relates to every other element, which suits it not only to sequences of words but also to sequences of pixels, video frames, amino acids, and seismic waves. Large language models based on transformers have become examples of an emerging breed of foundation models: models that are pretrained on a large unlabeled corpus and can then be fine-tuned for specialized tasks on a limited number of labeled examples. The fact that transformers work well across so many domains may point to transformer-based foundation models beyond language.
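To make the self-attention idea concrete, here is a minimal single-head sketch in plain Python. It is an illustration only, not the implementation used by any of the models mentioned here: the function names, the toy input, and the identity projection matrices are all assumptions chosen for readability, and real transformers add multiple heads, learned weights, positional information, and feed-forward layers.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention.

    x is an n x d_model sequence; w_q, w_k, w_v are d_model x d_k
    projection matrices (hypothetical toy parameters, not trained).
    Returns (output, weights), where weights[i][j] is how strongly
    element i attends to element j.
    """
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]  # transpose of k
    # Compare every element with every other element, then scale.
    scores = [[s / math.sqrt(d_k) for s in row] for row in matmul(q, k_t)]
    weights = [softmax(row) for row in scores]
    # Each output row is a weighted mix of all value vectors.
    return matmul(weights, v), weights


# Toy sequence of 3 elements with 2 features each (assumed values).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
output, weights = self_attention(x, identity, identity, identity)
```

Nothing in the sketch depends on what the rows of `x` represent, which is why the same mechanism can be applied to word embeddings, image patches, video frames, or amino acids once they are encoded as vectors.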
The history of deep learning has seen a few ideas become popular very quickly: the ReLU activation function, the Adam optimizer, the attention mechanism, and now the transformer. Developments over the past year suggest that this architecture is still maturing.
Reference article: https://read.deeplearning.ai/the-batch/issue-123/