当前位置:网站首页>300+ documents! This article explains the latest progress of multimodal learning based on transformer
300+ documents! This article explains the latest progress of multimodal learning based on transformer
2022-07-03 04:00:00 【Zhiyuan community】
Address of thesis :
https://arxiv.org/abs/2206.06488
Abstract
Transformer It is a promising neural network learner , It has achieved great success in various machine learning tasks . Due to the recent popularity of multimodal applications and big data , be based on Transformer Multimodal learning has become a hot topic in artificial intelligence research .
This paper deals with the problem of multi-modal data Transformer Technology has been thoroughly investigated . The main content of this article includes :1) Multimodal learning 、Transformer Background of ecosystem and multi-modal big data era ;2) From a geometric topological perspective Vanilla Transformer、Vision Transformer and multimodal Transformer Theoretical review of ;3) Through two important paradigms , That is, multimodal pre training and specific multimodal tasks , For multimodality Transformer Review of application ;4) For multimodality Transformer Summary of common challenges and designs shared by models and Applications , as well as 5) Discussion of open issues and potential research directions in the community .
边栏推荐
- ZIP文件的导出
- IPv6 foundation construction experiment
- What is pytorch? Is pytorch a software?
- vim 的实用操作
- Ffmpeg download and installation tutorial and introduction
- Filter
- What is the correct way to compare ntext columns with constant values- What's the right way to compare an NTEXT column with a constant value?
- Cnopendata China Customs Statistics
- 第十届中国云计算大会·中国站:展望未来十年科技走向
- [learning notes] seckill - seckill project - (11) project summary
猜你喜欢
2022-07-02: what is the output of the following go language code? A: Compilation error; B:Panic; C:NaN。 package main import “fmt“ func main() { var a =
The 10th China Cloud Computing Conference · China Station: looking forward to the trend of science and technology in the next decade
pytorch是什么?pytorch是一个软件吗?
Some preliminary preparations for QQ applet development: make an appointment for a development account, download and install developer tools, and create QQ applet
"Final review" 16/32-bit microprocessor (8086) basic register
有监督预训练!文本生成又一探索!
The latest analysis of the main principals of hazardous chemical business units in 2022 and the simulated examination questions of the main principals of hazardous chemical business units
2022 tea master (intermediate) examination questions and analysis and tea master (intermediate) practical examination video
Mila、渥太华大学 | 用SE(3)不变去噪距离匹配进行分子几何预训练
【刷题篇】 找出第 K 小的数对距离
随机推荐
Arlo's thinking about himself
Debug: CD cannot be used in kaggle
2022 Shandong Province safety officer C certificate examination questions and Shandong Province safety officer C certificate simulation examination question bank
nodejs基础:浅聊url和querystring模块
redis在服务器linux下的启动的相关命令(安装和配置)
Error c2694 "void logger:: log (nvinfer1:: ilogger:: severity, const char *)": rewrite the restrictive exception specification of virtual functions than base class virtual member functions
Recursion: depth first search
CVPR 2022 | 大連理工提出自校准照明框架,用於現實場景的微光圖像增强
【刷题篇】 找出第 K 小的数对距离
[Blue Bridge Road -- bug free code] interpretation of some codes of matrix keyboard
中移物联网OneOS与OneNET入选《2021年物联网示范项目名单》
[embedded module] OLED display module
阿洛对自己的思考
Separable bonds and convertible bonds
Half of 2022 is over, so we must hurry up
Esp32 series (3): GPIO learning (take simple GPIO input and output, ADC, DAC as examples)
Hutool dynamically adds scheduled tasks
基于Pytorch和RDKit的QSAR模型建立脚本
C language hashtable/hashset library summary
[mathematical logic] propositional logic (judgment of the correctness of propositional logic reasoning | formal structure is eternal truth - equivalent calculus | deduction from premise - logical reas