当前位置:网站首页>300+篇文献!一文详解基于Transformer的多模态学习最新进展
300+篇文献!一文详解基于Transformer的多模态学习最新进展
2022-07-03 03:58:00 【智源社区】

论文地址:
https://arxiv.org/abs/2206.06488
摘要
Transformer 是一种很有前途的神经网络学习器,在各种机器学习任务中取得了巨大的成功。由于最近多模态应用和大数据的流行,基于 Transformer 的多模态学习已成为人工智能研究的热门话题。
本文对面向多模态数据的 Transformer 技术进行了全面调查。本文的主要内容包括:1)多模态学习、Transformer 生态系统和多模态大数据时代的背景;2)从一个几何拓扑视角进行 Vanilla Transformer、Vision Transformer 和 multimodal Transformer 的理论回顾;3)通过两个重要范式,即多模态预训练和特定多模态任务,对多模态 Transformer 应用的回顾;4)对多模态 Transformer 模型和应用所共有的共同挑战和设计的总结,以及 5)对社区的开放问题和潜在研究方向的讨论。
边栏推荐
- nodejs基础:浅聊url和querystring模块
- Nodejs Foundation: shallow chat URL and querystring module
- Half of 2022 is over, so we must hurry up
- C language hashtable/hashset library summary
- Wechat applet + Alibaba IOT platform + Hezhou air724ug built with server version system analysis
- 2022-07-02:以下go语言代码输出什么?A:编译错误;B:Panic;C:NaN。 package main import “fmt“ func main() { var a =
- C语言HashTable/HashSet库汇总
- pytorch怎么下载?pytorch在哪里下载?
- NPM: the 'NPM' item cannot be recognized as the name of a cmdlet, function, script file, or runnable program. Please check the spelling of the name. If the path is included, make sure the path is corr
- 以两列的瀑布流为例,我们应该怎么构建每一列的数组
猜你喜欢

Ffmpeg recording screen and screenshot
![[brush questions] connected with rainwater (one dimension)](/img/21/318fcb444b17be887562f4a9c1fac2.png)
[brush questions] connected with rainwater (one dimension)

Wechat applet + Alibaba IOT platform + Hezhou air724ug built with server version system analysis

在写web项目的时候,文件上传用到了smartupload,用了new string()进行转码,但是在数据库中,还是会出现类似扑克的乱码

ffmpeg录制屏幕和截屏

Nanning water leakage detection: warmly congratulate Guangxi Zhongshui on winning the first famous brand in Guangxi

Makefile demo

js中#号的作用

2022 tea master (intermediate) examination questions and analysis and tea master (intermediate) practical examination video

释放数据力量的Ceph-尚文网络xUP楠哥
随机推荐
pytorch是什么?pytorch是一个软件吗?
"Designer universe" argument: Data Optimization in the design field is finally reflected in cost, safety and health | chinabrand.com org
中移物联网OneOS与OneNET入选《2021年物联网示范项目名单》
Debug: CD cannot be used in kaggle
Role of JS No
Recursion: depth first search
8.8.2-PointersOnC-20220214
Web session management security issues
Shardingsphere dynamic data source
Makefile demo
golang xxx. Go code template
没有sXid,suid&sgid将进入险境!-尚文网络xUP楠哥
Is pytorch difficult to learn? How to learn pytorch well?
[mathematical logic] predicate logic (judge whether the first-order predicate logic formula is true or false | explain | example | predicate logic formula type | forever true | forever false | satisfi
2022 tea master (intermediate) examination questions and analysis and tea master (intermediate) practical examination video
Numpy warning visibledeprecationwarning: creating an ndarray from ragged needed sequences
pytorch开源吗?
Is pytorch open source?
Error c2694 "void logger:: log (nvinfer1:: ilogger:: severity, const char *)": rewrite the restrictive exception specification of virtual functions than base class virtual member functions
错误 C2694 “void Logger::log(nvinfer1::ILogger::Severity,const char *)”: 重写虚函数的限制性异常规范比基类虚成员函数