当前位置:网站首页>300+篇文献!一文详解基于Transformer的多模态学习最新进展
300+篇文献!一文详解基于Transformer的多模态学习最新进展
2022-07-03 03:58:00 【智源社区】
论文地址:
https://arxiv.org/abs/2206.06488
摘要
Transformer 是一种很有前途的神经网络学习器,在各种机器学习任务中取得了巨大的成功。由于最近多模态应用和大数据的流行,基于 Transformer 的多模态学习已成为人工智能研究的热门话题。
本文对面向多模态数据的 Transformer 技术进行了全面调查。本文的主要内容包括:1)多模态学习、Transformer 生态系统和多模态大数据时代的背景;2)从一个几何拓扑视角进行 Vanilla Transformer、Vision Transformer 和 multimodal Transformer 的理论回顾;3)通过两个重要范式,即多模态预训练和特定多模态任务,对多模态 Transformer 应用的回顾;4)对多模态 Transformer 模型和应用所共有的共同挑战和设计的总结,以及 5)对社区的开放问题和潜在研究方向的讨论。
边栏推荐
- [learning notes] seckill - seckill project - (11) project summary
- Dynamic programming: Longest palindrome substring and subsequence
- Intercept string fixed length to array
- npm : 无法将“npm”项识别为 cmdlet、函数、脚本文件或可运行程序的名称。请检查名称的拼写,如果包括路径,请确保路径正确,然后再试一次。
- eth入门之DAPP
- pytorch是什么?pytorch是一个软件吗?
- Commands related to the startup of redis under Linux server (installation and configuration)
- FileZilla client download and installation
- 2022 tea master (intermediate) examination questions and analysis and tea master (intermediate) practical examination video
- redis在服务器linux下的启动的相关命令(安装和配置)
猜你喜欢
简易版 微信小程序开发之页面跳转、数据绑定、获取用户信息、获取用户位置信息
Application of I2C protocol of STM32F103 (read and write EEPROM)
js中#号的作用
[embedded module] OLED display module
Role of JS No
[brush questions] connected with rainwater (one dimension)
pytorch是什么?pytorch是一个软件吗?
2022 tea master (primary) examination questions and tea master (primary) examination question bank
Mutex and rwmutex in golang
SAP UI5 应用开发教程之一百零五 - SAP UI5 Master-Detail 布局模式的联动效果实现明细介绍
随机推荐
[Blue Bridge Road - bug free code] pcf8591 - code analysis of AD conversion
In Net 6 project using startup cs
sigaction的使用
Null and undefined
Dynamic programming: Longest palindrome substring and subsequence
编译文件时报错:错误: 编码GBK的不可映射字符
Commands related to the startup of redis under Linux server (installation and configuration)
[mathematical logic] predicate logic (judge whether the first-order predicate logic formula is true or false | explain | example | predicate logic formula type | forever true | forever false | satisfi
"Designer universe" argument: Data Optimization in the design field is finally reflected in cost, safety and health | chinabrand.com org
Application of I2C protocol of STM32F103 (read and write EEPROM)
Applet (continuous update)
TCP, the heavyweight guest in tcp/ip model -- Kuige of Shangwen network
leetcode:297. 二叉树的序列化与反序列化
Hutool dynamically adds scheduled tasks
[Blue Bridge Road -- bug free code] DS18B20 temperature reading code analysis
[mathematical logic] propositional logic (judgment of the correctness of propositional logic reasoning | formal structure is eternal truth - equivalent calculus | deduction from premise - logical reas
pytorch怎么下载?pytorch在哪里下载?
Download and install node, NPM and yarn
Error in compiled file: error: unmapped character encoding GBK
Ffmpeg recording screen and screenshot