300+ References! A Detailed Look at the Latest Advances in Transformer-Based Multimodal Learning
2022-07-03 03:58:00 [智源社区 (BAAI Community)]

Paper:
https://arxiv.org/abs/2206.06488
Abstract
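The Transformer is a promising neural network learner that has achieved great success across a wide range of machine learning tasks. With the recent prevalence of multimodal applications and big data, Transformer-based multimodal learning has become a hot topic in AI research.
This paper presents a comprehensive survey of Transformer techniques oriented toward multimodal data. Its main contents are: 1) the background of multimodal learning, the Transformer ecosystem, and the multimodal big data era; 2) a theoretical review of the Vanilla Transformer, the Vision Transformer, and multimodal Transformers from a geometrically topological perspective; 3) a review of multimodal Transformer applications through two important paradigms, namely multimodal pretraining and specific multimodal tasks; 4) a summary of the common challenges and designs shared by multimodal Transformer models and applications; and 5) a discussion of open problems and potential research directions for the community.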