当前位置:网站首页>300+ documents! This article explains the latest progress of multimodal learning based on transformer
300+ documents! This article explains the latest progress of multimodal learning based on transformer
2022-07-03 04:00:00 【Zhiyuan community】

Address of thesis :
https://arxiv.org/abs/2206.06488
Abstract
Transformer It is a promising neural network learner , It has achieved great success in various machine learning tasks . Due to the recent popularity of multimodal applications and big data , be based on Transformer Multimodal learning has become a hot topic in artificial intelligence research .
This paper deals with the problem of multi-modal data Transformer Technology has been thoroughly investigated . The main content of this article includes :1) Multimodal learning 、Transformer Background of ecosystem and multi-modal big data era ;2) From a geometric topological perspective Vanilla Transformer、Vision Transformer and multimodal Transformer Theoretical review of ;3) Through two important paradigms , That is, multimodal pre training and specific multimodal tasks , For multimodality Transformer Review of application ;4) For multimodality Transformer Summary of common challenges and designs shared by models and Applications , as well as 5) Discussion of open issues and potential research directions in the community .
边栏推荐
- QSAR model establishment script based on pytoch and rdkit
- Debug: CD cannot be used in kaggle
- pytorch项目怎么跑?
- 深潜Kotlin协程(二十):构建 Flow
- Half of 2022 is over, so we must hurry up
- [mathematical logic] propositional logic (judgment of the correctness of propositional logic reasoning | formal structure is eternal truth - equivalent calculus | deduction from premise - logical reas
- Esp32 series (3): GPIO learning (take simple GPIO input and output, ADC, DAC as examples)
- Dynamic programming: Longest palindrome substring and subsequence
- Nat. Comm. | 使用Tensor-cell2cell对细胞通讯进行环境感知去卷积
- [Blue Bridge Road -- bug free code] DS18B20 temperature reading code analysis
猜你喜欢

300+篇文献!一文详解基于Transformer的多模态学习最新进展

What is pytorch? Is pytorch a software?

【刷题篇】 找出第 K 小的数对距离
![[brush questions] connected with rainwater (one dimension)](/img/21/318fcb444b17be887562f4a9c1fac2.png)
[brush questions] connected with rainwater (one dimension)

Web session management security issues

Recursion: depth first search

leetcode:297. 二叉树的序列化与反序列化

Recursion: quick sort, merge sort and heap sort

因果AI,下一代可信AI的产业升级新范式?

中移物联网OneOS与OneNET入选《2021年物联网示范项目名单》
随机推荐
pytorch是什么?pytorch是一个软件吗?
Ffmpeg recording screen and screenshot
Arlo's thinking about himself
Ffmpeg one / more pictures synthetic video
[learning notes] seckill - seckill project - (11) project summary
编译文件时报错:错误: 编码GBK的不可映射字符
NPM: the 'NPM' item cannot be recognized as the name of a cmdlet, function, script file, or runnable program. Please check the spelling of the name. If the path is included, make sure the path is corr
【刷题篇】多数元素(超级水王问题)
pytorch项目怎么跑?
[mathematical logic] predicate logic (first-order predicate logic formula | example)
Half of 2022 is over, so we must hurry up
Nodejs Foundation: shallow chat URL and querystring module
Introduction to eth
2022deepbrainchain biweekly report no. 104 (01.16-02.15)
Mutex and rwmutex in golang
Arduino application development - LCD display GIF dynamic diagram
8.8.2-PointersOnC-20220214
For instruction, uploading pictures and display effect optimization of simple wechat applet development
Separable bonds and convertible bonds
Cnopendata China Customs Statistics