300+ references! A survey of the latest progress in Transformer-based multimodal learning
2022-07-03 04:00:00 【Zhiyuan community】
Paper link:
https://arxiv.org/abs/2206.06488
Abstract
The Transformer is a promising neural network learner that has achieved great success in a wide range of machine learning tasks. With the recent prevalence of multimodal applications and big data, Transformer-based multimodal learning has become a hot topic in artificial intelligence research.
This paper presents a comprehensive survey of Transformer techniques for multimodal data. Its main contents are: 1) background on multimodal learning, the Transformer ecosystem, and the multimodal big data era; 2) a theoretical review of the Vanilla Transformer, the Vision Transformer, and multimodal Transformers from a geometrically topological perspective; 3) a review of multimodal Transformer applications through two important paradigms, namely multimodal pretraining and specific multimodal tasks; 4) a summary of the common challenges and designs shared by multimodal Transformer models and applications; and 5) a discussion of open problems and potential research directions for the community.