当前位置:网站首页>ICML 2022 | flowformer: task generic linear complexity transformer
ICML 2022 | flowformer: task generic linear complexity transformer
2022-07-06 21:04:00 【Zhiyuan community】
Paper title :Flowformer: Linearizing Transformers with Conservation Flows
Thesis link :https://arxiv.org/pdf/2202.06258.pdf
Code link :https://github.com/thuml/Flowformer
This paper deeply studies the quadratic complexity of attention mechanism , By introducing the conservation principle in network flow into the design , Naturally, competition mechanism is introduced into attention Computing , It effectively avoids ordinary attention problems .
What we proposed Task common The backbone network Flowformer, Realized Linear complexity , At the same time Long sequence 、 Vision 、 Natural language 、 The time series 、 Reinforcement learning Achieved excellent results in five major tasks .
In the application of long sequence modeling , Such as protein structure prediction 、 Long text understanding, etc ,Flowformer It has good application potential . Besides ,Flowformer in “ No special inductive preference ” The design concept of is also of good enlightening significance to the research of general infrastructure .
Flow-Attention The pseudo-code is as follows :
Main experimental results :
边栏推荐
- 知识图谱之实体对齐二
- R语言可视化两个以上的分类(类别)变量之间的关系、使用vcd包中的Mosaic函数创建马赛克图( Mosaic plots)、分别可视化两个、三个、四个分类变量的关系的马赛克图
- Build your own application based on Google's open source tensorflow object detection API video object recognition system (IV)
- ICML 2022 | Flowformer: 任务通用的线性复杂度Transformer
- Value of APS application in food industry
- 1500萬員工輕松管理,雲原生數據庫GaussDB讓HR辦公更高效
- Pat 1078 hashing (25 points) ⼆ times ⽅ exploration method
- [wechat applet] operation mechanism and update mechanism
- C language games - minesweeping
- Taylor series fast Fourier transform (FFT)
猜你喜欢
The mail command is used in combination with the pipeline command statement
1_ Introduction to go language
(工作记录)2020年3月11日至2021年3月15日
3D人脸重建:从基础知识到识别/重建方法!
Manifest of SAP ui5 framework json
LLVM之父Chris Lattner:为什么我们要重建AI基础设施软件
APS taps home appliance industry into new growth points
基于STM32单片机设计的红外测温仪(带人脸检测)
Spark SQL chasing Wife Series (initial understanding)
use. Net analysis Net talent challenge participation
随机推荐
Utilisation de l'écran OLED
1_ Introduction to go language
The biggest pain point of traffic management - the resource utilization rate cannot go up
Reinforcement learning - learning notes 5 | alphago
KDD 2022 | 通过知识增强的提示学习实现统一的对话式推荐
【微信小程序】运行机制和更新机制
Simple continuous viewing PTA
请问sql group by 语句问题
Laravel笔记-自定义登录中新增登录5次失败锁账户功能(提高系统安全性)
什么是RDB和AOF
【论文解读】用于白内障分级/分类的机器学习技术
审稿人dis整个研究方向已经不仅仅是在审我的稿子了怎么办?
Leetcode hot topic Hot 100 day 32: "minimum coverage substring"
OLED屏幕的使用
Spiral square PTA
How to implement common frameworks
Solution to the 38th weekly match of acwing
[weekly pit] calculate the sum of primes within 100 + [answer] output triangle
强化学习-学习笔记5 | AlphaGo
知识图谱之实体对齐二