ICML 2022 | Flowformer: A Task-Generic Transformer with Linear Complexity
2022-07-06 21:04:00 【Zhiyuan community】
Paper title: Flowformer: Linearizing Transformers with Conservation Flows
Paper link: https://arxiv.org/pdf/2202.06258.pdf
Code link: https://github.com/thuml/Flowformer
This paper studies the quadratic complexity of the attention mechanism in depth. By bringing the conservation principle from network flow into the design, it introduces a competition mechanism into the attention computation in a natural way, which effectively avoids the problem of trivial attention.
The task-generic backbone we propose, Flowformer, achieves linear complexity while delivering excellent results on five major task families: long sequences, vision, natural language, time series, and reinforcement learning.
Flowformer has good application potential in long-sequence modeling, for example protein structure prediction and long-text understanding. In addition, its design philosophy of "no specific inductive bias" offers useful inspiration for research on general-purpose foundation architectures.
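To make the linear-complexity claim concrete, here is a minimal sketch of generic kernelized linear attention. It is not Flow-Attention itself and omits the conservation-based competition described above. It only illustrates the associativity of matrix multiplication that lets attention avoid the n x n score matrix: computing phi(Q) (phi(K)^T V) instead of (phi(Q) phi(K)^T) V. The sigmoid feature map `phi` and all function names are assumptions made for this illustration.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    """Swap rows and columns of a matrix."""
    return [list(col) for col in zip(*a)]


def phi(m):
    """Element-wise sigmoid: an assumed non-negative feature map."""
    return [[1.0 / (1.0 + math.exp(-x)) for x in row] for row in m]


def quadratic_attention(q, k, v):
    """Row-normalize phi(Q) phi(K)^T, then multiply by V: O(n^2 * d)."""
    scores = matmul(phi(q), transpose(phi(k)))  # explicit n x n matrix
    weights = []
    for row in scores:
        total = sum(row)
        weights.append([s / total for s in row])
    return matmul(weights, v)


def linear_attention(q, k, v):
    """Same output via phi(Q) (phi(K)^T V): O(n * d^2), no n x n matrix."""
    fq, fk = phi(q), phi(k)
    kv = matmul(transpose(fk), v)           # d x d_v summary of keys and values
    k_sum = [sum(col) for col in zip(*fk)]  # column sums of phi(K), length d
    numer = matmul(fq, kv)                  # n x d_v
    denom = [sum(a * b for a, b in zip(row, k_sum)) for row in fq]
    return [[x / z for x in row] for row, z in zip(numer, denom)]
```

Both functions return the same values up to floating-point error. The second never materializes an n x n matrix, so its cost grows linearly with the sequence length n for a fixed feature dimension d.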
[Figure: pseudo-code of Flow-Attention]
[Figure: main experimental results]