当前位置:网站首页>ICML 2022 | flowformer: task generic linear complexity transformer
ICML 2022 | flowformer: task generic linear complexity transformer
2022-07-06 21:04:00 【Zhiyuan community】
Paper title :Flowformer: Linearizing Transformers with Conservation Flows
Thesis link :https://arxiv.org/pdf/2202.06258.pdf
Code link :https://github.com/thuml/Flowformer
This paper deeply studies the quadratic complexity of attention mechanism , By introducing the conservation principle in network flow into the design , Naturally, competition mechanism is introduced into attention Computing , It effectively avoids ordinary attention problems .
What we proposed Task common The backbone network Flowformer, Realized Linear complexity , At the same time Long sequence 、 Vision 、 Natural language 、 The time series 、 Reinforcement learning Achieved excellent results in five major tasks .
In the application of long sequence modeling , Such as protein structure prediction 、 Long text understanding, etc ,Flowformer It has good application potential . Besides ,Flowformer in “ No special inductive preference ” The design concept of is also of good enlightening significance to the research of general infrastructure .
Flow-Attention The pseudo-code is as follows :
Main experimental results :
边栏推荐
- Entity alignment two of knowledge map
- Hardware development notes (10): basic process of hardware development, making a USB to RS232 module (9): create ch340g/max232 package library sop-16 and associate principle primitive devices
- [DSP] [Part 1] start DSP learning
- What is the difference between procedural SQL and C language in defining variables
- [asp.net core] set the format of Web API response data -- formatfilter feature
- No Yum source to install SPuG monitoring
- 15 millions d'employés sont faciles à gérer et la base de données native du cloud gaussdb rend le Bureau des RH plus efficace
- 监控界的最强王者,没有之一!
- Regular expression collection
- 1500萬員工輕松管理,雲原生數據庫GaussDB讓HR辦公更高效
猜你喜欢
[DIY]如何制作一款个性的收音机
Pytest (3) - Test naming rules
[DSP] [Part 2] understand c6678 and create project
SAP UI5 框架的 manifest.json
[DSP] [Part 1] start DSP learning
Swagger UI教程 API 文档神器
Data Lake (VIII): Iceberg data storage format
[diy] self designed Microsoft makecode arcade, official open source software and hardware
Common English vocabulary that every programmer must master (recommended Collection)
Pinduoduo lost the lawsuit, and the case of bargain price difference of 0.9% was sentenced; Wechat internal test, the same mobile phone number can register two account functions; 2022 fields Awards an
随机推荐
PG基础篇--逻辑结构管理(事务)
[DIY]自己设计微软MakeCode街机,官方开源软硬件
Database - how to get familiar with hundreds of tables of the project -navicat these unique skills, have you got it? (exclusive experience)
3D face reconstruction: from basic knowledge to recognition / reconstruction methods!
(工作记录)2020年3月11日至2021年3月15日
The most comprehensive new database in the whole network, multidimensional table platform inventory note, flowus, airtable, seatable, Vig table Vika, flying Book Multidimensional table, heipayun, Zhix
Pat 1078 hashing (25 points) ⼆ times ⽅ exploration method
Leetcode hot topic Hot 100 day 32: "minimum coverage substring"
Pytest (3) - Test naming rules
Simple continuous viewing PTA
LLVM之父Chris Lattner:为什么我们要重建AI基础设施软件
PHP online examination system version 4.0 source code computer + mobile terminal
Web开发小妙招:巧用ThreadLocal规避层层传值
Statistical inference: maximum likelihood estimation, Bayesian estimation and variance deviation decomposition
What key progress has been made in deep learning in 2021?
Regular expression collection
Web开发小妙招:巧用ThreadLocal规避层层传值
1500萬員工輕松管理,雲原生數據庫GaussDB讓HR辦公更高效
Swagger UI tutorial API document artifact
【微信小程序】運行機制和更新機制