ShiftViT: Swin Transformer accuracy at better-than-ResNet speed, arguing that the success of ViT does not lie in attention
2022-07-03 22:31:00 【Zhiyuan community】
The attention mechanism is widely considered the key to the success of the Vision Transformer (ViT), because it provides a flexible and powerful way to model spatial relationships. But is attention really an indispensable part of ViT? Can it be replaced by some other alternative? To uncover the role of the attention mechanism, the authors reduce it to an extremely simple case: zero FLOPs and zero parameters.
Specifically, the authors revisit the shift operation. It contains no parameters and performs no arithmetic; its only action is to exchange a small portion of the channels between neighboring features. Building on this simple operation, the authors construct a new backbone, ShiftViT, in which the attention layers of ViT are replaced by shift operations.
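As an illustration of what "exchanging a small portion of the channels between neighboring features" can look like, here is a minimal pure-Python sketch, not the authors' implementation. The function names, the `shift_div` parameter (one group of `C // shift_div` channels per direction), the one-pixel step, and the zero-filling of vacated positions are all assumptions made for this example; the text above does not specify them.

```python
def shift_2d(channel, dy, dx):
    """Shift one H x W channel by (dy, dx) pixels.

    Positions that no source pixel moves into are filled with 0
    (an assumption for this sketch); pixels shifted out are dropped.
    """
    h, w = len(channel), len(channel[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            sy, sx = y - dy, x - dx  # source position for (y, x)
            if 0 <= sy < h and 0 <= sx < w:
                out[y][x] = channel[sy][sx]
    return out


def partial_shift(feature, shift_div=12):
    """Shift four small channel groups in four directions.

    feature: list of C channels, each an H x W list of lists.
    shift_div: hypothetical knob; each direction gets C // shift_div
    channels, and all remaining channels are copied unchanged.
    No weights are learned and no arithmetic is done on the values.
    """
    c = len(feature)
    g = c // shift_div
    directions = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # right, left, down, up
    out = [[row[:] for row in ch] for ch in feature]  # copy of the input
    for i, (dy, dx) in enumerate(directions):
        for k in range(i * g, (i + 1) * g):
            out[k] = shift_2d(feature[k], dy, dx)
    return out
```

With 12 channels and `shift_div=12`, channels 0 to 3 are each moved one pixel in a different direction and channels 4 to 11 pass through untouched, so each spatial position ends up holding a few values that came from its neighbors.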
Surprisingly, ShiftViT works well on several mainstream tasks, such as classification, detection, and segmentation, with performance even better than Swin Transformer. These results suggest that the attention mechanism may not be the key factor behind the success of ViT; it can even be replaced by an operation with zero parameters. Future work should pay more attention to the remaining parts of ViT.
Paper link:
https://arxiv.org/abs/2201.10801
The detailed architecture of the Shift Block is shown in the figure below: