当前位置:网站首页>Shiftvit uses the precision of swing transformer to outperform the speed of RESNET, and discusses that the success of Vit does not lie in attention!
Shiftvit uses the precision of swing transformer to outperform the speed of RESNET, and discusses that the success of Vit does not lie in attention!
2022-07-03 22:31:00 【Zhiyuan community】

Attention mechanism is widely considered to be Vision Transformer(ViT) The key to success , Because it provides a flexible and powerful way to model spatial relationships . However , The attention mechanism is really ViT An integral part of ? Can it be replaced by some other alternatives ? In order to uncover the role of attention mechanism , The author reduces it to a very simple case :ZERO FLOP and ZERO parameter.
To be specific , The author reexamined Shift operation . It does not contain any parameters or arithmetic calculations . The only operation is to exchange a small number of channels between adjacent features . Based on this simple operation , The author constructs a new Backbone, namely ShiftViT, among ViT The attention layer in is shift Operation replaced .
It's amazing ,ShiftViT Worked well on several mainstream tasks , Such as the classification 、 Detection and segmentation . Performance is even better than Swin Transformer Better . These results suggest that , The attention mechanism may not make ViT The key to success . It can even be replaced by an operation with zero parameters . In the future work , We should pay more attention to ViT The rest of .
Thesis link :
https://arxiv.org/abs/2201.10801

Shift Block The detailed architecture of is shown in the figure below :

边栏推荐
- Data consistency between redis and database
- 1068. Consolidation of ring stones (ring, interval DP)
- Blue Bridge Cup Guoxin Changtian MCU -- program download (III)
- IPhone development swift foundation 08 encryption and security
- js demo 計算本年度還剩下多少天
- Exness: the Central Bank of England will raise interest rates again in March, and inflation is coming
- How does sentinel, a traffic management artifact, make it easy for business parties to access?
- Sow of PMP
- Overview of Yunxi database executor
- Cesium terrain clipping draw polygon clipping
猜你喜欢

Learning notes of raspberry pie 4B - IO communication (SPI)

Bluebridge cup Guoxin Changtian single chip microcomputer -- hardware environment (I)
![[SRS] build a specified version of SRS](/img/01/0d2d762e01b304220b8924d20277e3.jpg)
[SRS] build a specified version of SRS

Exclusive interview with the person in charge of openkruise: to what extent has cloud native application automation developed now?

Opengauss database log management guide

C deep anatomy - the concept of keywords and variables # dry inventory #

Quick one click batch adding video text watermark and modifying video size simple tutorial

Yyds dry goods inventory Spring Festival "make" your own fireworks

How to connect a laptop to a projector

The latest analysis of crane driver (limited to bridge crane) in 2022 and the test questions and analysis of crane driver (limited to bridge crane)
随机推荐
4 environment construction -standalone ha
China HDI market production and marketing demand and investment forecast analysis report Ⓢ 2022 ~ 2028
Report on the development status and investment planning trends of China's data center industry Ⓡ 2022 ~ 2028
How the computer flushes the local DNS cache
How to switch between dual graphics cards of notebook computer
Blue Bridge Cup Guoxin Changtian MCU -- program download (III)
How to solve the problem of computer networking but showing no Internet connection
Harbor integrated LDAP authentication
Development mode and Prospect of China's IT training industry strategic planning trend report Ⓣ 2022 ~ 2028
Morning flowers and evening flowers
How can enterprises and developers take advantage of the explosion of cloud native landing?
Mysql database - Advanced SQL statement (I)
SDMU OJ#P19. Stock trading
Covariance
Team collaborative combat penetration tool CS artifact cobalt strike
[Android reverse] use DB browser to view and modify SQLite database (download DB browser installation package | install DB browser tool)
Pointer concept & character pointer & pointer array yyds dry inventory
Summary of basic knowledge of exception handling
Data consistency between redis and database
QGIS grid processing DEM data reclassification