当前位置:网站首页>Swing V2: towards a larger model with larger capacity and higher resolution
Swing V2: towards a larger model with larger capacity and higher resolution
2022-07-29 03:20:00 【Autumn ink】
Swin Transformer In target detection 、 Examples segmentation and other computer vision tasks have achieved SOTA Performance of . Swin Transformer The field of computer vision has been broken CNN( Convolutional neural networks ) long-term “ rule ” The situation of , It accelerates the transformation of the basic model architecture in the field of computer vision , This work is therefore To obtain the 2021 year ICCV Best Paper Award —— Mar prize .
Swin Transformer Two key concepts are introduced to solve the original problem ViT Problems faced —— Hierarchical feature mapping and window attention conversion .
Network structure ( Innovation points )
- Swin Transformer The hierarchical construction method similar to that in convolutional neural network is used (Hierarchical feature maps), For example, there is down sampling of the image in the feature map size 4 Times ,8 Times and 16 Times , In this way backbone It is helpful to build a target detection system on this basis , Instance segmentation and other tasks . And before Vision Transformer It is directly down sampling from the beginning 16 times , The following characteristic diagram also keeps the sampling rate unchanged .
- stay Swin Transformer Used in Windows Multi-Head Self-Attention(W-MSA) The concept of , For example, in the figure below 4 Double down sampling and 8 Times down sampling , The feature graph is divided into multiple disjoint regions (Window), also Multi-Head Self-Attention Only in each window (Window) Inside . be relative to Vision Transformer Directly to the whole (Global) The characteristic diagram is used for Multi-Head Self-Attention, The purpose of this is to reduce the amount of calculation , Especially when the shallow characteristic map is very large . Although this reduces the amount of calculation, it will also isolate the information transmission between different windows , So in the paper, the author puts forward Shifted Windows Multi-Head Self-Attent
边栏推荐
- MySQL installation and configuration super detailed tutorial and simple database and table building method
- Verilog: blocking assignment and non blocking assignment
- 单例模式(饿汉式 懒汉式)
- 2. Nodejs -- path (\dirname, \filname), URL URL, querystring module, mime module, various paths (relative paths), web page loading (interview questions *)
- Flask creation process day05-06 creation project
- Navicat new database
- Minesweeping simple version
- C traps and defects Chapter 2 syntax "traps" 2.6 problems caused by "hanging" else
- Alibaba Sentinel - 工作流程及原理解析
- Unity game special effects
猜你喜欢

The Federal Reserve raised interest rates again, Powell "let go of doves" at 75 basis points, and US stocks reveled

2022-07-28 第四小组 修身课 学习笔记(every day)

Tp5.0 applet users do not need to log in and directly obtain the user's mobile number.

How to deploy sentinel cluster of redis

Rongyun real-time community solution

During the year, the first "three consecutive falls" of No. 95 gasoline returned to the "8 Yuan era"“

Flask的创建的流程day05-06之创建项目

2022-07-28 顾宇佳 学习笔记

Chapter 2 VRP command line

2. Nodejs -- path (\dirname, \filname), URL URL, querystring module, mime module, various paths (relative paths), web page loading (interview questions *)
随机推荐
2022-07-28 第四小组 修身课 学习笔记(every day)
mysql的timestamp存在的时区问题怎么解决
「PHP基础知识」输出圆周率的近似值
12_ UE4 advanced_ Change a more beautiful character model
ShardingSphere之水平分表实战(三)
Rongyun real-time community solution
国产ERP有没有机会击败SAP ?
Redis之sentinel哨兵集群怎么部署
C traps and defects Chapter 3 semantic "traps" 3.8 operators &, |, and!
MYSQL入门与进阶(十二)
MYSQL入门与进阶(十三)
Unity game special effects
12_ue4进阶_换一个更好看的人物模型
单例模式(饿汉式 懒汉式)
04 | background login: login method based on account and password (Part 1)
MYSQL入门与进阶(十四)
What is SOA (Service Oriented Architecture)?
军品技术文件划分及说明
Typescript学习(一)
Principle knowledge is useful