当前位置:网站首页>【ARXIV2204】Vision Transformers for Single Image Dehazing
【ARXIV2204】Vision Transformers for Single Image Dehazing
2022-07-28 05:00:00 【AI frontier theory group @ouc】

The paper :https://arxiv.org/abs/2204.03883
Code :https://github.com/IDKiro/DehazeFormer
1、 Research motivation
The author puts forward DehazeFormer For image defogging , Inspiration comes from Swin Transformer , The interesting part of the paper is reflection padding and The calculation of attention
2、 The main method
The method framework is shown in the figure below , It's a 5 Stage UNET structure , Convolution block is DehazeFormer block replace .

Reflection padding
stay SWIN in , Use shfited window To realize the interaction of information between windows , But the author believes that this operation is not friendly to the image edge region . For classification tasks , The target area is always in the middle of the image , Therefore use shift window There is no problem , But for the image restoration task , Marginal areas are equally important , Such operation is inappropriate . So , The author puts forward reflection padding operation , As shown in the figure below .

The input image size is 8x8, In the picture window yes 4x4 Of , So for the edge area replication 2 individual patch, The image size becomes 12x12, In this way, it can become 3x3=9 individual window. Here 9 individual window Local calculation in attention, After the calculation , Put the middle 8x8 Cut out the area of .
The authors also point out that , Such operations will cause the consumption of computing and memory resources .
W-MHSA with parallel convolution
The author believes that due to MHSA The aggregation weight of is dynamic and normalized , The author believes that static 、 Learnable and unconstrained aggregation weights help complement MHSA. So the author is right V Additional convolution is performed . You can also see in the overall architecture diagram of the paper V There is a convolution layer behind , And attention Add the calculation result of .
The experimental part can refer to the author's paper , There is not much here .
边栏推荐
- Special topic of APP performance design and Optimization - poor implementation affecting performance
- 使用nfpm制作rpm包
- Summary and review of puppeter
- [daily question 1] 735. Planetary collision
- Activation functions sigmoid, tanh, relu in convolutional neural networks
- Angr (XI) - official document (Part2)
- The first artificial intelligence security competition starts. Three competition questions are waiting for you to fight
- 全方位分析STEAM和创客教育的差异化
- (3.1) [Trojan horse synthesis technology]
- Machine learning and deep learning -- normalization processing
猜你喜欢

启发国内学子学习少儿机器人编程教育

Real intelligence has been certified by two of the world's top market research institutions and has entered the global camp of excellence

FreeRTOS learning (I)

HashSet add

Testcafe provides automatic waiting mechanism and live operation mode

After a year of unemployment, I learned to do cross-border e-commerce and earned 520000. Only then did I know that going to work really delayed making money!

What is the reason why the easycvr national standard protocol access equipment is online but the channel is not online?
![[idea] check out master invalid path problem](/img/83/d36362ba314177cd6f1f74f3e922cd.png)
[idea] check out master invalid path problem

Gan: generative advantageous nets -- paper analysis and the mathematical concepts behind it

为什么md5不可逆,却还可能被md5免费解密网站解密
随机推荐
(克隆虚拟机步骤)
Supervisor series: 5. Log
为什么md5不可逆,却还可能被md5免费解密网站解密
Installing MySQL under Linux
[learning record] data enhancement 1
数据安全逐步落地,必须紧盯泄露源头
Wang Shuang assembly language detailed learning notes 3: registers (memory access)
C语言ATM自动取款机系统项目的设计与开发
Program life | how to switch to software testing? (software testing learning roadmap attached)
Visual studio 2019 new OpenGL project does not need to reconfigure the environment
MySQL(5)
机器人教育在STEM课程中的设计研究
Machine learning and deep learning -- normalization processing
excel实战应用案例100讲(十一)-Excel插入图片小技巧
Domain name (subdomain name) collection method of Web penetration
HDU 1530 maximum clique
Cloudcompare & PCL point cloud least square fitting plane
解析智能扫地机器人中蕴含的情感元素
POJ 1330 Nearest Common Ancestors (lca)
jsonp 单点登录 权限检验