【ARXIV2204】Vision Transformers for Single Image Dehazing
2022-07-28 05:00:00 【AI frontier theory group @ouc】

Paper: https://arxiv.org/abs/2204.03883
Code: https://github.com/IDKiro/DehazeFormer
1. Research motivation
The authors propose DehazeFormer for single image dehazing, taking inspiration from Swin Transformer. The most interesting parts of the paper are the reflection padding and the way attention is computed.
2. Main method
The overall framework is shown in the figure below. It is a 5-stage U-Net structure in which the convolution blocks are replaced by DehazeFormer blocks.

Reflection padding
Swin uses shifted windows to let information flow between windows, but the authors argue that this operation is unfriendly to the edge regions of the image. In classification tasks the target is usually near the center of the image, so shifted windows cause no problem. In image restoration, however, the edge regions are just as important, and the operation is no longer appropriate. The authors therefore propose a reflection padding operation, as shown in the figure below.

The input image is 8x8 and the window in the figure is 4x4, so the edge region is padded by reflection with 2 patches on each side and the image becomes 12x12, which divides into 3x3 = 9 windows. Attention is computed locally within each of these 9 windows. After the computation, the central 8x8 region is cropped out.
The authors also note that this padding increases computation and memory consumption.
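The 8x8 example above can be sketched in NumPy as follows. This is a minimal illustration rather than the authors' code: the function names `reflect_pad_and_partition` and `merge_and_crop` are hypothetical, the input is a single-channel array, and the attention step is left out so that only the pad, partition, merge, and crop bookkeeping is shown.

```python
import numpy as np

def reflect_pad_and_partition(x, window=4, shift=2):
    """Reflect-pad a (H, W) array by `shift` on every side, then split it
    into non-overlapping window x window tiles."""
    padded = np.pad(x, shift, mode="reflect")
    H, W = padded.shape
    assert H % window == 0 and W % window == 0, "padded size must tile evenly"
    windows = (padded.reshape(H // window, window, W // window, window)
                     .transpose(0, 2, 1, 3)
                     .reshape(-1, window, window))
    return padded, windows

def merge_and_crop(windows, padded_shape, window=4, shift=2):
    """Inverse of the partition: stitch the tiles back together and crop
    away the padded border."""
    H, W = padded_shape
    merged = (windows.reshape(H // window, W // window, window, window)
                     .transpose(0, 2, 1, 3)
                     .reshape(H, W))
    return merged[shift:H - shift, shift:W - shift]

image = np.arange(64, dtype=np.float32).reshape(8, 8)
padded, windows = reflect_pad_and_partition(image)
# Local attention would be applied to each of the 9 windows here (omitted).
restored = merge_and_crop(windows, padded.shape)

print(padded.shape)   # 8x8 input grows to 12x12
print(windows.shape)  # 9 windows of 4x4
```

The padded array has 144 entries against 64 in the original, so windowed attention runs on 2.25 times as many positions in this example, which is the extra cost the authors mention.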
W-MHSA with parallel convolution
The authors observe that the aggregation weights of MHSA are dynamic (computed from the input) and normalized (softmax makes them non-negative and sum to one). They argue that static, learnable, and unconstrained aggregation weights, which is what a convolution provides, help complement MHSA. They therefore apply an additional convolution to V and add its output to the attention result. This is also visible in the paper's overall architecture diagram: a convolution layer follows V, and its output is summed with the attention output.
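A rough NumPy sketch of this idea for one window is given below. Several things here are assumptions made for illustration and are not taken from the paper or its repository: a single attention head, a 3x3 depthwise convolution with zero padding applied inside the window, no output projection, and the hypothetical names `depthwise_conv3x3` and `window_attention_with_conv`.

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)  # subtract max for stability
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def depthwise_conv3x3(v_map, kernel):
    """v_map: (H, W, C), kernel: (3, 3, C). Zero padding, stride 1.
    The kernel is static and its weights are unconstrained."""
    H, W, _ = v_map.shape
    padded = np.pad(v_map, ((1, 1), (1, 1), (0, 0)))
    out = np.zeros_like(v_map)
    for dy in range(3):
        for dx in range(3):
            out += padded[dy:dy + H, dx:dx + W, :] * kernel[dy, dx, :]
    return out

def window_attention_with_conv(x, Wq, Wk, Wv, kernel, window=4):
    """x: (window * window, C) tokens of a single window."""
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    # Dynamic, normalized weights: each row is non-negative and sums to 1.
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]))
    attn_out = attn @ v
    # Parallel branch: convolution over V arranged as a window x window map.
    v_map = v.reshape(window, window, -1)
    conv_out = depthwise_conv3x3(v_map, kernel).reshape(v.shape)
    return attn_out + conv_out, attn

rng = np.random.default_rng(0)
C = 8
x = rng.standard_normal((16, C))
Wq, Wk, Wv = (rng.standard_normal((C, C)) * 0.1 for _ in range(3))
kernel = rng.standard_normal((3, 3, C)) * 0.1
out, attn = window_attention_with_conv(x, Wq, Wk, Wv, kernel)
attn_only, _ = window_attention_with_conv(x, Wq, Wk, Wv, np.zeros((3, 3, C)))
```

The contrast between the two branches is visible in the weights themselves: every row of `attn` is a probability distribution that changes with the input, while `kernel` is fixed after training and may contain negative values.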
For the experiments, please refer to the paper. They are not covered in detail here.