当前位置:网站首页>FusionGAN:A generative adversarial network for infrared and visible image fusion article study notes
FusionGAN:A generative adversarial network for infrared and visible image fusion article study notes
2022-08-01 21:33:00 【qq_46165876】
FusionGAN: A generative adversarial network for infrared and visible image fusion article study notes
Generative Adversarial Networks for Fusion of Infrared and Visible Light Images
Main contributions to the article
We propose a generative adversarial architecture and design a loss function specifically for infrared and visible image fusion.The feasibility and superiority of GANs for image fusion are discussed.To the best of our knowledge, this is the first time a genetic algorithm is employed to solve the task of image fusion.
The proposed FusionGAN is an end-to-end model in which fused images can be automatically generated from input source images without the need to manually design activity level measurements or fusion rules.
We conduct experiments on publicly available infrared and visible image fusion datasets and compare qualitatively and quantitatively with state-of-the-art methods.Compared with previous methods, the proposed FusionGAN can obtain clear infrared images with clear salient objects and rich textures.
The proposed fusion method generalizes to fusing source images of different resolutions, such as low-resolution infrared images and high-resolution visible images.It produces high-resolution resulting images that are not affected by noise caused by upsampling of infrared information.
Model Structure

First, we connect the infrared image Ir and visible light in the channel dimensionImage Iv.Then, the concatenated images are fed into the generator G, whose output is the fused image If.Due to the loss function of the generator designed in this paper, in the absence of the discriminator D, If tends to preserve the thermal radiation information of the infrared image Ir and the gradient information of the visible light image Iv.After that, we feed the fused image If and the visible light image Iv to the discriminator D, which aims to distinguish If and Iv.The proposed FusionGAN establishes an adversarial game between the generator G and the discriminator D, If will gradually include more and more detailed information in the visual image Iv.In the training phase, once the generator G generates samples (i.e., If) that the discriminator D cannot distinguish, we can get the desired fused image If.
Loss function
The loss function of the generator includes the adversarial loss L_adv between the generator and the decider and the content loss L_content between the generated image and the real image, Generator loss functionDefined as follows:
where α is balanced confrontationBalance factor for loss and content loss.L_adv and L_content are defined as follows:
Among them, I_f^n represents the nth fused image in the same batch of N, and D(∙) represents the output of the decider.
where H and W represent the height of the input image, respectivelyand width, ‖∙‖_F represents the matrix Frobenius norm, ∇ represents the gradient operator.The first item of L_content aims to preserve the thermal radiation information of the infrared image I_r in the fusion image I_f, and the second item of L_content aims to preserve the gradients contained in the visible image I_vinformation, ε is a positive parameter that controls the trade-off between the two terms.
Decision:
where a andb represents the label of the fused image I_f and visible light image I_v, respectively, and D(I_v ) and D(I_f ) represent the classification results of the visible light and fused image, respectively.
The discriminator is designed to discriminate between fused and visible images based on features extracted from visible images.We use a least squares loss function, which follows minimizing the Pearson χ2 divergence.This makes the training process more stable and the discriminator's loss function converges faster.
Experimental results in the text

From left to right: fusion results of infrared image, visible light image, classical method, and fusion result of FusionGAN.
The third image is the fusion result obtained by using the classic method.Obviously, this traditional method can only preserve more texture details in the source image, while the high contrast between the target and the background in the infrared image cannot be preserved in the fusion image.In fact, the key information in the infrared image (i.e. the thermal radiation distribution) is completely lost in the fused image.
The far right image is the fusion result of FusionGAN.In contrast, our results preserve the thermal radiation distribution in the infrared image, so the target can be easily detected.At the same time, background details (i.e. trees, roads, and aquatic plants) in visible light images are also well preserved.
FusionGAN results can keep both the thermal radiation distribution in the infrared image and the apparent texture in the visible image.
Summary of other bloggersExcellent image fusion algorithms in the past five years
边栏推荐
- 作业8.1 孤儿进程与僵尸进程
- groupByKey和reduceBykey的区别
- C Pitfalls and pitfalls Appendix B Interview with Koenig and Moo
- 51.【结构体初始化的两种方法】
- Anacoda的用途
- 分类接口,淘宝分类详情 API
- 牛血清白蛋白刺槐豆胶壳聚糖缓释纳米微球/多西紫杉醇的纳米微球DTX-DHA-BSA-NPs
- 小程序--独立分包&分包预下载
- The difference between groupByKey and reduceBykey
- C pitfalls and pitfalls Chapter 8 Suggestions and answers 8.2 Answers
猜你喜欢

深拷贝浅拷贝

基于php在线学习平台管理系统获取(php毕业设计)

树莓派的信息显示小屏幕,显示时间、IP地址、CPU信息、内存信息(c语言),四线的i2c通信,0.96寸oled屏幕

The difference between groupByKey and reduceBykey

数字图像处理 第十二章——目标识别

C专家编程 前言

Based on php film and television information website management system acquisition (php graduation design)

CS-NP白蛋白包覆壳聚糖纳米颗粒/人血清白蛋白-磷酸钙纳米颗粒无机复合材料

File operations of WEB penetration

对C语言结构体内存对齐的理解
随机推荐
C专家编程 第1章 C:穿越时空的迷雾 1.3 标准I/O库和C预处理器
R语言 线性回归的有关方法
附录A printf、varargs与stdarg A.2 使用varargs.h来实现可变参数列表
XSS漏洞
C专家编程 第1章 C:穿越时空的迷雾 1.2 C语言的早期体验
C陷阱与缺陷 附录B Koenig和Moo夫妇访谈
Anacoda的用途
51.【结构体初始化的两种方法】
TP5-NPs负载噻吩类化合物TP5白蛋白纳米粒/阿魏酸钠新糖牛血清蛋白纳米粒
P7215 [JOISC2020] 首都 题解
C Expert Programming Chapter 1 C: Through the Fog of Time and Space 1.2 Early Experience of C Language
树莓派的信息显示小屏幕,显示时间、IP地址、CPU信息、内存信息(c语言),四线的i2c通信,0.96寸oled屏幕
通过这两个 hook 回顾 Set/Map 基础知识
数据库练习
R语言进行相关的操作
Classification interface, Taobao classification details API
C Pitfalls and Defects Chapter 7 Portability Defects 7.11 An Example of a Portability Problem
Spark练习题+答案
基于php动漫周边商城管理系统(php毕业设计)
方舟开服需要知道的那些事