当前位置：网站首页>Development Series III of GaN (lapgan, srgan)

Development Series III of GaN (lapgan, srgan)

2022-07-24 17:33:00 【51CTO】

GAN Development Series III of （LapGAN、SRGAN）

We have already introduced it in the previous article GAN The introduction to generating countermeasure networks and some GAN series , In the following album will continue to introduce some of the more classic GAN.

GAN Introduction to generating countermeasure network

GAN The development of the series one （CGAN、DCGAN、WGAN、WGAN-GP、LSGAN、BEGAN）

GAN The development of series 2 （PGGAN、SinGAN）

One 、 LapGAN
The paper ：《Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks》
Address of thesis ：https://arxiv.org/abs/1506.05751
1、 The basic idea
LapGAN It's based on GAN and CGAN On the basis of , use Laplacian Pyramid The pyramid of Laplace To generate images from thick to thin , So as to generate high-resolution images . At each level of the pyramid, there are learning residuals with adjacent levels , By constantly stacking CGAN Get the final resolution .CGAN As we mentioned in the previous article, it is in GAN Add conditional constraints on the basis of , To alleviate the original GAN The generator generates samples too freely .
original GAN The formula of is ：

GAN Development Series III of （LapGAN、SRGAN）_ The pyramid of Gauss

CGAN The formula of is ：

GAN Development Series III of （LapGAN、SRGAN）_ Loss function _02

2、 The pyramid of Laplace
Laplacian pyramid is the result of continuous up sampling of images in scale space , Gaussian pyramid is the result of continuous down sampling of images in scale space . First build Gaussian pyramid , To image I0 For continuous K Next sampling , obtain

GAN Development Series III of （LapGAN、SRGAN）_ Loss function _03

Is the first K The Laplace pyramid on level is

GAN Development Series III of （LapGAN、SRGAN）_ The pyramid of Gauss _04

The Laplace pyramids on other levels are ：

GAN Development Series III of （LapGAN、SRGAN）_ The pyramid of Gauss _05

Laplacian pyramid No k The layer is equal to the Gaussian pyramid k Layer minus Gaussian pyramid k+1 Upper sampling of layer .
Use the Laplace pyramid to restore the image ：

GAN Development Series III of （LapGAN、SRGAN）_ The pyramid of Gauss _06

3、LapGAN principle
With K=3 For example , At this time, the pyramid of Laplace is 4 The layer structure , contain 4 A generator G0、G1、G2、G3, Generate... Separately 4 A resolution image 64x64、32x32、16x16、8x8, The lowest resolution image to train the original GAN, Input only noise , Later, higher resolution image training CGAN, Input the image sampled on the Gaussian pyramid image with noise and the same level .
LapGAN Through a series of CGAN In series , Constantly generate higher resolution .

GAN Development Series III of （LapGAN、SRGAN）_ generator _07

GAN Development Series III of （LapGAN、SRGAN）_ The pyramid of Gauss _08

LAPGAN stay CIFAR10、STL10 and LSUN Experiments were conducted on three data sets , The generated image is as follows ：

GAN Development Series III of （LapGAN、SRGAN）_ The pyramid of Gauss _09

Two 、 SRGAN
The paper 《Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network》
Address of thesis ：https://arxiv.org/pdf/1609.04802.pdf
Code address ：https://github.com/OUCMachineLearning/OUCML/blob/master/GAN/srgan_celebA/srgan.py
1、 The basic idea
SRGAN Yes, it will GAN Applied to the field of image super-resolution ,CNN Convolutional neural network has achieved very good results in the traditional super-resolution reconstruction , High peak signal-to-noise ratio can be achieved PSNR, With MSE Is the objective function of minimization .SRGAN Is the first to recover 4 The algorithm framework of down sampling image , The author proposes a perceptual loss function , Including confrontation loss and content loss , The counter loss comes from the discriminator , It is used to distinguish the real image from the generated super-resolution image , Content loss focuses on visual similarity .

GAN Development Series III of （LapGAN、SRGAN）_ The pyramid of Gauss _10

Usually, image super-resolution algorithm uses the mean square error between the reconstructed super-resolution image and the real image MSE As an objective function , Optimize MSE So as to improve PSNR, however MSE and PSNR The value of is not a good indicator of the visual effect , The following figure PSNR The vision with the highest value is not good .

GAN Development Series III of （LapGAN、SRGAN）_ Loss function _11

2、 Network structure
Usually per pixel MSE Due to excessive smoothing, it is difficult to deal with the super-resolution details of the image , This paper designs a new loss function , Will be per pixel MSE Replace loss with content loss . Perceived loss is expressed as the weighted sum of content loss and adversarial loss ,

GAN Development Series III of （LapGAN、SRGAN）_ The pyramid of Gauss _12

Content Loss It is the loss per pixel of the feature map of a certain layer as the content loss ,

GAN Development Series III of （LapGAN、SRGAN）_ Loss function _13

Adversarial Loss Against the loss

GAN Development Series III of （LapGAN、SRGAN）_ generator _14

The network structure proposed by the author is as follows , The generator consists of a residual structure Residual blocks form ,

GAN Development Series III of （LapGAN、SRGAN）_ Loss function _15

The author uses sub-pixel Network as a generative network , use VGG As a discriminant network GAN Got very good results , But this uses the difference per pixel as the loss function . after , The author tries to use the perceptual loss function proposed by himself as the optimization goal , although PSNR and SSIM Not high , But the visual effect is better than other networks , Avoid the over smooth characteristics of other methods .

GAN Development Series III of （LapGAN、SRGAN）_ generator _16