当前位置:网站首页>[paper reading] unpaired image to image translation using cycle consistent advantageous networks
[paper reading] unpaired image to image translation using cycle consistent advantageous networks
2022-07-25 20:24:00 【xiongxyowo】
[ Address of thesis ][ Code ][ICCV 17]
Abstract
Image to image translation is a kind of visual and graphic problems , The goal is to use training of a set of aligned image pairs to learn the mapping between input images and output images . However , For many tasks , Paired training data is not available . We propose a way , Learn to remove images from the source domain without pairing instances X Translate to the target domain Y. Our goal is to learn a mapping G : X − > Y G:X->Y G:X−>Y, send G ( X ) G(X) G(X) Image distribution and use of antagonistic loss Y Y Y The distribution is indistinguishable . Because this mapping is highly under constrained , Let's map it to a reverse F : Y − > X F: Y -> X F:Y−>X Combine , And introduce a cyclic consistency loss to promote F ( G ( X ) ) X F(G(X)) ~ X F(G(X)) X( vice versa ). Qualitative results are presented on several tasks that do not have paired training data , Including the transfer of collection style 、 Object conversion 、 Shift of seasons 、 Photo enhancement and so on . The quantitative comparison with several previous methods shows the superiority of our method .
Method
This article is famous CycleGAN, The core idea of the method is as follows :
It consists of two generators ( G , F ) (G, F) (G,F) And two discriminators ( D X , D Y ) (D_X, D_Y) (DX,DY) constitute . For the input source domain image X X X, Send it to the first generator G G G, Then you can get a false target domain image G ( X ) G(X) G(X). Judging device D Y D_Y DY Need to be able to distinguish the actual target domain image Y Y Y And false target domain images G ( X ) G(X) G(X), So that the generated G ( X ) G(X) G(X) The style features included are more ; At the same time , Swap the target domain with the source domain , Then the target and image Y Y Y Sending a generator F F F after , You can get a fake source domain image G ( Y ) G(Y) G(Y). Judging device D X D_X DX You need to be able to distinguish the actual source domain image X X X And fake source domain images G ( Y ) G(Y) G(Y), So that the generated G ( Y ) G(Y) G(Y) The style features contained are more realistic .
The advantage of this is , Because the task of image style conversion in this paper is " Unsupervised ", No matching " From the - Target domain " The image is right , It is equivalent to only being able to constrain whether the generated image meets the new style , There is no way to constrain whether the generated image is consistent in content . And with cycle After the form , After a picture goes in , First, it becomes G ( X ) G(X) G(X), And then it becomes F ( G ( X ) ) F(G(X)) F(G(X)), By restraint X X X Should be the same F ( G ( X ) ) F(G(X)) F(G(X)) As similar as possible , So as to ensure that the network still needs to maintain details as much as possible while learning how to change styles , To achieve one " Self supervision ".
The loss function consists of two parts , One is to restrict the image style to complete the conversion of the confrontation loss : L GAN ( G , D Y , X , Y ) = E y ∼ p data ( y ) [ log D Y ( y ) ] + E x ∼ p data ( x ) [ log ( 1 − D Y ( G ( x ) ) ] \mathcal{L}_{\text{GAN}}(G,\ D_{Y},\ X,\ Y) = \mathbb{E}_{y\sim p_{\text{data}}(y)}[\log D_{Y}(y)]+\mathbb{E}_{x\sim p_{\text{data}}(x)}[\log(1- D_{Y}(G(x))] LGAN(G, DY, X, Y)=Ey∼pdata(y)[logDY(y)]+Ex∼pdata(x)[log(1−DY(G(x))]
This loss is necessary as long as style conversion is done , There's nothing to say . The other is the cyclic consistency loss of keeping the constraint content consistent : L cyc ( G , F ) = E x ∼ p data ( x ) [ ∥ F ( G ( x ) ) − x ∥ 1 ] + E y ∼ p data ( ( y ) [ ∥ G ( F ( y ) ) − y ∥ 1 ] \mathcal{L}_{\text{cyc}}(G,\ F)=\mathbb{E}_{x\sim p_{\text{data}}(x)}[\Vert F(G(x))-x \Vert_{1}]+\mathbb{E}_{y\sim p_{\text{data}}((y)}[\Vert G(F(y))-y \Vert_{1}] Lcyc(G, F)=Ex∼pdata(x)[∥F(G(x))−x∥1]+Ey∼pdata((y)[∥G(F(y))−y∥1]
For this kind of " Unsupervised " In terms of image style conversion , The upper limit of its effect is Pix2Pix such " Supervised " In the form of .CycleGAN One of the main problems of is the inability to deal with geometric transformations , Because the loss of cyclic consistency will make the content of the image as unchanged as possible in the process of converting to the target domain , That is, it is more likely to be " cat => cat => cat ", And it's hard " cat => Dog => cat ".
边栏推荐
- wallys//wifi6 wifi5 router IPQ6018 IPQ4019 IPQ4029 802.11ax 802.11ac
- [advanced mathematics] [3] Application of differential mean value theorem and derivative
- CarSim simulation quick start (16) - ADAS sensor objects of CarSim sensor simulation (2)
- wallys//IPQ5018/IPQ6010/PD-60 802.3AT Input Output 10/100/1000M
- 如何保证定制滑环质量
- 9. < tag dynamic programming and subsequence, subarray> lt.718. Longest repeated subarray + lt.1143. Longest common subsequence
- FormatDateTime说解[通俗易懂]
- 导电滑环在机械设备方面的应用
- [tensorrt] dynamic batch reasoning
- Export and call of onnx file of pytorch model
猜你喜欢

Application of conductive slip ring in mechanical equipment
![[today in history] July 15: Mozilla foundation was officially established; The first operation of Enigma cipher machine; Nintendo launches FC game console](/img/7d/7a01c8c6923077d6c201bf1ae02c8c.png)
[today in history] July 15: Mozilla foundation was officially established; The first operation of Enigma cipher machine; Nintendo launches FC game console

【高等数学】【3】微分中值定理与导数的应用

PMP采用最新考纲,这里有【敏捷项目管理】

Myormframeworkjdbc review and problem analysis of user-defined persistence layer framework, and thought analysis of user-defined persistence layer framework

Mobile web layout method
![[today in history] July 13: the father of database passed away; Apple buys cups code; IBM chip Alliance](/img/2d/c23a367c9e8e2806ffd5384de273d2.png)
[today in history] July 13: the father of database passed away; Apple buys cups code; IBM chip Alliance

雷达水位计的工作原理及安装维护注意事项
![[advanced mathematics] [8] differential equation](/img/83/b6b07540e3cf6d6433e57447d42ee9.png)
[advanced mathematics] [8] differential equation

PMP practice once a day | don't get lost in the exam -7.25
随机推荐
【高等数学】【4】不定积分
Three skills of interface request merging, and the performance is directly exploded!
Docker 搭建 Redis Cluster集群
Introduction and construction of consul Registration Center
10.< tag-动态规划和子序列, 子数组>lt.53. 最大子数组和 + lt.392. 判断子序列 dbc
Myormframeworkjdbc review and problem analysis of user-defined persistence layer framework, and thought analysis of user-defined persistence layer framework
Technology cloud report: what is the difference between zero trust and SASE? The answer is not really important
Fanoutexchange switch code tutorial
网络爬虫原理解析「建议收藏」
各厂商网络虚拟化的优势
Link list of sword finger offer question bank summary (III) (C language version)
PMP采用最新考纲,这里有【敏捷项目管理】
[advanced mathematics] [6] differential calculus of multivariate functions
参与开源社区还有证书拿?
Clickhouse notes 02 -- installation test clickvisual
[onnx] export pytorch model to onnx format: support multi parameter and dynamic input
String of sword finger offer question bank summary (II) (C language version)
Web crawler principle analysis "suggestions collection"
Socket error Event: 32 Error: 10053. Connection closing...Socket close
Distributed link logging minbox logging usage document