当前位置:网站首页>【CVPR 2019】Semantic Image Synthesis with Spatially-Adaptive Normalization(SPADE)
【CVPR 2019】Semantic Image Synthesis with Spatially-Adaptive Normalization(SPADE)
2022-06-26 09:24:00 【_ Summer tree】
List of articles

# Spatial adaptive regularization
We propose spatially-adaptive normalization, a simplebut effective layer for synthesizing photorealistic images given an input semantic layout. we propose usingthe input layout for modulating the activations in normal-ization layers through a spatially-adaptive, learned trans-formation.
Experiments on several challenging datasetsdemonstrate the advantage of the proposed method over ex-isting approaches, regarding both visual fidelity and align-ment with input layouts. Finally, our model allows usercontrol over both semantic and style.
Introduction
In this paper, we show that the conventional net-work architecture [22, 48], which is built by stacking con-volutional, normalization, and nonlinearity layers, is at best suboptimal because their normalization layers tend to “washaway” information contained in the input semantic masks. // In this paper , We proved the traditional network architecture [22,48], It's made up of convolution layers 、 The normalized layer and the nonlinear layer are superimposed , In the best case, it is suboptimal , Because their normalized layers tend to “ Eliminate ” Information contained in the input semantic mask .
To address the issue, we proposespatially-adaptive normal-ization, a conditional normalization layer that modulates theactivations using input semantic layouts through a spatially-adaptive, learned transformation and can effectively propa-gate the semantic information throughout the network. // To solve this problem , We propose a spatial adaptive standardization , It is a conditional normalization layer , The activation of input semantic layout is adjusted through spatial adaptive learning transformation , And it can effectively spread semantic information throughout the network .
This passage is like abstract.

Figure 1: Our model allows user control over both semantic and style as synthesizing an image. The semantic (e.g., theexistence of a tree) is controlled via a label map (the top row), while the style is controlled via the reference style image (theleftmost column). Please visit our website for interactive image synthesis demos. // chart 1: Our model allows users to control the semantics and style of synthetic images . semantics ( for example , The existence of trees ) Is mapped by tags ( The top row ) Controlled , And the style is by referring to the style image ( The leftmost column ) Controlled . Please visit our website for interactive image synthesis demonstration .
our goal is to design a generator forstyle and semantics disentanglement. We focus on provid-ing the semantic information in the context of modulatingnormalized activations. We use semantic maps in differentscales, which enables coarse-to-fine generation. The readeris encouraged to review their work for more details. // Our goal is to design a generator for style and semantic decomposition . We focus on providing semantic information in the context of regulating canonical activation . We use semantic mapping at different scales , This makes coarse to fine generation possible . Readers are encouraged to review their work for more details .
3. Semantic Image Synthesis
Our goal is to learn a mapping function , It can split an input mask Convert to realistic images
Spatially-adaptive denormalization.

Figure 2: In the SPADE, the mask is first projected onto anembedding space and then convolved to produce the modu-lation parametersγandβ. Unlike prior conditional normal-ization methods,γandβare not vectors, but tensors withspatial dimensions. The producedγandβare multipliedand added to the normalized activation element-wise // chart 2: stay SPADE in , The mask is first projected into the embedding space , Then convolution generates modulation parameters γ and β. Different from the prior conditional normalization method ,γ and β It's not a vector , It's a tensor with a spatial dimension . The generated γ and β Multiply and add to the normalized active elements .
The activation value at site(n∈N,c∈Ci,y∈Hi,x∈Wi)is

\gamma and \beta are thelearned modulation parameters of the normalization layer/
In contrast to the BatchNorm [21], they depend on the in-put segmentation mask and vary with respect to the location(y,x).


Figure 3: Comparing results given uniform segmentationmaps: while the SPADE generator produces plausible tex-tures, the pix2pixHD generator [48] produces two identicaloutputs due to the loss of the semantic information after thenormalization layer.
SPADE generator.
Figure 4: In the SPADE generator, each normalization layer uses the segmentation mask to modulate the layer activations.(left)Structure of one residual block with the SPADE.(right)The generator contains a series of the SPADE residual blockswith upsampling layers. Our architecture achieves better performance with a smaller number of parameters by removing thedownsampling layers of leading image-to-image translation networks such as the pix2pixHD model [48].
conclusion
We propose spatial adaptive normalization , Layout with input semantics , Affine transformation is performed in the normalization layer . The proposed normalization leads to the first semantic image synthesis model , The model can be generated including indoor 、 Outside 、 Realistic output of various scenes including landscape and street scenes . We further demonstrate its application in multimodal synthesis and guided image synthesis .
边栏推荐
- 【Sensors 2021】Relation-Based Deep Attention Network with Hybrid Memory for One-Shot Person Re-Id
- Phpcms mobile station module implements custom pseudo static settings
- Self taught neural network series - 4 learning of neural network
- Thinkphp5 using the composer installation plug-in prompts that the PHP version is too high
- 【CVPR 2021】Joint Generative and Contrastive Learning for Unsupervised Person Re-identification
- 《一周搞定模电》-二极管
- 集合对象复制
- Unity connects to Turing robot
- Pycharm occasionally encounters low disk space
- Merrill Lynch data technology expert team | application of recommendation of relevant contents in group system data retrieval
猜你喜欢

《一周搞定模电》-光耦等元器件

How to convert wechat applet into Baidu applet

Dedecms applet plug-in is officially launched, and one click installation does not require any PHP or SQL Foundation

《單片機原理及應用》——概述

How to view the data mini map quickly and conveniently after importing data in origin

php提取txt文本存储json数据中的域名

What is optimistic lock and what is pessimistic lock

【开源】使用PhenoCV-WeedCam进行更智能、更精确的杂草管理

Self taught neural network series - 9 convolutional neural network CNN

3大问题!Redis缓存异常及处理方案总结
随机推荐
Detectron2 save (according to maxap50) model during training_ best. PTH weight
《一周搞定数电》-逻辑门
《一周搞定模电》—功率放大器
【pulsar学习】pulsar架构原理
Spark based distributed parallel processing optimization strategy - Merrill Lynch data
MySQL cannot be found in the service (not uninstalled)
Unity webgl publishing cannot run problem
51 single chip microcomputer ROM and ram
首期Techo Day腾讯技术开放日,628等你
Shared by Merrill Lynch data technology expert team, smoking detection related practice based on Jetson nano
Catalogue gradué de revues scientifiques et technologiques de haute qualité dans le domaine de l'informatique
計算領域高質量科技期刊分級目錄
Principe et application du micro - ordinateur à puce unique - Aperçu
【CVPR 2021】Intra-Inter Camera Similarity for Unsupervised Person Re-Identification (IICS++)
《一周搞定模电》-二极管
Self taught neural network series - 4 learning of neural network
Phpcms applet interface new universal interface get_ diy. php
Upgrade phpcms applet plug-in API interface to 4.3 (add batch acquisition interface, search interface, etc.)
行为树 文件说明
Classified catalogue of high quality sci-tech periodicals in the field of computing