Split Variational AutoEncoder

Last update: Sep 02, 2022

Related tags

Overview

Split-VAE

Split Variational AutoEncoder

Introduction

This repository contains and implemementation of a Split Variational AutoEncoder (SVAE). In a SVAE the output y is computed as a weighted sum

sigma * y1 + (1-sigma) * y2

where y1 and y2 are two distinct generated images, and sigma is a learned compositional map.

A Split VAE is trained as a normal VAE: no additional loss is added over the splitted images y1 and y2.

Splitting is meant to offer to the network a more flexible way to learn fruitful and independent features: as a result the variable collapse phenomenon is greatly reduced and the possibility of exploiting a larger number of latent variables improves the quality and diversity of generated samples.

Types of Splitting

The decomposition is nondeterministic, but follows two main schemes, that we may roughly categorize as either syntactical or semantical.

Syntactic decomposition

In this case, the compositional map tends to exploit the strong correlation between adjacent pixels, splitting the image in two complementary high frequency sub-images.

Below are some examples of syntactic splitting. In all the following pictures, the first row is the compositional map, then in order y1, y2 and y.

Semantic decomposition

In this case, the map typically focuses on the contours of objects, splitting the image in interesting variations of its content, with more marked and distinctive features.

Here are some examples of semantic splitting:

In case of sematic splitting, the Frèchet Inception Distance (FID) of y1 and y2 is frequently lower (hence better) than that of y, that clearly suffers from being the average of the formers.

In a sense, a SVAE forces the Variational Autoencoder to make choices, in contrast with its intrinsic tendency to average between alternatives with the aim to minimize the reconstruction loss towards a specific sample.

More examples of GENERATED images

Examples of Mnist-like gnerated digits (FID=7.47)

Here are some additional examples of semantic compositonal maps generated for CelebA, quite similar to drawings. The quality and precision of contours is both unexpected and remarkable.

And some generated faces (FID=35.1). Observe in particular the wide differentiation in pose, illumination, colors, age and expressions.

Split Variational AutoEncoder

Related tags

Overview

Split-VAE

Introduction

Types of Splitting

Syntactic decomposition

Semantic decomposition

More examples of GENERATED images

Owner

Andrea Asperti

Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at [email protected]

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

Chess reinforcement learning by AlphaGo Zero methods.

The open source code of SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation.

PyTorch code for the ICCV'21 paper: "Always Be Dreaming: A New Approach for Class-Incremental Learning"

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

Unofficial pytorch-lightning implement of Mip-NeRF

A new data augmentation method for extreme lighting conditions.

Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations

Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.

PyTorch implementation of Rethinking Positional Encoding in Language Pre-training

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.

EASY - Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.

学习 python3 以来写的一些垃圾玩具……

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network.

Scalable, event-driven, deep-learning-friendly backtesting library

Codebase for the Summary Loop paper at ACL2020

StyleGAN2-ADA-training-jupyter - Training custom datasets in styleGAN2-ADA by NVIDIA using Jupyter

Keras attention models including botnet,CoaT,CoAtNet,CMT,cotnet,halonet,resnest,resnext,resnetd,volo,mlp-mixer,resmlp,gmlp,levit

Image restoration with neural networks but without learning.