Official Tensorflow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)

Last update: Jan 04, 2023

Related tags

Deep Learning UGATIT

Overview

U-GAT-IT — Official TensorFlow Implementation (ICLR 2020)

: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

Paper | Official Pytorch code

This repository provides the official Tensorflow implementation of the following paper:

U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation
Junho Kim (NCSOFT), Minjae Kim (NCSOFT), Hyeonwoo Kang (NCSOFT), Kwanghee Lee (Boeing Korea)

Abstract We propose a novel method for unsupervised image-to-image translation, which incorporates a new attention module and a new learnable normalization function in an end-to-end manner. The attention module guides our model to focus on more important regions distinguishing between source and target domains based on the attention map obtained by the auxiliary classifier. Unlike previous attention-based methods which cannot handle the geometric changes between domains, our model can translate both images requiring holistic changes and images requiring large shape changes. Moreover, our new AdaLIN (Adaptive Layer-Instance Normalization) function helps our attention-guided model to flexibly control the amount of change in shape and texture by learned parameters depending on datasets. Experimental results show the superiority of the proposed method compared to the existing state-of-the-art models with a fixed network architecture and hyper-parameters.

Requirements

python == 3.6
tensorflow == 1.14

Pretrained model

We released 50 epoch and 100 epoch checkpoints so that people could test more widely.

Dataset

selfie2anime dataset

Web page

Selfie2Anime by Nathan Glover
Selfie2Waifu by creke

Telegram Bot

Selfie2AnimeBot by Alex Spirin

Usage

├── dataset
   └── YOUR_DATASET_NAME
       ├── trainA
           ├── xxx.jpg (name, format doesn't matter)
           ├── yyy.png
           └── ...
       ├── trainB
           ├── zzz.jpg
           ├── www.png
           └── ...
       ├── testA
           ├── aaa.jpg 
           ├── bbb.png
           └── ...
       └── testB
           ├── ccc.jpg 
           ├── ddd.png
           └── ...

Train

> python main.py --dataset selfie2anime

If the memory of gpu is not sufficient, set --light to True
- But it may not perform well
- paper version is --light to False

Test

> python main.py --dataset selfie2anime --phase test

Architecture

Results

Ablation study

User study

Kernel Inception Distance (KID)

Citation

If you find this code useful for your research, please cite our paper:

@inproceedings{
Kim2020U-GAT-IT:,
title={U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation},
author={Junho Kim and Minjae Kim and Hyeonwoo Kang and Kwang Hee Lee},
booktitle={International Conference on Learning Representations},
year={2020},
url={https://openreview.net/forum?id=BJlZ5ySKPH}
}

Author

Junho Kim, Minjae Kim, Hyeonwoo Kang, Kwanghee Lee

Official Tensorflow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)

Related tags

Overview

U-GAT-IT — Official TensorFlow Implementation (ICLR 2020)

: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

Paper | Official Pytorch code

Requirements

Pretrained model

Dataset

Web page

Telegram Bot

Usage

Train

Test

Architecture

Results

Ablation study

User study

Kernel Inception Distance (KID)

Citation

Author

Owner

Junho Kim

Official PyTorch implementation of RobustNet (CVPR 2021 Oral)

A novel benchmark dataset for Monocular Layout prediction

The implement of papar "Enhanced Graph Learning for Collaborative Filtering via Mutual Information Maximization"

Using LSTM write Tang poetry

Tracking code for the winner of track 1 in the MMP-Tracking Challenge at ICCV 2021 Workshop.

This project uses Template Matching technique for object detecting by detection of template image over base image.

Official implementation for the paper "Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection"

Unofficial PyTorch implementation of "RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving" (ECCV 2020)

Project dự đoán giá cổ phiếu bằng thuật toán LSTM gồm: code train và code demo

Production First and Production Ready End-to-End Speech Recognition Toolkit

Implementation for our AAAI2021 paper (Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction).

Pretrained SOTA Deep Learning models, callbacks and more for research and production with PyTorch Lightning and PyTorch

Memory-efficient optimum einsum using opt_einsum planning and PyTorch kernels.

We will see a basic program that is basically a hint to brute force attack to crack passwords. In other words, we will make a program to Crack Any Password Using Python. Show some ❤️ by starring this repository!

ML course - EPFL Machine Learning Course, Fall 2021

Entity-Based Knowledge Conflicts in Question Answering.

A Robust Unsupervised Ensemble of Feature-Based Explanations using Restricted Boltzmann Machines

Pytorch implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

These are the materials for the paper "Few-Shot Out-of-Domain Transfer Learning of Natural Language Explanations"

Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms