This is the pytorch re-implementation of the IterNorm

Last update: Dec 27, 2022

Related tags

Deep Learning IterNorm-pytorch

Overview

IterNorm-pytorch

Pytorch reimplementation of the IterNorm methods, which is described in the following paper:

Iterative Normalization: Beyond Standardization towards Efficient Whitening

Lei Huang, Yi Zhou, Fan Zhu, Li Liu, Ling Shao

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 (accepted). arXiv:1904.03441

This project also provide the pytorch implementation of Decorrelated Batch Normalization (CVPR 2018, arXiv:1804.08450), more details please refer to the Torch project.

Requirements and Dependency

Install PyTorch with CUDA (for GPU). (Experiments are validated on python 3.6.8 and pytorch-nightly 1.0.0)
(For visualization if needed), install the dependency visdom by:

pip install visdom

Experiments

1. VGG-network on Cifar-10 datasets:

run the scripts in the ./cifar10/experiments/vgg. Note that the dataset root dir should be altered by setting the para '--dataset-root', and the dataset style is described as:

-<dataset-root>
|-cifar10-batches-py
||-data_batch_1
||-data_batch_2
||-data_batch_3
||-data_batch_4
||-data_batch_5
||-test_batch

If the dataset is not exist, the script will download it, under the conditioning that the dataset-root dir is existed

2. Wide-Residual-Network on Cifar-10 datasets:

run the scripts in the ./cifar10/experiments/wrn.

3. ImageNet experiments.

run the scripts in the ./ImageNet/experiment. Note that resnet18 experimetns are run on one GPU, and resnet-50/101 are run on 4 GPU in the scripts.

Note that the dataset root dir should be altered by setting the para '--dataset-root'. and the dataset style is described as:

-<dataset-root>
|-train
||-class1
||-...
||-class1000  
|-var
||-class1
||-...
||-class1000

Using IterNorm in other projects/tasks

(1) copy ./extension/normalization/iterative_normalization.py to the respective dir.

(2) import the IterNorm class in iterative_normalization.py

(3) generally speaking, replace the BatchNorm layer by IterNorm, or add it in any place if you want to the feature/channel decorrelated. Considering the efficiency (Note that BatchNorm is intergrated in cudnn while IterNorm is based on the pytorch script without optimization), we recommend 1) replace the first BatchNorm; 2) insert extra IterNorm before the first skip connection in resnet; 3) inserted before the final linear classfier as described in the paper.

(4) Some tips related to the hyperparamters (Group size G and Iterative Number T). We recommend G=64 (i.e., the channel number in per group is 64) and T=5 by default. If you run on large batch size (e.g.>1024), you can either increase G or T. For fine tunning, fix G=64 or G=32, and search T={3,4,5,6,7,8} may help.

This is the pytorch re-implementation of the IterNorm

Related tags

Overview

IterNorm-pytorch

Requirements and Dependency

Experiments

1. VGG-network on Cifar-10 datasets:

2. Wide-Residual-Network on Cifar-10 datasets:

3. ImageNet experiments.

Using IterNorm in other projects/tasks

Owner

Lei Huang

Course on computational design, non-linear optimization, and dynamics of soft systems at UIUC.

Experiments and examples converting Transformers to ONNX

Yet Another Robotics and Reinforcement (YARR) learning framework for PyTorch.

A minimalist environment for decision-making in autonomous driving

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

A PyTorch Implementation of Single Shot MultiBox Detector

Official code repository for the EMNLP 2021 paper

Neural machine translation between the writings of Shakespeare and modern English using TensorFlow

This is a code repository for paper OODformer: Out-Of-Distribution Detection Transformer

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

Decision Transformer: A brand new Offline RL Pattern

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022

Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code

Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"

Sample and Computation Redistribution for Efficient Face Detection

Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17

TakeInfoatNistforICS - Take Information in NIST NVD for ICS

Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)

Character-Input - Create a program that asks the user to enter their name and their age

pytorch implementation of Attention is all you need