Iterative Normalization: Beyond Standardization towards Efficient Whitening

Last update: Dec 27, 2022

Overview

IterNorm

Code for reproducing the results in the following paper:

Iterative Normalization: Beyond Standardization towards Efficient Whitening

Lei Huang, Yi Zhou, Fan Zhu, Li Liu, Ling Shao

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. arXiv:1904.03441

This is the Torch implementation (the experimental results are based on this implementation). Other implementations are listed below:

1. PyTorch re-implementation

2. TensorFlow implementation by Lei Zhao.

=======================================================================

Requirements and Dependencies

Install Torch with CUDA (for GPU).
Install cuDNN.
Install the optnet dependency with:

luarocks install optnet

Experiments

1. Reproduce the results of the VGG network on the CIFAR-10 dataset:

Prepare the data: download CIFAR-10 and put the data files under ./data/.

Run:

bash y_execute_vggE_base.sh               # basic configuration
bash y_execute_vggE_b1024.sh              # batch size of 1024
bash y_execute_vggE_b16.sh                # batch size of 16
bash y_execute_vggE_LargeLR.sh            # 10x larger learning rate
bash y_execute_vggE_IterNorm_Iter.sh      # effect of iteration number
bash y_execute_vggE_IterNorm_Group.sh     # effect of group size

Note that the scripts don't include the setups for Decorrelated Batch Normalization (DBN). To reproduce the DBN results, please follow the instructions of the DBN project and use the corresponding hyper-parameters described in the paper.
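For readers who want to see what the iteration-number and group-size settings in the scripts above control, here is a minimal NumPy sketch of the forward whitening step described in the paper. It is not the Torch module shipped in this repository. It normalizes the covariance by its trace and applies Newton's iteration P_k = (3 P_{k-1} - P_{k-1}^3 Sigma_N) / 2 to approximate the inverse square root of the covariance. The function names iternorm_whiten and whitening_error, the (features, examples) layout, and the default eps are assumptions made for illustration; running statistics and the backward pass are omitted.

```python
import numpy as np


def iternorm_whiten(x, num_iters=5, group_size=None, eps=1e-5):
    """Whiten features with Newton's iteration instead of an eigendecomposition.

    x: array of shape (d, m), d features by m examples (assumed layout).
    num_iters: number of Newton iterations T.
    group_size: features per group; None whitens all d features together.
    """
    d, m = x.shape
    group_size = d if group_size is None else group_size
    if d % group_size != 0:
        raise ValueError("d must be divisible by group_size")

    out = np.empty((d, m), dtype=np.float64)
    for start in range(0, d, group_size):
        xg = x[start:start + group_size].astype(np.float64)
        xc = xg - xg.mean(axis=1, keepdims=True)          # center each feature
        sigma = xc @ xc.T / m + eps * np.eye(group_size)  # regularized covariance
        trace = np.trace(sigma)
        sigma_n = sigma / trace                           # eigenvalues now in (0, 1]
        p = np.eye(group_size)                            # P_0 = I
        for _ in range(num_iters):
            # P_k = (3 P_{k-1} - P_{k-1}^3 Sigma_N) / 2
            p = 0.5 * (3.0 * p - p @ p @ p @ sigma_n)
        whitening = p / np.sqrt(trace)                    # approximates sigma^(-1/2)
        out[start:start + group_size] = whitening @ xc
    return out


def whitening_error(y):
    """Largest absolute deviation of the covariance of centered y from identity."""
    cov = y @ y.T / y.shape[1]
    return float(np.abs(cov - np.eye(y.shape[0])).max())


# Illustrative data: four correlated features, 2000 examples.
rng = np.random.default_rng(0)
mix = np.eye(4) + 0.5 * np.eye(4, k=1)
x = mix @ rng.standard_normal((4, 2000))
for t in (1, 3, 5, 10):
    print(t, whitening_error(iternorm_whiten(x, num_iters=t)))
```

On data like this, more iterations bring the output covariance closer to the identity. A smaller group size whitens only within each group, so correlations between features in different groups are left in place.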

2. Reproduce the results of Wide Residual Networks on the CIFAR-10 dataset:

Prepare the data: same as for the VGG experiments on CIFAR-10 above.

Run:

bash y_execute_wr.sh

3. Reproduce the ImageNet experiments.

Download ImageNet and put it in /data/lei/imageNet/input_torch/ (you can also customize the path in opts_imageNet.lua).
Install the IterNorm module into Torch as a Lua package: go to the directory ./models/imagenet/cuSpatialDBN/ and run luarocks make cudbn-1.0-0.rockspec. (Note that the modules in ./models/imagenet/cuSpatialDBN/ are the same as those in ./module/; installing via luarocks is a convenience for training on ImageNet with multiple threads.)
Run the scripts named `z_execute_imageNet_***'.

This project is based on the training scripts of the Wide Residual Network repo and Facebook's ResNet repo.

Contact

Email: [email protected]. Discussions and suggestions are welcome!

Owner

Lei Huang
