Repository for "Exploring Sparsity in Image Super-Resolution for Efficient Inference", CVPR 2021

Last update: Dec 26, 2022

Related tags

Deep Learning SMSR

Overview

SMSR

Reposity for "Exploring Sparsity in Image Super-Resolution for Efficient Inference"

[arXiv]

Highlights

Locate and skip redundant computation in SR networks at a fine-grained level for efficient inference.
Maintain state-of-the-art performance with significant FLOPs reduction and a speedup on mobile devices.
Efficient implementation of sparse convolution based on original Pytorch APIs for easier migration and deployment.

Network Architecture

Implementation of Sparse Convolution

For easier migration and deployment, we use an efficient implementation of sparse convolution based on original Pytorch APIs rather than the commonly applied CUDA-based implementation. Specifically, sparse features are first extracted from the input, as shown in the following figure. Then, matrix multiplication is executed to produce the output features.

Requirements

Python 3.6
PyTorch == 1.1.0
numpy
skimage
imageio
matplotlib
cv2

Train

Prepare training data

Download DIV2K training data (800 training + 100 validtion images) from DIV2K dataset or SNU_CVLab.
Specify '--dir_data' based on the HR and LR images path. In option.py, '--ext' is set as 'sep_reset', which first convert .png to .npy. If all the training images (.png) are converted to .npy files, then set '--ext sep' to skip converting files.

For more informaiton, please refer to EDSR(PyTorch).

Begin to train

python main.py --model SMSR --save SMSR_X2 --scale 2 --patch_size 96 --batch_size 16

Test

Prepare test data

Download benchmark datasets (e.g., Set5, Set14 and other test sets) and prepare HR/LR images in testsets/benchmark following the example of testsets/benchmark/Set5.

Demo

python main.py --dir_data testsets --data_test Set5 --scale 2 --model SMSR --save SMSR_X2 --pre_train experiment/SMSR_X2/model/model_1000.pt --test_only --save_results

Results

Visualization of Sparse Masks

Citation

@InProceedings{Wang2020Exploring,
  author    = {Wang, Longguang and Dong, Xiaoyu and Wang, Yingqian and Ying, Xinyi and Lin, Zaiping and An, Wei and Guo, Yulan},
  title     = {Exploring Sparsity in Image Super-Resolution for Efficient Inference},
  booktitle = {CVPR},
  year      = {2021},
}

Acknowledgements

This code is built on EDSR (PyTorch). We thank the authors for sharing the codes.

Repository for "Exploring Sparsity in Image Super-Resolution for Efficient Inference", CVPR 2021

Related tags

Overview

SMSR

Highlights

Network Architecture

Implementation of Sparse Convolution

Requirements

Train

Prepare training data

Begin to train

Test

Prepare test data

Demo

Results

Visualization of Sparse Masks

Citation

Acknowledgements

Owner

Longguang Wang

A repository built on the Flow software package to explore cyber-security attacks on intelligent transportation systems.

Boostcamp CV Serving For Python

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

Implementation of Stochastic Image-to-Video Synthesis using cINNs.

Stacked Hourglass Network with a Multi-level Attention Mechanism: Where to Look for Intervertebral Disc Labeling

Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code

Rainbow: Combining Improvements in Deep Reinforcement Learning

Transformer - Transformer in PyTorch

PRIN/SPRIN: On Extracting Point-wise Rotation Invariant Features

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

A task-agnostic vision-language architecture as a step towards General Purpose Vision

DetCo: Unsupervised Contrastive Learning for Object Detection

Code, Models and Datasets for OpenViDial Dataset

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

Semi-Supervised Graph Prototypical Networks for Hyperspectral Image Classification, IGARSS, 2021.

Wenzhou-Kean University AI-LAB

RTS3D: Real-time Stereo 3D Detection from 4D Feature-Consistency Embedding Space for Autonomous Driving

BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search

Repository for the Bias Benchmark for QA dataset.