Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Recent works have made great success in semantic segmentation by exploiting contextual information in a local or global manner within individual image and supervising the model with pixel-wise cross entropy loss. However, from the holistic view of the whole dataset, semantic relations not only exist inside one single image, but also prevail in the whole training data, which makes solely considering intra-image correlations insufficient. Inspired by recent progress in unsupervised contrastive learning, we propose the region-aware contrastive learning (RegionContrast) for semantic segmentation in the supervised manner. In order to enhance the similarity of semantically similar pixels while keeping the discrimination from others, we employ contrastive learning to realize this objective. With the help of memory bank, we explore to store all the representative features into the memory. Without loss of generality, to efficiently incorporate all training data into the memory bank while avoiding taking too much computation resource, we propose to construct region centers to represent features from different categories for every image. Hence, the proposed region-aware contrastive learning is performed in a region level for all the training data, which saves much more memory than methods exploring the pixel-level relations. The proposed RegionContrast brings little computation cost during training and requires no extra overhead for testing. Extensive experiments demonstrate that our method achieves state-of-the-art performance on three benchmark datasets including Cityscapes, ADE20K and COCO Stuff. For more details, please refer to our ICCV paper (paper).

Installation

Check INSTALL.md for installation instructions.

Training and Evaluation

cd experiments/v3_contrast
bash train.sh

Citation

@InProceedings{Hu_2021_ICCV,
    author    = {Hu, Hanzhe and Cui, Jinshi and Wang, Liwei},
    title     = {Region-Aware Contrastive Learning for Semantic Segmentation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {16291-16301}
}

TODO

Dynamic Sampling

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Related tags

Overview

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Installation

Training and Evaluation

Citation

TODO

Owner

Hanzhe Hu

A Data Annotation Tool for Semantic Segmentation, Object Detection and Lane Line Detection.(In Development Stage)

BlueFog Tutorials

Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations

House3D: A Rich and Realistic 3D Environment

Code for Greedy Gradient Ensemble for Visual Question Answering （ICCV 2021, Oral）

Learning Representations that Support Robust Transfer of Predictors

Inhomogeneous Social Recommendation with Hypergraph Convolutional Networks

[ICCV2021] Learning to Track Objects from Unlabeled Videos

Dynamics-aware Adversarial Attack of 3D Sparse Convolution Network

Line-level Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Implementation for the paper SMPLicit: Topology-aware Generative Model for Clothed People (CVPR 2021)

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

A High-Level Fusion Scheme for Circular Quantities published at the 20th International Conference on Advanced Robotics

Gauge equivariant mesh cnn

My solution for the 7th place / 245 in the Umoja Hack 2022 challenge

Volsdf - Volume Rendering of Neural Implicit Surfaces

A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently develop and compare their own methods.

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

Trading Gym is an open source project for the development of reinforcement learning algorithms in the context of trading.

FANet - Real-time Semantic Segmentation with Fast Attention