Per-Pixel Classification is Not All You Need for Semantic Segmentation

Last update: Jan 08, 2023

Overview

MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Bowen Cheng, Alexander G. Schwing, Alexander Kirillov

[arXiv] [Project] [BibTeX]

Features

Better results while being more efficient.
Unified view of semantic- and instance-level segmentation tasks.
Supports major semantic segmentation datasets: ADE20K, Cityscapes, COCO-Stuff, Mapillary Vistas.
Supports all Detectron2 models.
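
The unified view in the list above rests on mask classification: rather than classifying every pixel independently, the model predicts a set of binary masks, each paired with a single class label. The following is a minimal sketch of how such a set of predictions could be merged into a semantic label map. It is written in plain Python for illustration only and is not the repository's code; the function names, the toy inputs, and the convention that the last class score means "no object" are assumptions made for this example.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a flat list of scores."""
    peak = max(logits)
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(value):
    return 1.0 / (1.0 + math.exp(-value))


def semantic_inference(class_logits, mask_logits):
    """Merge N (class, mask) predictions into an H x W label map.

    class_logits: N lists of K + 1 scores; the last entry is assumed
                  to be a "no object" score (illustrative convention).
    mask_logits:  N masks, each an H x W nested list of scores.
    """
    # Per-prediction class probabilities, with "no object" dropped.
    class_probs = [softmax(row)[:-1] for row in class_logits]
    num_classes = len(class_probs[0])
    height, width = len(mask_logits[0]), len(mask_logits[0][0])

    labels = []
    for y in range(height):
        row = []
        for x in range(width):
            # Score of class c at this pixel: sum over predictions of
            # (class probability) * (mask probability at the pixel).
            scores = [
                sum(p[c] * sigmoid(m[y][x])
                    for p, m in zip(class_probs, mask_logits))
                for c in range(num_classes)
            ]
            row.append(max(range(num_classes), key=scores.__getitem__))
        labels.append(row)
    return labels


# Toy input: two predictions, two classes plus "no object", a 1 x 2 image.
toy_class_logits = [[5.0, 0.0, 0.0], [0.0, 5.0, 0.0]]
toy_mask_logits = [[[4.0, -4.0]], [[-4.0, 4.0]]]
toy_labels = semantic_inference(toy_class_logits, toy_mask_logits)
```

In the toy input, the first prediction is confident about class 0 and covers the left pixel, while the second is confident about class 1 and covers the right pixel. Because each prediction carries its own mask, the same set of outputs can in principle be read per instance instead of being summed per class, which is what lets one formulation serve both kinds of task.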

Installation

See installation instructions.

Getting Started

See Preparing Datasets for MaskFormer.

See Getting Started with MaskFormer.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the MaskFormer Model Zoo.

License

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However, portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citing MaskFormer

If you use MaskFormer in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@article{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={arXiv},
  year={2021}
}

Owner

Facebook Research
