Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Last update: Dec 16, 2021

Related tags

Deep Learning Mask2Former

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar [arXiv]

Features

A single architecture for three tasks: panoptic, instance and semantic segmentation. This straightforward mini project was built as part of the main project, IST: A TensorFlow 2 compatible instance segmentation toolbox, with the purpose of adapting recent research into segmentation approaches into TensorFlow.
Support common benchmark datasets: ADE20K, Cityscapes, COCO, Mapillary Vistas.

Getting started

Project is currently being built, with SwinTransformerV1 and SwinTransformerV2 and a few bits and pieces ready.

License

Shield:

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citation

@article{cheng2021mask2former,
  title={Masked-attention Mask Transformer for Universal Image Segmentation},
  author={Bowen Cheng and Ishan Misra and Alexander G. Schwing and Alexander Kirillov and Rohit Girdhar},
  journal={arXiv},
  year={2021}
}

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Related tags

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Features

Getting started

License

Citation

Owner

Phan Nguyen

Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation (CVPR 2021)

FastFace: Lightweight Face Detection Framework

A small fun project using python OpenCV, mediapipe, and pydirectinput

This is the official pytorch implementation for the paper: Instance Similarity Learning for Unsupervised Feature Representation.

git《FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding》(CVPR 2021) GitHub: [fig8]

Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022

Providing the solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data.

The official repository for "Revealing unforeseen diagnostic image features with deep learning by detecting cardiovascular diseases from apical four-chamber ultrasounds"

Official PyTorch implementation of "RMGN: A Regional Mask Guided Network for Parser-free Virtual Try-on" (IJCAI-ECAI 2022)

learning and feeling SLAM together with hands-on-experiments

Project looking into use of autoencoder for semi-supervised learning and comparing data requirements compared to supervised learning.

DirectVoxGO reconstructs a scene representation from a set of calibrated images capturing the scene.

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Official PyTorch implementation of "Synthesis of Screentone Patterns of Manga Characters"

FEDn is an open-source, modular and ML-framework agnostic framework for Federated Machine Learning

Info and sample codes for "NTU RGB+D Action Recognition Dataset"

Official implementation of paper Gradient Matching for Domain Generalization

Python module providing a framework to trace individual edges in an image using Gaussian process regression.

Proximal Backpropagation - a neural network training algorithm that takes implicit instead of explicit gradient steps

This repository contains the official MATLAB implementation of the TDA method for reverse image filtering