List of awesome things around semantic segmentation 🎉

Overview

Awesome Semantic Segmentation

Awesome

List of awesome things around semantic segmentation 🎉

Semantic segmentation is a computer vision task in which we label specific regions of an image according to what's being shown. Semantic segmentation awswers for the question: "What's in this image, and where in the image is it located?".

Semantic segmentation is a critical module in robotics related applications, especially autonomous driving, remote sensing. Most of the research on semantic segmentation is focused on improving the accuracy with less attention paid to computationally efficient solutions.

Seft-driving-car

The recent appoarch in semantic segmentation is using deep neural network, specifically Fully Convolutional Network (a.k.a FCN). We can follow the trend of semantic segmenation approach at: paper-with-code.

Evaluate metrics: mIOU, accuracy, speed,...

State-Of-The-Art (SOTA) methods of Semantic Segmentation

Paper Benchmark on PASALVOC12 Release Implement
EfficientNet-L2+NAS-FPN Rethinking Pre-training and Self-training 90.5% NeurIPS 2020 TF
DeepLab V3+ Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation 89% ECCV 2018 TF, Keras, Pytorch, Demo
DeepLab V3 Rethinking Atrous Convolution for Semantic Image Segmentation 86.9% 17 Jun 2017 TF, TF
Smooth Network with Channel Attention Block Learning a Discriminative Feature Network for Semantic Segmentation 86.2% CVPR 2018 Pytorch
PSPNet Pyramid Scene Parsing Network 85.4% CVPR 2017 Keras, Pytorch, Pytorch
ResNet-38 MS COCO Wider or Deeper: Revisiting the ResNet Model for Visual Recognition 84.9% 30 Nov 2016 MXNet
RefineNet RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation 84.2% CVPR 2017 Matlab, Keras
GCN Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network 83.6% CVPR 2017 TF
CRF-RNN Conditional Random Fields as Recurrent Neural Networks 74.7% ICCV 2015 Matlab, TF
ParseNet ParseNet: Looking Wider to See Better 69.8% 15 Jun 2015 Caffe
Dilated Convolutions Multi-Scale Context Aggregation by Dilated Convolutions 67.6% 23 Nov 2015 Caffe
FCN Fully Convolutional Networks for Semantic Segmentation 67.2% CVPR 2015 Caffe

Variants

  • FCN with VGG(Resnet, Densenet) backbone: pytorch
  • The easiest implementation of fully convolutional networks (FCN8s VGG): pytorch
  • TernausNet (UNet model with VGG11 encoder pre-trained on Kaggle Carvana dataset paper: pytorch
  • TernausNetV2: Fully Convolutional Network for Instance Segmentation: pytorch

Review list of Semantic Segmentation

  • Evolution of Image Segmentation using Deep Convolutional Neural Network: A Survey 2020 (University of Gour Banga,India)
  • A peek of Semantic Segmentation 2018 (mc.ai)
  • Semantic Segmentation guide 2018 (towardds)
  • An overview of semantic image segmentation (jeremyjordan.me)
  • Recent progress in semantic image segmentation 2018 (arxiv, towardsdatascience)
  • A 2017 Guide to Semantic Segmentation Deep Learning Review (blog.qure.ai)
  • Review popular network architecture (medium-towardds)
  • Lecture 11 - Detection and Segmentation - CS231n (slide, vid):
  • A Survey of Semantic Segmentation 2016 (arxiv)

Case studies

  • Dstl Satellite Imagery Competition, 3rd Place Winners' Interview: Vladimir & Sergey: Blog, Code
  • Carvana Image Masking Challenge–1st Place Winner's Interview: Blog, Code
  • Data Science Bowl 2017, Predicting Lung Cancer: Solution Write-up, Team Deep Breath: Blog
  • MICCAI 2017 Robotic Instrument Segmentation: Code and explain
  • 2018 Data Science Bowl Find the nuclei in divergent images to advance medical discovery: 1st place, 2nd, 3rd, 4th, 5th, 10th
  • Airbus Ship Detection Challenge: 4th place, 6th

Most used loss functions

  • Pixel-wise cross entropy loss:
  • Dice loss: which is pretty nice for balancing dataset
  • Focal loss:
  • Lovasz-Softmax loss:

Datasets

Frameworks for segmentation

Related techniques

Feel free to show your ❤️ by giving a star

🎁 Check Out the List of Contributors - Feel free to add your details here!

Owner
Dam Minh Tien
Tech enthusiast
Dam Minh Tien
Realtime segmentation with ENet, the fast and accurate segmentation net.

Enet This is a realtime segmentation net with almost 22 fps on GTX1080 ti, and the model size is very small with only 28M. This repo contains the infe

JinTian 14 Aug 30, 2022
CVPR '21: In the light of feature distributions: Moment matching for Neural Style Transfer

In the light of feature distributions: Moment matching for Neural Style Transfer (CVPR 2021) This repository provides code to recreate results present

Nikolai Kalischek 49 Oct 13, 2022
The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text"

Finnish Dialect Identification The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text". We present a te

Rootroo Ltd 2 Dec 25, 2021
Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

📖 Depth-Aware Generative Adversarial Network for Talking Head Video Generation (CVPR 2022) 🔥 If DaGAN is helpful in your photos/projects, please hel

Fa-Ting Hong 503 Jan 04, 2023
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.

MicRank: Learning to Rank Microphones for Distant Speech Recognition Application Scenario Many applications nowadays envision the presence of multiple

Samuele Cornell 20 Nov 10, 2022
Attention-guided gan for synthesizing IR images

SI-AGAN Attention-guided gan for synthesizing IR images This repository contains the Tensorflow code for "Pedestrian Gender Recognition by Style Trans

1 Oct 25, 2021
[CVPR 2022] CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation

CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation Prerequisite Please create and activate the following conda envrionment. To r

Qin Wang 87 Jan 08, 2023
Deep Inertial Prediction (DIPr)

Deep Inertial Prediction For more information and context related to this repo, please refer to our website. Getting Started (non Docker) Note: you wi

Arcturus Industries 12 Nov 11, 2022
HairCLIP: Design Your Hair by Text and Reference Image

Overview This repository hosts the official PyTorch implementation of the paper: "HairCLIP: Design Your Hair by Text and Reference Image". Our single

322 Jan 06, 2023
FishNet: One Stage to Detect, Segmentation and Pose Estimation

FishNet FishNet: One Stage to Detect, Segmentation and Pose Estimation Introduction In this project, we combine target detection, instance segmentatio

1 Oct 05, 2022
Research on controller area network Intrusion Detection Systems

Group members information Member 1: Lixue Liang Member 2: Yuet Lee Chan Member 3: Xinruo Zhang Member 4: Yifei Han User Manual Generate Attack Packets

Roche 4 Aug 30, 2022
A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

DeepFilterNet A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering. libDF contains Rust code used for dat

Hendrik Schröter 292 Dec 25, 2022
DGCNN - Dynamic Graph CNN for Learning on Point Clouds

DGCNN is the author's re-implementation of Dynamic Graph CNN, which achieves state-of-the-art performance on point-cloud-related high-level tasks including category classification, semantic segmentat

Wang, Yue 1.3k Dec 26, 2022
Very Deep Convolutional Networks for Large-Scale Image Recognition

pytorch-vgg Some scripts to convert the VGG-16 and VGG-19 models [1] from Caffe to PyTorch. The converted models can be used with the PyTorch model zo

Justin Johnson 217 Dec 05, 2022
PartImageNet is a large, high-quality dataset with part segmentation annotations

PartImageNet: A Large, High-Quality Dataset of Parts We will release our dataset and scripts soon after cleaning and approval. Introduction PartImageN

Ju He 77 Nov 30, 2022
Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.

Vision Transformer(ViT) in Tensorflow2 Tensorflow2 implementation of the Vision Transformer(ViT). This repository is for An image is worth 16x16 words

sungjun lee 42 Dec 27, 2022
Pose estimation for iOS and android using TensorFlow 2.0

💃 Mobile 2D Single Person (Or Your Own Object) Pose Estimation for TensorFlow 2.0 This repository is forked from edvardHua/PoseEstimationForMobile wh

tucan9389 165 Nov 16, 2022
Disentangled Lifespan Face Synthesis

Disentangled Lifespan Face Synthesis Project Page | Paper Demo on Colab Preparation Please follow this github to prepare the environments and dataset.

何森 50 Sep 20, 2022
NeuTex: Neural Texture Mapping for Volumetric Neural Rendering

NeuTex: Neural Texture Mapping for Volumetric Neural Rendering Paper: https://arxiv.org/abs/2103.00762 Running Run on the provided DTU scene cd run ba

Fanbo Xiang 67 Dec 28, 2022
Code for pre-training CharacterBERT models (as well as BERT models).

Pre-training CharacterBERT (and BERT) This is a repository for pre-training BERT and CharacterBERT. DISCLAIMER: The code was largely adapted from an o

Hicham EL BOUKKOURI 31 Dec 05, 2022