implement of SwiftNet:Real-time Video Object Segmentation

Last update: Dec 14, 2022

Related tags

Overview

SwiftNet

The official PyTorch implementation of SwiftNet:Real-time Video Object Segmentation, which has been accepted by CVPR2021.

Requirements

Python >= 3.6
Pytorch 1.5
Numpy
Pillow
opencv-python
scipy
tqdm

Training

The training pipeline of Swiftnet is similar with the training pipeline of STM, which can be found in our reproduced STM training code.

Inference

Usage

python eval.py -g 0 -y 17 -s val -D 'path to davis'

Performance

Performance on Davis-17 val set.

backbone	J&F	J	F	FPS	weights
resnet-18	77.6	75.5	79.7	65	`link`

Note: The FPS is tested on one P100, which does not include the time of image loading and evaluation cost.

Acknowledgement

This repository is partially founded on the official STM repository.

Citation

If you find this repository helpful and want to cite SwiftNet in your own projects, please use the following citation info.

@inproceedings{wang2021swiftnet,
  title={SwiftNet: Real-time Video Object Segmentation},
  author={Wang, Haochen and Jiang, Xiaolong and Ren, Haibing and Hu, Yao and Bai, Song},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={1296--1305},
  year={2021}
}

implement of SwiftNet:Real-time Video Object Segmentation

Related tags

Overview

SwiftNet

Requirements

Training

Inference

Performance

Acknowledgement

Citation

Owner

haochen wang

Example of a Quantum LSTM

Implementation for Panoptic-PolarNet (CVPR 2021)

A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items

This repository contains the implementation of the following paper: Cross-Descriptor Visual Localization and Mapping

領域を指定し、キーを入力することで画像を保存するツールです。クラス分類用のデータセット作成を想定しています。

House3D: A Rich and Realistic 3D Environment

The code repository for "PyCIL: A Python Toolbox for Class-Incremental Learning" in PyTorch.

Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification

Useful materials and tutorials for 110-1 NTU DBME5028 (Application of Deep Learning in Medical Imaging)

[MedIA2021]MIDeepSeg: Minimally Interactive Segmentation of Unseen Objects from Medical Images Using Deep Learning

This is the pytorch code for the paper Curious Representation Learning for Embodied Intelligence.

A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).

CN24 is a complete semantic segmentation framework using fully convolutional networks

UT-Sarulab MOS prediction system using SSL models

Interactive Image Segmentation via Backpropagating Refinement Scheme

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.

A Differentiable Recipe for Learning Visual Non-Prehensile Planar Manipulation

PyTorch implementation of the TTC algorithm

Predicting Event Memorability from Contextual Visual Semantics

Official code for Next Check-ins Prediction via History and Friendship on Location-Based Social Networks (MDM 2018)