Unifying Global-Local Representations in Salient Object Detection with Transformer

Last update: Aug 24, 2022

Related tags

Overview

GLSTR (Global-Local Saliency Transformer)

This is the official implementation of paper "Unifying Global-Local Representations in Salient Object Detection with Transformer" by Sucheng Ren, Qiang Wen, Nanxuan Zhao, Guoqiang Han, Shengfeng He

Prerequisites

The whole training process can be done on eight RTX2080Ti or four RTX3090.

Pytorch 1.6

Datasets

Training Set

We use the training set of DUTS (DUTS-TR) to train our model.

/path/to/DUTS-TR/
   img/
      img1.jpg
   label/
      label1.png

Testing Set

We test our model on the testing set of DUTS, ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD to test our model.

Training

Download the pretrained transformer backbone on ImageNet.

# input the path to training data and pretrained backbone in train.sh
bash train.sh

Testing

Download the pretrained model from Baidu pan(code: uo0a), Google drive, and put it int ./ckpt/

python test.py

Evaluation

The precomputed saliency maps (DUTS-TE, ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD) can be found at Baidu pan(code: uo0a), Google drive.

After paper submission, we retrain the model, and the performance is improved. Feel free to use the results of our paper or the precomputed saliency maps.

Contact

If you have any questions, feel free to email Sucheng Ren :) ([email protected])

Citation

Please cite our paper if you think the code and paper are helpful.

@article{ren2021unifying,
  title={Unifying Global-Local Representations in Salient Object Detection with Transformer},
  author={Ren, Sucheng and Wen, Qiang and Zhao, Nanxuan and Han, Guoqiang and He, Shengfeng},
  journal={arXiv preprint arXiv:2108.02759},
  year={2021}
}

Unifying Global-Local Representations in Salient Object Detection with Transformer

Related tags

Overview

GLSTR (Global-Local Saliency Transformer)

Prerequisites

Datasets

Training Set

Testing Set

Training

Testing

Evaluation

Contact

Citation

Owner

TumorInsight is a Brain Tumor Detection and Classification model built using RESNET50 architecture.

MogFace: Towards a Deeper Appreciation on Face Detection

Deep High-Resolution Representation Learning for Human Pose Estimation

Lava-DL, but with PyTorch-Lightning flavour

A library for using chemistry in your applications

Pytorch for Segmentation

Linear Variational State Space Filters

E2C implementation in PyTorch

An Open Source Machine Learning Framework for Everyone

A pytorch reproduction of { Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation }.

Official implementation of Meta-StyleSpeech and StyleSpeech

A solution to ensure Crowd Management with Contactless and Safe systems.

WaveFake: A Data Set to Facilitate Audio DeepFake Detection

Bayes-Newton—A Gaussian process library in JAX, with a unifying view of approximate Bayesian inference as variants of Newton's algorithm.

Project code for weakly supervised 3D object detectors using wide-baseline multi-view traffic camera data: WIBAM.

SubOmiEmbed: Self-supervised Representation Learning of Multi-omics Data for Cancer Type Classification

ThunderGBM: Fast GBDTs and Random Forests on GPUs

The AugNet Python module contains functions for the fast computation of image similarity.

Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs

Codes for TIM2021 paper "Anchor-Based Spatio-Temporal Attention 3-D Convolutional Networks for Dynamic 3-D Point Cloud Sequences"