TransCD: Scene Change Detection via Transformer-based Architecture

Last update: Dec 11, 2022

Related tags

Overview

TransCD: Scene Change Detection via Transformer-based Architecture

Requirements

Python 3.7.0  
Pytorch 1.6.0  
Visdom 0.1.8.9  
Torchvision 0.7.0

Datasets

CD2014 dataset
- paper: changedetection.net: A new change detection benchmark dataset
- paper: CDnet 2014: An Expanded Change Detection Benchmark Dataset
- dataset: http://changedetection.net/
VL-CMU-CD
- paper: Street-view change detection with deconvolutional networks
- dataset: https://ghsi.github.io/proj/RSS2016.html

Pretrained Model

Pretrained models for CDNet-2014 and VL-CMU-CD are available. You can download them from the following link.

CDNet-2014: [Baiduyun] the password is 78cp. [GoogleDrive].
- We uploaded six models trained on CDNet-2014 dataset, they are SViT_E1_D1_16, SViT_E1_D1_32, SViT_E4_D4_16, SViT_E4_D4_32, Res_SViT_E1_D1_16 and Res_SViT_E4_D4_16.
VL-CMU-CD: [Baiduyun] the password is ydzl. [GoogleDrive].
- We uploaded four models trained on VL-CMU-CD dataset, ther are SViT_E1_D1_16, SViT_E1_D1_32, Res_SViT_E1_D1_16 and Res_SViT_E1_D1_32.

Test

Before test, please download datasets and predtrained models. Copy pretrained models to folder './dataset_name/outputs/best_weights', and run the following command:

cd TransCD_ROOT
python test.py --net_cfg 
   
     --train_cfg

Use --save_changemap True to save predicted changemaps. For example:

python test.py --net_cfg SVit_E1_D1_32 --train_cfg CDNet_2014 --save_changemap True

Training

Before training, please download datasets and revise dataset path in configs.py to your path. CD TransCD_ROOT

python -m visdom.server
python train.py --net_cfg 
   
     --train_cfg

For example:

python -m visdom.server
python train.py --net_cfg Res_SViT_E1_D1_16 --train_cfg VL_CMU_CD

To display training processing, copy 'http://localhost:8097' to your browser.

Citing TransCD

If you use this repository or would like to refer the paper, please use the following BibTex entry.

@inproceddings{TransCD,
title={TransCD: Scene Change Detection via Transformer-based Architecture},
author={ZHIXUE WANG, YU ZHANG*, LIN LUO, NAN WANG},
journal={Optics Express},
yera={2021},
organization={The Optical Society},
}

Reference

-Akcay, Samet, Amir Atapour-Abarghouei, and Toby P. Breckon. "Ganomaly: Semi-supervised anomaly detection via adversarial training." Asian conference on computer vision. Springer, Cham, 2018.
-Chen, Jieneng, et al. "Transunet: Transformers make strong encoders for medical image segmentation." arXiv preprint arXiv:2102.04306 (2021).

TransCD: Scene Change Detection via Transformer-based Architecture

Related tags

Overview

TransCD: Scene Change Detection via Transformer-based Architecture

Requirements

Datasets

Pretrained Model

Test

Training

Citing TransCD

Reference

Owner

wangzhixue

Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.

How to Learn a Domain Adaptive Event Simulator? ACM MM, 2021

Cowsay - A rewrite of cowsay in python

This is an implementation of Googles Yogi-Optimizer in Keras (tf.keras)

AdaDM: Enabling Normalization for Image Super-Resolution

Implementation for the paper: Invertible Denoising Network: A Light Solution for Real Noise Removal (CVPR2021).

Self-Supervised Contrastive Learning of Music Spectrograms

Run Effective Large Batch Contrastive Learning on Limited Memory GPU

这是一个mobilenet-yolov4-lite的库，把yolov4主干网络修改成了mobilenet，修改了Panet的卷积组成，使参数量大幅度缩小。

Auto-Encoding Score Distribution Regression for Action Quality Assessment

FairMOT - A simple baseline for one-shot multi-object tracking

This repo contains the code for paper Inverse Weighted Survival Games

Code for testing convergence rates of Lipschitz learning on graphs

TrTr: Visual Tracking with Transformer

(ICCV'21) Official PyTorch implementation of Relational Embedding for Few-Shot Classification

This code provides various models combining dilated convolutions with residual networks

CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement

A repo for Causal Imitation Learning under Temporally Correlated Noise

Source code for the paper: Variance-Aware Machine Translation Test Sets (NeurIPS 2021 Datasets and Benchmarks Track)

Polynomial-time Meta-Interpretive Learning