Vision Transformer for 3D medical image registration (PyTorch).

Last update: Dec 20, 2022

Overview

ViT-V-Net: Vision Transformer for Volumetric Medical Image Registration

keywords: vision transformer, convolutional neural networks, image registration

This is a PyTorch implementation of my short paper:

Chen, Junyu, et al. "ViT-V-Net: Vision Transformer for Unsupervised Volumetric Medical Image Registration." arXiv:2104.06468, 2021.

train.py is the training script. models.py contains the ViT-V-Net model.

Pretrained ViT-V-Net: pretrained model

Dataset: Due to restrictions, we cannot distribute our brain MRI data. However, several brain MRI datasets are publicly available online, including IXI, ADNI, OASIS, and ABIDE. Note that these datasets may not include segmentation labels. To generate labels, you can use FreeSurfer, an open-source software package for processing and analyzing brain MRI images. Here are some useful FreeSurfer commands: Brain MRI preprocessing and subcortical segmentation using FreeSurfer.
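Segmentation labels are commonly used to score a registration by measuring how well anatomical structures overlap after warping, for example with the Dice coefficient. The snippet below is a minimal sketch of that idea and is not taken from this repository: the function name `dice_per_label` is hypothetical, label 0 is assumed to be background, and flattened label lists stand in for 3D label volumes.

```python
def dice_per_label(fixed_labels, warped_labels, ignore=(0,)):
    """Return {label: Dice score} for two equally sized, flattened label maps.

    Hypothetical helper for illustration only; labels listed in `ignore`
    (background by assumption) are skipped.
    """
    if len(fixed_labels) != len(warped_labels):
        raise ValueError("label maps must contain the same number of voxels")

    size_fixed, size_warped, overlap = {}, {}, {}
    for a, b in zip(fixed_labels, warped_labels):
        size_fixed[a] = size_fixed.get(a, 0) + 1
        size_warped[b] = size_warped.get(b, 0) + 1
        if a == b:
            overlap[a] = overlap.get(a, 0) + 1

    scores = {}
    for label in set(size_fixed) | set(size_warped):
        if label in ignore:
            continue
        # Dice = 2 * |A ∩ B| / (|A| + |B|); the denominator is > 0 because
        # the label occurs in at least one of the two maps.
        denom = size_fixed.get(label, 0) + size_warped.get(label, 0)
        scores[label] = 2.0 * overlap.get(label, 0) / denom
    return scores


# Toy example: six voxels, two foreground structures (labels 1 and 2).
fixed = [0, 1, 1, 2, 2, 2]
warped = [0, 1, 2, 2, 2, 0]
scores = dice_per_label(fixed, warped)
```

With real data, the two inputs would be the fixed image's label map and the moving image's label map after applying the predicted deformation, both flattened to one dimension.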

Model Architecture:

Vision Transformer Architecture:

Example Results:

Quantitative Results:

Reference:

TransUnet

ViT-pytorch

VoxelMorph

If you find this code useful in your research, please consider citing:

@misc{chen2021vitvnet,
title={ViT-V-Net: Vision Transformer for Unsupervised Volumetric Medical Image Registration}, 
author={Junyu Chen and Yufan He and Eric C. Frey and Ye Li and Yong Du},
year={2021},
eprint={2104.06468},
archivePrefix={arXiv},
primaryClass={eess.IV}
}
