Pytorch Implementation of Various Point Transformers

Recently, various methods applied transformers to point clouds: PCT: Point Cloud Transformer (Meng-Hao Guo et al.), Point Transformer (Nico Engel et al.), Point Transformer (Hengshuang Zhao et al.). This repo is a pytorch implementation for these methods and aims to compare them under a fair setting. Currently, all three methods are implemented, while tuning their hyperparameters.

Classification

Data Preparation

Download alignment ModelNet here and save in modelnet40_normal_resampled.

Run

Change which method to use in config/config.yaml and run

python train.py

Results

Using Adam with learning rate decay 0.3 for every 50 epochs, train for 200 epochs; data augmentation follows this repo. For Hengshuang and Nico, initial LR is 1e-3 (maybe fine-tuned later); for Menghao, initial LR is 1e-4, as suggested by the author. ModelNet40 classification results (instance average) are listed below:

Model	Accuracy
Hengshuang	89.6
Menghao	92.6
Nico	85.5

Miscellaneous

Some code and training settings are borrowed from https://github.com/yanx27/Pointnet_Pointnet2_pytorch. Code for PCT: Point Cloud Transformer (Meng-Hao Guo et al.) is adapted from the author's Jittor implementation https://github.com/MenghaoGuo/PCT.

Pytorch Implementation of Various Point Transformers

Related tags

Overview

Pytorch Implementation of Various Point Transformers

Classification

Data Preparation

Run

Results

Miscellaneous

Owner

Neil You

PAIRED in PyTorch 🔥

CoMoGAN: continuous model-guided image-to-image translation. CVPR 2021 oral.

This is an official implementation of the CVPR2022 paper "Blind2Unblind: Self-Supervised Image Denoising with Visible Blind Spots".

A library for researching neural networks compression and acceleration methods.

Explainability of the Implications of Supervised and Unsupervised Face Image Quality Estimations Through Activation Map Variation Analyses in Face Recognition Models

ColBERT: Contextualized Late Interaction over BERT (SIGIR'20)

InsightFace: 2D and 3D Face Analysis Project on MXNet and PyTorch

A `Neural = Symbolic` framework for sound and complete weighted real-value logic

Ranger deep learning optimizer rewrite to use newest components

Tensorflow implementation of our method: "Triangle Graph Interest Network for Click-through Rate Prediction".

Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)

CRLT: A Unified Contrastive Learning Toolkit for Unsupervised Text Representation Learning

A Protein-RNA Interface Predictor Based on Semantics of Sequences

Torch implementation of "Enhanced Deep Residual Networks for Single Image Super-Resolution"

A Loss Function for Generative Neural Networks Based on Watson’s Perceptual Model

Music Generation using Neural Networks Streamlit App

Point-NeRF: Point-based Neural Radiance Fields

Deep learning based hand gesture recognition using LSTM and MediaPipie.

SiT: Self-supervised vIsion Transformer

A large-scale face dataset for face parsing, recognition, generation and editing.