Long-Short Transformer (Transformer-LS)

This repository hosts the code and models for the paper:

Long-Short Transformer: Efficient Transformers for Language and Vision

Updates

July 23, 2021: Release the code and models for ImageNet classification and Long-Range Arena

Architecture

Long-short Transformer substitutes the full self attention of the original Transformer models with an efficient attention that considers both long-range and short-term correlations. Each query attends to tokens from the segment-wise sliding window to capture short-term correlations, and the dynamically projected features to capture long-range correlations. To align the norms of the original and projected feature vectors and improve the efficacy of the aggregation, we normalize the original and project feature vectors with two sets of Layer Normalizations.

Tasks

>>> Transformer-LS for ImageNet classification
>>> Transformer-LS for Long Range Areana
>>> Transformer-LS for autoregressive language modeling

Official implementation of Long-Short Transformer in PyTorch.

Related tags

Overview

Long-Short Transformer (Transformer-LS)

Updates

Architecture

Tasks

Owner

NVIDIA Corporation

C3d-pytorch - Pytorch porting of C3D network, with Sports1M weights

Multiview 3D object detection on MultiviewC dataset through moft3d.

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

A small tool to joint picture including gif

A lightweight python AUTOmatic-arRAY library.

Deep learning with dynamic computation graphs in TensorFlow

Code for the prototype tool in our paper "CoProtector: Protect Open-Source Code against Unauthorized Training Usage with Data Poisoning".

Feedback is important: response-aware feedback mechanism for background based conversation

Pytorch0.4.1 codes for InsightFace

Optimising chemical reactions using machine learning

A deep learning network built with TensorFlow and Keras to classify gender and estimate age.

PyTorch implementation of Towards Accurate Alignment in Real-time 3D Hand-Mesh Reconstruction (ICCV 2021).

Official repo for AutoInt: Automatic Integration for Fast Neural Volume Rendering in CVPR 2021

AAI supports interdisciplinary research to help better understand human, animal, and artificial cognition.

Implementation of Stochastic Image-to-Video Synthesis using cINNs.

DSAC* for Visual Camera Re-Localization (RGB or RGB-D)

Facial detection, landmark tracking and expression transfer library for Windows, Linux and Mac

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

nnFormer: Interleaved Transformer for Volumetric Segmentation Code for paper "nnFormer: Interleaved Transformer for Volumetric Segmentation "

Official Pytorch implementation of the paper "MotionCLIP: Exposing Human Motion Generation to CLIP Space"