ConvLSTM_pytorch

This file contains the implementation of Convolutional LSTM in PyTorch made by me and DavideA.

We started from this implementation and heavily refactored it add added features to match our needs.

Please note that in this repository we implement the following dynamics:

which is a bit different from the one in the original paper.

How to Use

The ConvLSTM module derives from nn.Module so it can be used as any other PyTorch module.

The ConvLSTM class supports an arbitrary number of layers. In this case, it can be specified the hidden dimension (that is, the number of channels) and the kernel size of each layer. In the case more layers are present but a single value is provided, this is replicated for all the layers. For example, in the following snippet each of the three layers has a different hidden dimension but the same kernel size.

Example usage:

model = ConvLSTM(input_dim=channels,
                 hidden_dim=[64, 64, 128],
                 kernel_size=(3, 3),
                 num_layers=3,
                 batch_first=True
                 bias=True,
                 return_all_layers=False)

TODO (in progress...)

Comment code
Add docs
Add example usage on a toy problem
Implement stateful mechanism
...

Disclaimer

This is still a work in progress and is far from being perfect: if you find any bug please don't hesitate to open an issue.

Implementation of Convolutional LSTM in PyTorch.

Related tags

Overview

ConvLSTM_pytorch

How to Use

TODO (in progress...)

Disclaimer

Owner

Andrea Palazzi

Code for "Diffusion is All You Need for Learning on Surfaces"

Kalidokit is a blendshape and kinematics solver for Mediapipe/Tensorflow.js face, eyes, pose, and hand tracking models

Experiments and examples converting Transformers to ONNX

Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion (CVPR'2021, Oral)

Implementation for paper "STAR: A Structure-aware Lightweight Transformer for Real-time Image Enhancement" (ICCV 2021).

Only a Matter of Style: Age Transformation Using a Style-Based Regression Model

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

Source code for GNN-LSPE (Graph Neural Networks with Learnable Structural and Positional Representations)

A visualisation tool for Deep Reinforcement Learning

Code Release for Learning to Adapt to Evolving Domains

GAN JAX - A toy project to generate images from GANs with JAX

Convolutional Neural Network to detect deforestation in the Amazon Rainforest

Nested cross-validation is necessary to avoid biased model performance in embedded feature selection in high-dimensional data with tiny sample sizes

A Planar RGB-D SLAM which utilizes Manhattan World structure to provide optimal camera pose trajectory while also providing a sparse reconstruction containing points, lines and planes, and a dense surfel-based reconstruction.

Exploring whether attention is necessary for vision transformers

Dieser Scanner findet Websites, die nicht direkt in Suchmaschinen auftauchen, aber trotzdem erreichbar sind.

StudioGAN is a Pytorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.

Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language (NeurIPS 2021)

Lenia - Mathematical Life Forms

Implementation for Shape from Polarization for Complex Scenes in the Wild