Multimodal Reinforcement Learning

JAX implementations of the following multimodal reinforcement learning approaches.

Dual-coding Episodic Memory from "Grounded Language Learning Fast and Slow"

The goal in this setting is for the agent to be presented with multiple objects with made up names following "This is a _____" statements and to then carry out an instruction such as "Move the wazzle to the table." This task requires the agent to learn long-term language and vision representations for concepts like "This is a" and objects that carry over between episodes such as "table" while also being able to learn one-shot representations of novel objects and their names.

Usage

Start by setting up the environment locally by running

poetry install
poetry shell

The learning environment depends on Docker and requires that the Docker Desktop program is running (on Mac). Once that's done you can run the default environment (fast mapping with 3 objects from the paper).

python fast_slow_learning/main.py

Solving reinforcement learning tasks which require language and vision

Related tags

Overview

Multimodal Reinforcement Learning

Usage

Owner

Henry Prior

IDM: An Intermediate Domain Module for Domain Adaptive Person Re-ID,

Train the HRNet model on ImageNet

[CVPR'21] Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration

Repositorio oficial del curso IIC2233 Programación Avanzada 🚀✨

Western-3DSlicer-Modules - Point-Set Registrations for Ultrasound Probe Calibrations

hySLAM is a hybrid SLAM/SfM system designed for mapping

Official Keras Implementation for UNet++ in IEEE Transactions on Medical Imaging and DLMIA 2018

Code for KDD'20 "An Efficient Neighborhood-based Interaction Model for Recommendation on Heterogeneous Graph"

Code for the KDD 2021 paper 'Filtration Curves for Graph Representation'

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

FCN (Fully Convolutional Network) is deep fully convolutional neural network architecture for semantic pixel-wise segmentation

object recognition with machine learning on Respberry pi

Realtime segmentation with ENet, the fast and accurate segmentation net.

Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.

Learnable Boundary Guided Adversarial Training (ICCV2021)

Selecting Parallel In-domain Sentences for Neural Machine Translation Using Monolingual Texts

GBK-GNN: Gated Bi-Kernel Graph Neural Networks for Modeling Both Homophily and Heterophily

Implementation of StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation in PyTorch

Codes and models of NeurIPS2021 paper - DominoSearch: Find layer-wise fine-grained N:M sparse schemes from dense neural networks

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation