PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Last update: Dec 08, 2022

Overview

Advantage async actor-critic Algorithms (A3C) in PyTorch

@inproceedings{mnih2016asynchronous,
  title={Asynchronous methods for deep reinforcement learning},
  author={Mnih, Volodymyr and Badia, Adria Puigdomenech and Mirza, Mehdi and Graves, Alex and Lillicrap, Timothy P and Harley, Tim and Silver, David and Kavukcuoglu, Koray},
  booktitle={International Conference on Machine Learning},
  year={2016}}

This repository contains an implementation of Adavantage async Actor-Critic (A3C) in PyTorch based on the original paper by the authors and the PyTorch implementation by Ilya Kostrikov.

A3C is the state-of-art Deep Reinforcement Learning method.

Dependencies

Python 2.7
PyTorch
gym (OpenAI)
universe (OpenAI)
opencv (for env state processing)
visdom (for visualization)

Training

./train_lstm.sh

Test wigh trained weight after 169000 updates for PongDeterminisitc-v3.

./test_lstm.sh 169000

A test result video is available.

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Related tags

Overview

Advantage async actor-critic Algorithms (A3C) in PyTorch

Dependencies

Training

Test wigh trained weight after 169000 updates for PongDeterminisitc-v3.

Check the loss curves of all threads in http://localhost:8097

References

Owner

LEI TAI

Unpaired Caricature Generation with Multiple Exaggerations

AI drive app that can help user become beautiful.

Implementation of Convolutional enhanced image Transformer

Adabelief-Optimizer - Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

NuPIC Studio is an all-in-one tool that allows users create a HTM neural network from scratch

An efficient implementation of GPNN

VisionKG: Vision Knowledge Graph

EgoNN: Egocentric Neural Network for Point Cloud Based 6DoF Relocalization at the City Scale

Deep learning algorithms for muon momentum estimation in the CMS Trigger System

This repository provides an unified frameworks to train and test the state-of-the-art few-shot font generation (FFG) models.

Few-shot Learning of GPT-3

Official code repository for the EMNLP 2021 paper

Title: Heart-Failure-Classification

SegNet-like Autoencoders in TensorFlow

Keyhole Imaging: Non-Line-of-Sight Imaging and Tracking of Moving Objects Along a Single Optical Path

AI grand challenge 2020 Repo (Speech Recognition Track)

Official implementation for "Image Quality Assessment using Contrastive Learning"

Code for our paper "Graph Pre-training for AMR Parsing and Generation" in ACL2022

So-ViT: Mind Visual Tokens for Vision Transformer

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Related tags

Overview

Advantage async actor-critic Algorithms (A3C) in PyTorch

Dependencies

Training

Test wigh trained weight after 169000 updates for PongDeterminisitc-v3.

Check the loss curves of all threads in http://localhost:8097

References

Owner

LEI TAI

Unpaired Caricature Generation with Multiple Exaggerations

AI drive app that can help user become beautiful.

Implementation of Convolutional enhanced image Transformer

Adabelief-Optimizer - Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

NuPIC Studio is an all­-in-­one tool that allows users create a HTM neural network from scratch

An efficient implementation of GPNN

VisionKG: Vision Knowledge Graph

EgoNN: Egocentric Neural Network for Point Cloud Based 6DoF Relocalization at the City Scale

Deep learning algorithms for muon momentum estimation in the CMS Trigger System

This repository provides an unified frameworks to train and test the state-of-the-art few-shot font generation (FFG) models.

Few-shot Learning of GPT-3

Official code repository for the EMNLP 2021 paper

Title: Heart-Failure-Classification

SegNet-like Autoencoders in TensorFlow

Keyhole Imaging: Non-Line-of-Sight Imaging and Tracking of Moving Objects Along a Single Optical Path

AI grand challenge 2020 Repo (Speech Recognition Track)

Official implementation for "Image Quality Assessment using Contrastive Learning"

Code for our paper "Graph Pre-training for AMR Parsing and Generation" in ACL2022

So-ViT: Mind Visual Tokens for Vision Transformer

NuPIC Studio is an all-in-one tool that allows users create a HTM neural network from scratch