AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Last update: Dec 19, 2022

Related tags

Overview

SimSR

Code and dataset for the paper SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning (AAAI-22).

Requirements

We assume you have access to a gpu that can run CUDA 11. All of the dependencies are in the conda_env.yml file.

conda env create -f conda_env.yml

After the instalation ends you can activate your environment with

conda activate simsr

Instructions

To train a SimSR agent on the cartpole swingup task from image-based observations run bash run.sh from the root of this directory. The run.sh file contains the following command, which you can modify to try different environments / hyperparamters.

DOMAIN=cartpole
TASK=swingup
SEED=1

MUJOCO_GL="egl" CUDA_VISIBLE_DEVICES=0 nohup python -u train.py \
	--domain_name ${DOMAIN} \
	--task_name ${TASK} \
	--encoder_type pixel \
	--action_repeat 4 \
	--pre_transform_image_size 84 \
	--image_size 84 \
	--work_dir ./tmp \
	--agent simsr_sac \
	--frame_stack 3\
	--seed ${SEED} --critic_lr 1e-3 \
	--actor_lr 1e-3 \
	--eval_freq 10000 \
	--batch_size 128 \
	--num_train_steps 260000 > ${DOMAIN}_${TASK}_${SEED}.log &

Note that the MuJoCo Python bindings support three different OpenGL rendering backends: "glfw", "egl", or "osmesa". You can also specify a particular backend to use by setting the MUJOCO_GL= environment variable to one of them.

To visualize progress with tensorboard run:

tensorboard --logdir ./path/to/your/log --port 6006

References

Please cite the paper SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning if you found the resources in the repository useful.

AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Related tags

Overview

SimSR

Requirements

Instructions

References

Owner

Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"

pytorch implementation of dftd2 & dftd3

TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain

A Bayesian cognition approach for belief updating of correlation judgement through uncertainty visualizations

FasterAI: A library to make smaller and faster models with FastAI.

All course materials for the Zero to Mastery Deep Learning with TensorFlow course.

Some code of the implements of Geological Modeling Using 3D Pixel-Adaptive and Deformable Convolutional Neural Network

Inverse Rendering for Complex Indoor Scenes: Shape, Spatially-Varying Lighting and SVBRDF From a Single Image

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Mixed Neural Likelihood Estimation for models of decision-making

Using fully convolutional networks for semantic segmentation with caffe for the cityscapes dataset

Applying PVT to Semantic Segmentation

An official PyTorch Implementation of Boundary-aware Self-supervised Learning for Video Scene Segmentation (BaSSL)

Code for intrusion detection system (IDS) development using CNN models and transfer learning

SAS output to EXCEL converter for Cornell/MIT Language and acquisition lab

基于PaddleClas实现垃圾分类，并转换为inference格式用PaddleHub服务端部署

This repo provides a demo for the CVPR 2021 paper "A Fourier-based Framework for Domain Generalization" on the PACS dataset.

Deep Learning to Create StepMania SM FIles

Pytorch codes for "Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation"

Artstation-Artistic-face-HQ Dataset (AAHQ)