DEPRECATION
This repository is deprecated and is no longer maintaned. Please see a more recent implementation of RL for continuous control at jax-sac.
Description
Reimplementation of Continuous Deep Q-Learning with Model-based Acceleration and Continuous control with deep reinforcement learning.
Contributions are welcome. If you know how to make it more stable, don't hesitate to send a pull request.
Run
Use the default hyperparameters.
For NAF:
python main.py --algo NAF --env-name HalfCheetah-v2
For DDPG
python main.py --algo DDPG --env-name HalfCheetah-v2