Gobigger-Explore
๐ฎ
GoBigger Challenge 2021 Baseline
๐ค
Introduction
This is the baseline of GoBigger Multi-Agent Decision Intelligence Challenge in 2021. The baseline is based on OpenDILab and aims to provide a simple entry-level method. Participants can build agents by extending the baseline method provided. In addition, Opendilab's modular structure allows participants to easily get started, and it provides a wealth of reinforcement learning algorithms for participants to use. This baseline is a good starting point, especially for entry-level researchers who are familiar with multi-agent decision AI problems.
๐
Release Version
The current version is the latest version v-0.2.0.
- What needs to be optimized in the future
- Variable length features lead to potential GPU memory explosion risk.
- Better reward function design.
- Version-0.2.0
- Fix the ckpt bug to improve the accuracy of the evaluator.
- Fix replay_buffer bug
- Brand new feature engineering to improve convergence speed.
- Replay_buffer stores variable-length features to improve data utilization and training speed.
- Version-0.1.0
๐
Getting Started
-
System environment
- Core 16
- GPU A100(40G)
- Memory 50G
-
Baseline Config
- The default config is the config used in this experiment. Participants can modify it according to the system environment.
- The size of replay_buffer_size needs to be set according to the size of RAM.
- The size of batch_size needs to be set according to the size of the GPU memory.
-
Install the necessary packege
# Install DI-engine
git clone https://github.com/opendilab/DI-engine.git
cd YOUR_PATH/DI-engine/
pip install -e . --user
# Install Env Gobigger
git clone https://github.com/opendilab/GoBigger.git
cd YOUR_PATH/GoBigger/
pip install -e . --user
- Start training
# Download baseline
git clone https://github.com/opendilab/Gobigger-Explore.git
cd my_submission/entry/
python gobigger_vsbot_baseline_main.py
- Evaluator and Save game videos
cd my_submission/entry/
python gobigger_vsbot_baseline_eval.py --ckpt YOUR_CKPT_PATH
๐ฏ
Result
We released training log information, checkpoints, and evaluation videos. Below is the download link,
๐
Resources
- Challenge Page Link
- Challenge Repo Github Link
- DI-engine Repo Github Link
- GoBigger Repo Github Link