用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本和PARL（paddle）版本

Last update: Dec 17, 2022

Overview

用强化学习玩合成大西瓜

代码地址：https://github.com/Sharpiless/play-daxigua-using-Reinforcement-Learning

用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本、PARL（paddle）版本和pytorch版本。

B站：https://space.bilibili.com/470550823

CSDN：https://blog.csdn.net/weixin_44936889

AI Studio：https://aistudio.baidu.com/aistudio/personalcenter/thirdview/67156

Github：https://github.com/Sharpiless

1. 打开游戏：

这里使用pygame重写了大西瓜游戏，并封装为适合RL环境的代码。

解压图片素材：

unzip res.zip

运行：

python Main.py

即可开始游戏：

2. 训练RL模型：

RL算法采用DQN算法，其中Keras版本使用了简单的卷积神经网络来计算Q值，PRAL版本使用ResNet。

运行：

python train_keras.py

或者

python train_paddle.py

或者

python train_torch.py

开始训练：

关注我的公众号：

感兴趣的同学关注我的公众号——可达鸭的深度学习教程：

用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本和PARL（paddle）版本

Related tags

Overview

用强化学习玩合成大西瓜

1. 打开游戏：

2. 训练RL模型：

关注我的公众号：

Owner

OHLC Average Prediction of Apple Inc. Using LSTM Recurrent Neural Network

Code for EMNLP 2021 paper: "Learning Implicit Sentiment in Aspect-based Sentiment Analysis with Supervised Contrastive Pre-Training"

Code base for the paper "Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit Differentiation"

A Number Recognition algorithm

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

Code and data for the paper "Hearing What You Cannot See"

Rethinking the U-Net architecture for multimodal biomedical image segmentation

multimodal transformer

Experimental solutions to selected exercises from the book [Advances in Financial Machine Learning by Marcos Lopez De Prado]

Colour detection is necessary to recognize objects, it is also used as a tool in various image editing and drawing apps.

Implementation of self-attention mechanisms for general purpose. Focused on computer vision modules. Ongoing repository.

Face uncertainty quantification or estimation using PyTorch.

DISTIL: Deep dIverSified inTeractIve Learning.

Image-retrieval-baseline - MUGE Multimodal Retrieval Baseline

Lazy, a tool for running things in idle time

Revisiting Global Statistics Aggregation for Improving Image Restoration

LSTM Neural Networks for Spectroscopic Studies of Type Ia Supernovae

It's like Shape Editor in Maya but works with skeletons (transforms).

Codecov coverage standard for Python

Materials for my scikit-learn tutorial