当前位置:网站首页>It is the most difficult to teach AI to play iron fist frame by frame. Now arcade game lovers have something
It is the most difficult to teach AI to play iron fist frame by frame. Now arcade game lovers have something
2022-07-02 09:55:00 【QbitAl】
Bowen From the Aofei temple
qubits | official account QbitAI
current AI Have begun to learn to rub the move frame by frame to play the arcade ?
《 The king of Fighters 98》、《 Street Fighter 》、《 Death or life 》…… I've played with all my childhood memories , There are also claims to fight 5000 site To get started 3D Fighting games 《 An iron fist 》:

you 're right , It's the one who is very unfriendly to novices 《 An iron fist 》, Casually pull a character's rubbing table to feel the complexity :
( you 're right , Various framing decisions JF Technology is one of its characteristics )

△ An iron fist TT2 List of moves
but AI It happens that you can pass the customs before long after entering the pit The highest difficulty :

△ On the left AI
This kind of AI Behind it is a personal developer , He is also a hardcore arcade game enthusiast .
He trained novices “ Blacksmith ” stay Reddit Of “ Game recording ” There has been a near 500 The heat of the :

Reinforcement learning and training framework
the AI Behind the blacksmith , It's called DIAMBRA Arena Reinforcement learning interaction framework .
DIAMBRA Arena It provides multiple reinforcement learning research and experimental environments , Episodic reinforcement learning tasks , By discrete actions ( Such as joystick buttons ) And pixels and data in the screen ( Such as human blood bar ) form .
In this framework , Intelligent experience sends an action to the environment , The environment treats it , And correspondingly transform a starting state into a new state , Then return the observation and reward to the agent , With this interactive loop :

The code to implement the above loop is also very simple :
This framework currently supports Linux、Windows、MacOS And other mainstream operating systems .
which AI Of “ Real battlefield ” For the early Tekken Tag Tournament, Of course , The complexity of rubbing moves is not inferior to that of the later new version ……

Developers chose fengjianren (Jin) Hejiguang (Yoshimitsu) Two representative roles are the main operation objects .
The inputs are : from RGB Convert to grayscale , And shrink to 128 x 128px Pixel value of the game screen 、 To the number of battles (Stage)、 Character blood bar 、 One side of the game interface .
The reward in training is a function based on health , If the opponent's HP is damaged, he will get a positive reward , The loss of health of the character controlled by your side will be negatively punished .
meanwhile ,AI The action rate of is also limited to the maximum rate 1/10, That is, every 6 Step send an action .
Because the framework uses a discrete action space , therefore , An agent can only choose one movement at a time during training 9 Up , Wait down ) Or attack ( impact , kick , Punch ).
therefore , Although a combo combination has stronger actual combat ability , But because of AI Cannot click two actions at the same time , In the real battle , It will appear. AI Use kicks frequently (Kick) And changing roles (swap) Two actions :

There are senior in the comment area PVP Fans say , Want to see this top AI Fierce scenes of players' mutual abuse , The developer himself agrees with this :
We are creating a platform , On this platform , Programmers will submit their well-trained AI And confront each other , And broadcast the game on our channel .

AI The tournament
Now? , The developer team has begun to formally prepare for this “AI Game Championship ”, The programmers and developers behind it are quite so “ The coach ” perhaps “ Parents of contestants ”, The final winner can get 1400 Swiss francs ( Renminbi conversion 9261 element ).

“ event ” It's not just iron fist , Developers say , The underlying mechanisms of these fighting games are similar , Just modify the combination skill 、 The difference attribute of games such as character blood bar value .
therefore , their DIAMBRA Arena Framework for all kinds of arcade video games to provide full compliance OpenAI Gym The standard Python API.
Like death or life 、 Street fighter and many other popular arcade games have been included :

GitHub link :
https://github.com/diambra/diambraArena
Video link :
https://www.youtube.com/watch?v=9HAKEjhIfJY
Reference link :
[1]https://www.reddit.com/r/reinforcementlearning/comments/sq1s3f/deep_reinforcement_learning_algorithm_completing/
[2]https://www.reddit.com/r/MachineLearning/comments/sqra1n/p_deep_reinforcement_learning_algorithm/
边栏推荐
猜你喜欢

2837xd 代碼生成——補充(1)

Navicat remote connection MySQL reports an error 1045 - access denied for user 'root' @ '222.173.220.236' (using password: yes)

BugkuCTF-web24(解题思路及步骤)

每天睡前30分钟阅读Day6_Day6_Date_Calendar_LocalDate_TimeStamp_LocalTime

在SQL注入中,为什么union联合查询,id必须等于0

Introduction et prévention des essais de pénétration

2837xd代码生成模块学习(4)——idle_task、Simulink Coder

MySQL transaction

Idempotent design of Internet API interface

Typora安装包分享
随机推荐
保存视频 opencv::VideoWriter
tinyxml2 读取和修改文件
记录一下初次使用Xray的有趣过程
CKEditor 4.10.1 上传图片提示“不正确的服务器响应” 问题解决
How to install PHP in CentOS
Is the C language too fat
Introduction to go language
JDBC review
BugkuCTF-web24(解题思路及步骤)
Image recognition - data annotation
阿里云ack介绍
Error reporting on the first day of work (error reporting when Nessus installs WinPcap)
三相并网逆变器PI控制——离网模式
逆变器simulink模型——处理器在环测试(PIL)
【UE5】蓝图制作简单地雷教程
PI control of grid connected inverter (grid connected mode)
BugkuCTF-web16(备份是个好习惯)
因上努力,果上随缘
Skywalking理论与实践
C语言之最小数