AI learns to play Tekken, the hardest fighting game to pick up, frame by frame. Arcade game fans now have something to play with
2022-07-02 09:55:00 【QbitAl】
By Bowen, reporting from Aofei Temple
QbitAI | WeChat official account: QbitAI
Has AI now learned to enter moves frame by frame and play arcade games?
The King of Fighters '98, Street Fighter, Dead or Alive... it has played through all of these childhood favorites, as well as Tekken, the 3D fighting game said to take 5,000 matches just to get started:
Yes, that Tekken, the one notoriously unfriendly to beginners. Pull up any character's move list to get a feel for the complexity:
(Yes, its many frame-precise "just frame" (JF) moves are one of its hallmarks.)
△ Tekken Tag Tournament 2 (TT2) move list
Yet this AI managed to clear the game on the highest difficulty not long after picking it up:
△ The AI is on the left
Behind this AI is an independent developer who is also a hardcore arcade game enthusiast.
The "match recording" of the novice Tekken player he trained has reached a popularity of nearly 500 on Reddit:
Reinforcement learning and training framework
Behind this AI Tekken player is a reinforcement learning interaction framework called DIAMBRA Arena.
DIAMBRA Arena provides multiple environments for reinforcement learning research and experimentation. They are episodic reinforcement learning tasks built from discrete actions (such as joystick and button inputs) and from observations made up of screen pixels and in-game data (such as the characters' health bars).
In this framework, the agent sends an action to the environment. The environment processes it, transitions from the starting state to a new state accordingly, and returns an observation and a reward to the agent, which closes the interaction loop:
The code to implement the above loop is also very simple :
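As a rough illustration of such a loop, here is a minimal sketch written against a Gym-style `reset()`/`step()` interface. `ToyFightEnv`, its random rewards, and `run_episode` are hypothetical stand-ins invented for this example; they are not DIAMBRA Arena classes, and the framework's own code may look different.

```python
import random


class ToyFightEnv:
    """Hypothetical stand-in environment with a Gym-style interface."""

    N_ACTIONS = 4  # assumed size of the discrete action set

    def __init__(self, max_steps=50, seed=0):
        self.max_steps = max_steps
        self.rng = random.Random(seed)
        self.steps = 0

    def reset(self):
        # Start a new episode and return the first observation.
        self.steps = 0
        return {"step": self.steps}

    def step(self, action):
        # Process the action, move to a new state, return the results.
        self.steps += 1
        reward = self.rng.choice([-1.0, 0.0, 1.0])  # placeholder reward
        done = self.steps >= self.max_steps
        return {"step": self.steps}, reward, done, {}


def run_episode(env, policy):
    """Run one episode: the agent acts, the environment answers."""
    observation = env.reset()
    total_reward, done = 0.0, False
    while not done:
        action = policy(observation)
        observation, reward, done, info = env.step(action)
        total_reward += reward
    return total_reward, observation


if __name__ == "__main__":
    env = ToyFightEnv(max_steps=10)
    picker = random.Random(1)
    total, last_obs = run_episode(env, lambda obs: picker.randrange(env.N_ACTIONS))
    print("episode reward:", total, "steps:", last_obs["step"])
```

A trained agent would replace the random `policy` with a function that maps the observation to an action.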
The framework currently supports mainstream operating systems such as Linux, Windows, and macOS.
This AI's actual battleground is the early Tekken Tag Tournament, whose move inputs are, of course, no less complex than those of later versions.
The developer chose two iconic characters, Jin Kazama (Jin) and Yoshimitsu, as the main characters to control.
The inputs are: the pixel values of the game screen, converted from RGB to grayscale and downscaled to 128 x 128 px; the stage number; the characters' health bars; and the side of the game interface the character is on.
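The frame preprocessing described here can be sketched in plain Python as follows. The luma weights (ITU-R BT.601) and nearest-neighbor downscaling are assumptions, since the article does not say which conversion or resizing method is used; `to_grayscale` and `build_observation` are hypothetical names, and a real pipeline would more likely use NumPy or OpenCV.

```python
def to_grayscale(frame_rgb, size=128):
    """Convert a frame to a size x size grayscale image.

    frame_rgb: list of rows, each row a list of (r, g, b) tuples in 0..255.
    Uses BT.601 luma weights and nearest-neighbor sampling (both assumed).
    """
    height, width = len(frame_rgb), len(frame_rgb[0])
    out = []
    for y in range(size):
        src_row = frame_rgb[y * height // size]  # nearest source row
        row = []
        for x in range(size):
            r, g, b = src_row[x * width // size]  # nearest source pixel
            gray = int(round(0.299 * r + 0.587 * g + 0.114 * b))
            row.append(max(0, min(255, gray)))
        out.append(row)
    return out


def build_observation(frame_rgb, stage, own_health, opp_health, side):
    """Bundle the inputs listed in the article into one dictionary."""
    return {
        "frame": to_grayscale(frame_rgb),
        "stage": stage,
        "health": (own_health, opp_health),
        "side": side,
    }


if __name__ == "__main__":
    white = [[(255, 255, 255)] * 320 for _ in range(240)]
    obs = build_observation(white, stage=1, own_health=100, opp_health=100, side=0)
    print(len(obs["frame"]), len(obs["frame"][0]), obs["frame"][0][0])
```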
The training reward is a function of health: the agent receives a positive reward when the opponent loses health and a negative penalty when the character it controls loses health.
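One simple way to express a health-based reward of this kind is the difference between damage dealt and damage taken in a step. This is only a sketch under that assumption: the article does not give the exact formula or any scaling, and `health_reward` is a hypothetical name.

```python
def health_reward(prev_own, prev_opp, own, opp):
    """Reward for one step, computed from health values before and after.

    Positive when the opponent lost health, negative when the agent's
    own character lost health (assumed unscaled difference).
    """
    damage_dealt = prev_opp - opp
    damage_taken = prev_own - own
    return float(damage_dealt - damage_taken)


if __name__ == "__main__":
    # Opponent loses 20 health, own character loses 5.
    print(health_reward(prev_own=100, prev_opp=100, own=95, opp=80))
```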
The AI's action rate is also limited to 1/10 of the maximum rate, that is, it sends one action every 6 steps.
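A rate limit like this is often implemented as a wrapper that lets the agent choose once and then holds that choice for several environment steps. The sketch below assumes the chosen action is repeated during the intermediate steps and that rewards are summed; the article does not say what is sent in between. `CountingEnv` and `ActionEveryN` are hypothetical names for illustration.

```python
class CountingEnv:
    """Hypothetical toy environment: reward 1.0 per step, fixed length."""

    def __init__(self, max_steps=20):
        self.max_steps = max_steps
        self.steps = 0

    def reset(self):
        self.steps = 0
        return self.steps

    def step(self, action):
        self.steps += 1
        done = self.steps >= self.max_steps
        return self.steps, 1.0, done, {}


class ActionEveryN:
    """Let the agent decide once every n environment steps."""

    def __init__(self, env, n=6):
        self.env = env
        self.n = n

    def reset(self):
        return self.env.reset()

    def step(self, action):
        total_reward = 0.0
        for _ in range(self.n):
            # Assumed behavior: hold the same action for all n steps.
            obs, reward, done, info = self.env.step(action)
            total_reward += reward
            if done:
                break  # stop early if the episode ends mid-repeat
        return obs, total_reward, done, info


if __name__ == "__main__":
    env = ActionEveryN(CountingEnv(max_steps=20), n=6)
    env.reset()
    done = False
    while not done:
        obs, reward, done, _ = env.step(0)
        print("env step:", obs, "reward:", reward, "done:", done)
```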
Because the framework uses a discrete action space, the agent can choose only one action at a time during training: either a movement (one of 9, such as up or down) or an attack (impact, kick, punch).
As a result, although combos are more effective in real fights, the AI cannot press two buttons at the same time, so in actual matches it ends up relying heavily on two actions, Kick and Swap (changing characters):
Veteran PvP fans in the comments say they want to see top AI players like this one battle each other, and the developer himself agrees:
We are building a platform where programmers will submit their trained AIs to compete against each other, and we will broadcast the matches on our channel.
The AI tournament
The developer team has now begun formally preparing this "AI Tournament". The programmers and developers behind the AIs will act much like "coaches" or "contestants' parents", and the final winner can receive 1,400 Swiss francs (about 9,261 RMB).
The "events" are not limited to Tekken. According to the developers, the underlying mechanics of these fighting games are similar, so only game-specific attributes such as combos and character health values need to be changed.
Accordingly, their DIAMBRA Arena framework provides a Python API fully compliant with the OpenAI Gym standard for a variety of arcade video games.
Popular arcade games such as Dead or Alive and Street Fighter are already included:
GitHub link:
https://github.com/diambra/diambraArena
Video link:
https://www.youtube.com/watch?v=9HAKEjhIfJY
Reference links:
[1]https://www.reddit.com/r/reinforcementlearning/comments/sq1s3f/deep_reinforcement_learning_algorithm_completing/
[2]https://www.reddit.com/r/MachineLearning/comments/sqra1n/p_deep_reinforcement_learning_algorithm/