当前位置:网站首页>Hunan University | robust Multi-Agent Reinforcement Learning in noisy environment
Hunan University | robust Multi-Agent Reinforcement Learning in noisy environment
2022-07-04 01:46:00 【Zhiyuan community】
Despite recent intensive learning (RL) Progress has been made in , But by RL Trained agents are usually sensitive to the environment , Especially in multi-agent scenarios . The existing Multi-Agent Reinforcement learning methods can work well only under the assumption of perfect environment . However , The real world environment is usually noisy . Inaccurate information obtained from noisy environment will hinder the learning of agent , Even lead to training failure . This paper focuses on the problem of training multiple robust agents in noisy environment . In this paper, a new algorithm is proposed , Multi-agent fault-tolerant reinforcement learning (MAFTRL). The main idea of this paper is to establish the error detection mechanism of agent itself , Design the information communication medium between agents . The error detection mechanism is based on automatic encoder , Calculate the reliability of each agent's observation , Effectively reduce environmental noise . Communication media based on attention mechanism can significantly improve the ability of agents to extract effective information . Experimental results show that , The method in this paper accurately detects the error observation of agents , It has good performance and strong robustness in traditional reliable environment and noisy environment . Besides ,MAFTRL It is obviously superior to traditional methods in noisy environment .
边栏推荐
- Applet graduation project is based on wechat classroom laboratory reservation applet graduation project opening report function reference
- Do you know the eight signs of a team becoming agile?
- Ceramic metal crowns - current market situation and future development trend
- Hbuilder link Xiaoyao simulator
- How can enterprises optimize the best cost of cloud computing?
- 51 MCU external interrupt
- MySQL uses the view to report an error, explain/show can not be issued; lacking privileges for underlying table
- Pesticide synergist - current market situation and future development trend
- Yyds dry goods inventory it's not easy to say I love you | use the minimum web API to upload files
- In the process of seeking human intelligent AI, meta bet on self supervised learning
猜你喜欢
Audio resource settings for U3D resource management
What is the student party's Bluetooth headset recommendation? Student party easy to use Bluetooth headset recommended
LeetCode 168. Detailed explanation of Excel list name
Yyds dry goods inventory it's not easy to say I love you | use the minimum web API to upload files
SQL statement
Feign implements dynamic URL
Openbionics robot project introduction | bciduino community finishing
MySQL deadly serial question 2 -- are you familiar with MySQL index?
TP5 automatic registration hook mechanism hook extension, with a complete case
Should enterprises start building progressive web applications?
随机推荐
Writeup (real questions and analysis of ciscn over the years) of the preliminary competition of national college students' information security competition
Rearrangement of tag number of cadence OrCAD components and sequence number of schematic page
Conditional statements of shell programming
C import Xls data method summary II (save the uploaded file to the DataTable instance object)
Maximum likelihood method, likelihood function and log likelihood function
Trading software programming
After listening to the system clear message notification, Jerry informed the device side to delete the message [article]
Luogu p1309 Swiss wheel
Mongodb learning notes: command line tools
Jerry's synchronous weather information to equipment [chapter]
The force deduction method summarizes the single elements in the 540 ordered array
Will the memory of ParticleSystem be affected by maxparticles
File contains vulnerability summary
TP5 automatic registration hook mechanism hook extension, with a complete case
Neo4j learning notes
Three layer switching ②
2022 electrician (elementary) examination question bank and electrician (elementary) simulation examination question bank
QML add gradient animation during state transition
Jerry's watch information type table [chapter]
Small program graduation project based on wechat reservation small program graduation project opening report reference