当前位置:网站首页>Hunan University | robust Multi-Agent Reinforcement Learning in noisy environment
Hunan University | robust Multi-Agent Reinforcement Learning in noisy environment
2022-07-04 01:46:00 【Zhiyuan community】
Despite recent intensive learning (RL) Progress has been made in , But by RL Trained agents are usually sensitive to the environment , Especially in multi-agent scenarios . The existing Multi-Agent Reinforcement learning methods can work well only under the assumption of perfect environment . However , The real world environment is usually noisy . Inaccurate information obtained from noisy environment will hinder the learning of agent , Even lead to training failure . This paper focuses on the problem of training multiple robust agents in noisy environment . In this paper, a new algorithm is proposed , Multi-agent fault-tolerant reinforcement learning (MAFTRL). The main idea of this paper is to establish the error detection mechanism of agent itself , Design the information communication medium between agents . The error detection mechanism is based on automatic encoder , Calculate the reliability of each agent's observation , Effectively reduce environmental noise . Communication media based on attention mechanism can significantly improve the ability of agents to extract effective information . Experimental results show that , The method in this paper accurately detects the error observation of agents , It has good performance and strong robustness in traditional reliable environment and noisy environment . Besides ,MAFTRL It is obviously superior to traditional methods in noisy environment .
边栏推荐
- Mobile phone battery - current market situation and future development trend
- Magical usage of edge browser (highly recommended by program ape and student party)
- String & memory function (detailed explanation)
- IPv6 experiment
- 0 basic learning C language - nixie tube dynamic scanning display
- C import Xls data method summary II (save the uploaded file to the DataTable instance object)
- Meta metauniverse female safety problems occur frequently. How to solve the related problems in the metauniverse?
- Huawei cloud micro certification Huawei cloud computing service practice has been stable
- Experimental animal models - current market situation and future development trend
- I don't know why it can't run in the project and how to change it
猜你喜欢
How programmers find girlfriends through blind dates
Force buckle day32
Avoid playing with super high conversion rate in material minefields
Small program graduation design is based on wechat order takeout small program graduation design opening report function reference
Huawei cloud micro certification Huawei cloud computing service practice has been stable
Pyinstaller packaging py script warning:lib not found and other related issues
String hash, find the string hash value after deleting any character, double hash
Feign implements dynamic URL
MySQL deadly serial question 2 -- are you familiar with MySQL index?
HackTheBox-baby breaking grad
随机推荐
Remember a lazy query error
Experimental animal models - current market situation and future development trend
51 MCU external interrupt
How to use AHAS to ensure the stability of Web services?
C library function int fprintf (file *stream, const char *format,...) Send formatted output to stream
Basic editing specifications and variables of shell script
2022 electrician (elementary) examination question bank and electrician (elementary) simulation examination question bank
[typora installation package] old typera installation package, free version
Maximum likelihood method, likelihood function and log likelihood function
0 basic learning C language - nixie tube dynamic scanning display
2020-12-02 SSM advanced integration Shang Silicon Valley
Is Shengang securities company as safe as other securities companies
The contact data on Jerry's management device supports reading and updating operations [articles]
Ka! Why does the seat belt suddenly fail to pull? After reading these pictures, I can't stop wearing them
Difference between value and placeholder
Reading notes - learn to write: what is writing?
CLP information - how does the digital transformation of credit business change from star to finger?
Rearrangement of tag number of cadence OrCAD components and sequence number of schematic page
C import Xls data method summary II (save the uploaded file to the DataTable instance object)
Meta metauniverse female safety problems occur frequently. How to solve the related problems in the metauniverse?