当前位置:网站首页>Hunan University | robust Multi-Agent Reinforcement Learning in noisy environment
Hunan University | robust Multi-Agent Reinforcement Learning in noisy environment
2022-07-04 01:46:00 【Zhiyuan community】
Despite recent intensive learning (RL) Progress has been made in , But by RL Trained agents are usually sensitive to the environment , Especially in multi-agent scenarios . The existing Multi-Agent Reinforcement learning methods can work well only under the assumption of perfect environment . However , The real world environment is usually noisy . Inaccurate information obtained from noisy environment will hinder the learning of agent , Even lead to training failure . This paper focuses on the problem of training multiple robust agents in noisy environment . In this paper, a new algorithm is proposed , Multi-agent fault-tolerant reinforcement learning (MAFTRL). The main idea of this paper is to establish the error detection mechanism of agent itself , Design the information communication medium between agents . The error detection mechanism is based on automatic encoder , Calculate the reliability of each agent's observation , Effectively reduce environmental noise . Communication media based on attention mechanism can significantly improve the ability of agents to extract effective information . Experimental results show that , The method in this paper accurately detects the error observation of agents , It has good performance and strong robustness in traditional reliable environment and noisy environment . Besides ,MAFTRL It is obviously superior to traditional methods in noisy environment .
边栏推荐
- Force buckle day32
- Use classname to modify style properties
- Human resource management online assignment
- Difference between value and placeholder
- Some other configurations on Huawei's spanning tree
- Conditional test, if, case conditional test statements of shell script
- 2022 R2 mobile pressure vessel filling certificate examination and R2 mobile pressure vessel filling simulation examination questions
- Will the memory of ParticleSystem be affected by maxparticles
- All ceramic crowns - current market situation and future development trend
- C import Xls data method summary II (save the uploaded file to the DataTable instance object)
猜你喜欢

Force buckle day32

Openbionics exoskeleton project introduction | bciduino community finishing

Applet graduation project based on wechat selection voting applet graduation project opening report function reference

Pyinstaller packaging py script warning:lib not found and other related issues

Remember a lazy query error

Yyds dry goods inventory it's not easy to say I love you | use the minimum web API to upload files

Huawei BFD and NQA

Since the "epidemic", we have adhered to the "no closing" of data middle office services

A malware detection method for checking PLC system using satisfiability modulus theoretical model

Huawei cloud micro certification Huawei cloud computing service practice has been stable
随机推荐
AI helps make new breakthroughs in art design plagiarism retrieval! Professor Liu Fang's team paper was employed by ACM mm, a multimedia top-level conference
Software product download collection
MySQL deadly serial question 2 -- are you familiar with MySQL index?
Feign implements dynamic URL
The force deduction method summarizes the single elements in the 540 ordered array
MySQL - use of aggregate functions and group by groups
Bacteriostatic circle scanning correction template
How to view the computing power of GPU?
Mongodb learning notes: command line tools
Huawei cloud micro certification Huawei cloud computing service practice has been stable
Jerry's watch listens to the message notification of the target third-party software and pushes the message to the device [article]
Mobile phone battery - current market situation and future development trend
Audio resource settings for U3D resource management
File contains vulnerability summary
Difference between value and placeholder
Ka! Why does the seat belt suddenly fail to pull? After reading these pictures, I can't stop wearing them
Fundamentals of machine learning: feature selection with lasso
Solution of cursor thickening
How can enterprises optimize the best cost of cloud computing?
All in one 1412: binary classification