Hunan University | Robust Multi-Agent Reinforcement Learning in Noisy Environments
2022-07-04 01:46:00 【Zhiyuan community】
Despite recent progress in reinforcement learning (RL), agents trained with RL are usually sensitive to their environment, especially in multi-agent scenarios. Existing multi-agent reinforcement learning methods work well only under the assumption of a perfect environment. Real-world environments, however, are usually noisy. Inaccurate information obtained from a noisy environment hinders the agents' learning and can even cause training to fail. This paper addresses the problem of training multiple robust agents in noisy environments and proposes a new algorithm, multi-agent fault-tolerant reinforcement learning (MAFTRL). The main idea is to give each agent its own error detection mechanism and to design a communication medium between agents. The error detection mechanism is based on an autoencoder: it computes the reliability of each agent's observation and effectively reduces the impact of environmental noise. The attention-based communication medium significantly improves the agents' ability to extract useful information. Experimental results show that the method accurately detects agents' erroneous observations and achieves good performance and strong robustness in both traditional reliable environments and noisy environments. Moreover, MAFTRL clearly outperforms traditional methods in noisy environments.
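The abstract does not describe the autoencoder's architecture or how reliability is computed from it. As a rough illustration of the general idea only, the sketch below scores an observation by how well an autoencoder fitted on clean data can reconstruct it. Everything here is an assumption made for illustration and not taken from the paper: the linear autoencoder (equivalent to PCA), the `exp(-error / scale)` mapping, the synthetic data, and the function names.

```python
import numpy as np

def fit_linear_autoencoder(clean_obs, latent_dim):
    """Fit a linear autoencoder (equivalent to PCA) on clean observations."""
    mean = clean_obs.mean(axis=0)
    # Rows of vt are principal directions; keep the top latent_dim of them.
    _, _, vt = np.linalg.svd(clean_obs - mean, full_matrices=False)
    return mean, vt[:latent_dim]

def reconstruction_error(obs, mean, components):
    """Squared error between each observation and its encode/decode round trip."""
    centered = obs - mean
    recon = centered @ components.T @ components
    return np.sum((centered - recon) ** 2, axis=-1)

def reliability(obs, mean, components, scale):
    """Map reconstruction error to a score in [0, 1]; 1 means perfect reconstruction.

    The exponential mapping is an illustrative choice, not the paper's formula.
    """
    return np.exp(-reconstruction_error(obs, mean, components) / scale)

rng = np.random.default_rng(0)

# Hypothetical data: 8-dimensional observations lying near a 2-dimensional subspace.
basis = rng.normal(size=(2, 8))
clean = rng.normal(size=(500, 2)) @ basis + 0.01 * rng.normal(size=(500, 8))
mean, components = fit_linear_autoencoder(clean, latent_dim=2)

# Calibrate the scale on clean data so that typical clean errors score close to 1.
scale = 10.0 * reconstruction_error(clean, mean, components).mean()

good_obs = rng.normal(size=(1, 2)) @ basis                      # consistent with training data
noisy_obs = good_obs + rng.normal(scale=1.0, size=(1, 8))       # corrupted by heavy noise

r_good = reliability(good_obs, mean, components, scale)[0]
r_noisy = reliability(noisy_obs, mean, components, scale)[0]
print(f"reliability (clean observation): {r_good:.4f}")
print(f"reliability (noisy observation): {r_noisy:.4f}")
```

In a multi-agent setting, a score like this could be used to down-weight an unreliable agent's observation before it is shared with other agents; how MAFTRL actually combines the score with its attention-based communication is not stated in the abstract.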