当前位置:网站首页>Hunan University | robust Multi-Agent Reinforcement Learning in noisy environment
Hunan University | robust Multi-Agent Reinforcement Learning in noisy environment
2022-07-04 01:46:00 【Zhiyuan community】
Despite recent intensive learning (RL) Progress has been made in , But by RL Trained agents are usually sensitive to the environment , Especially in multi-agent scenarios . The existing Multi-Agent Reinforcement learning methods can work well only under the assumption of perfect environment . However , The real world environment is usually noisy . Inaccurate information obtained from noisy environment will hinder the learning of agent , Even lead to training failure . This paper focuses on the problem of training multiple robust agents in noisy environment . In this paper, a new algorithm is proposed , Multi-agent fault-tolerant reinforcement learning (MAFTRL). The main idea of this paper is to establish the error detection mechanism of agent itself , Design the information communication medium between agents . The error detection mechanism is based on automatic encoder , Calculate the reliability of each agent's observation , Effectively reduce environmental noise . Communication media based on attention mechanism can significantly improve the ability of agents to extract effective information . Experimental results show that , The method in this paper accurately detects the error observation of agents , It has good performance and strong robustness in traditional reliable environment and noisy environment . Besides ,MAFTRL It is obviously superior to traditional methods in noisy environment .
边栏推荐
- All metal crowns - current market situation and future development trend
- Hbuilder link Xiaoyao simulator
- Portable two-way radio equipment - current market situation and future development trend
- Who moved my code!
- C library function int fprintf (file *stream, const char *format,...) Send formatted output to stream
- Pyinstaller packaging py script warning:lib not found and other related issues
- Mobile phone battery - current market situation and future development trend
- How to delete MySQL components using xshell7?
- TP5 automatic registration hook mechanism hook extension, with a complete case
- The latest analysis of hoisting machinery command in 2022 and free examination questions of hoisting machinery command
猜你喜欢

LeetCode 168. Detailed explanation of Excel list name

Applet graduation project is based on wechat classroom laboratory reservation applet graduation project opening report function reference

【.NET+MQTT】. Net6 environment to achieve mqtt communication, as well as bilateral message subscription and publishing code demonstration of server and client
![The contact data on Jerry's management device supports reading and updating operations [articles]](/img/89/d36e785bd94c2373c34fb95eee3a9c.jpg)
The contact data on Jerry's management device supports reading and updating operations [articles]

Long article review: entropy, free energy, symmetry and dynamics in the brain
![When the watch system of Jerry's is abnormal, it is used to restore the system [chapter]](/img/fb/7d4a026260f8817460cc67f06e49ae.jpg)
When the watch system of Jerry's is abnormal, it is used to restore the system [chapter]

SRCNN:Learning a Deep Convolutional Network for Image Super-Resolution

C import Xls data method summary II (save the uploaded file to the DataTable instance object)

CLP information - how does the digital transformation of credit business change from star to finger?

MySQL - use of aggregate functions and group by groups
随机推荐
Meta metauniverse female safety problems occur frequently, how to solve the relevant problems in the metauniverse?
Flutter local database sqflite
Force deduction solution summary 1189- maximum number of "balloons"
Customize redistemplate tool class
Small program graduation design is based on wechat order takeout small program graduation design opening report function reference
G3 boiler water treatment registration examination and G3 boiler water treatment theory examination in 2022
Summary of common tools and technical points of PMP examination
String & memory function (detailed explanation)
MySQL deadly serial question 2 -- are you familiar with MySQL index?
Software product download collection
HackTheBox-baby breaking grad
All metal crowns - current market situation and future development trend
In yolov5, denselayer is used to replace focus, and the FPN structure is changed to bi FPN
LeetCode 168. Detailed explanation of Excel list name
[leetcode daily question] a single element in an ordered array
Jerry's modification setting status [chapter]
Typescript basic knowledge sorting
【.NET+MQTT】. Net6 environment to achieve mqtt communication, as well as bilateral message subscription and publishing code demonstration of server and client
技術實踐|線上故障分析及解决方法(上)
Introduction to Tianchi news recommendation: 4 Characteristic Engineering