3. Multi-Agent Reinforcement Learning
2022-07-08 01:26:00 【C--G】
Basic concepts
Settings
- Fully Cooperative Setting
- Fully Competitive Setting
- Mixed Cooperative & Competitive
- Self-Interested Setting
Basic terminology
State, Action, State Transition
Rewards
Returns
Policy Network
Uncertainty in the Return
State-Value Function
Convergence
- Single-Agent Policy Learning
- Multi-Agent Policy Learning
- Difficulty of MARL
Single-Agent Policy Gradient for MARL
Architectures
Fully Decentralized
- Execution
- Actor-Critic Method
Fully Centralized
- Method
- Shortcoming: Slow during Execution
Centralized Training with Decentralized Execution
Parameter Sharing