5. Discrete control and continuous control

2022-07-08 01:20:00 C--G

Discrete vs. Continuous Control

Discrete
Continuous
DQN: the network outputs one value per action, so its output dimension equals the number of actions. It cannot be used directly for continuous control.
Policy Network: the network outputs one probability per action, so it has the same limitation. It cannot be used directly for continuous control.
To use DQN for continuous control, the continuous action space must first be discretized.
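As a rough illustration of what discretizing a continuous action space involves, the sketch below lays a uniform grid over a box-shaped action space. The bounds, the number of grid points, and the function name `discretize_action_space` are assumptions made for this example. The grid size grows as bins^d with the action dimension d, which is why discretization is only practical for low-dimensional actions.

```python
import itertools

def discretize_action_space(low, high, bins):
    """Return every point of a uniform grid over the box [low, high].

    low, high: per-dimension bounds (hypothetical example values).
    bins: number of grid points per dimension (must be >= 2).
    """
    axes = []
    for lo, hi in zip(low, high):
        step = (hi - lo) / (bins - 1)
        axes.append([lo + i * step for i in range(bins)])
    # Cartesian product of the per-dimension grids.
    return list(itertools.product(*axes))

# A 2-dimensional action with 5 grid points per dimension.
grid = discretize_action_space([0.0, 0.0], [180.0, 360.0], bins=5)
print(len(grid))  # 5 ** 2 = 25 discrete actions

# The number of discrete actions is bins ** d.
for d in (1, 2, 6):
    print(d, 5 ** d)
```

With 6 action dimensions and only 5 points per dimension, a DQN would already need 15625 outputs.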
Better Approaches to Continuous Control

Deterministic Policy Network

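A deterministic policy network maps a state directly to an action vector, a = π(s; θ), instead of outputting a probability for each action. The sketch below is a minimal stand-in, assuming a single tanh layer and small random weights. The layer size, the names `init_policy` and `deterministic_policy`, and the example state are all hypothetical.

```python
import math
import random

def init_policy(state_dim, action_dim, seed=0):
    """Create small random weights for a one-layer policy (toy example)."""
    rng = random.Random(seed)
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(state_dim)]
               for _ in range(action_dim)]
    biases = [0.0] * action_dim
    return weights, biases

def deterministic_policy(state, params):
    """Map a state to one action vector; no sampling is involved."""
    weights, biases = params
    return [math.tanh(sum(w * s for w, s in zip(row, state)) + b)
            for row, b in zip(weights, biases)]

params = init_policy(state_dim=3, action_dim=2)
state = [0.2, -1.0, 0.5]
action = deterministic_policy(state, params)
print(action)  # a 2-dimensional action, each entry in (-1, 1)
```

Calling the policy twice on the same state returns the same action, which is what distinguishes it from a stochastic policy.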

Updating Value Network by TD

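The TD update for the value network can be sketched as follows, assuming the usual form: TD target y = r + γ·q(s', a'; w) with a' = π(s'; θ), TD error δ = q(s, a; w) − y, and update w ← w − α·δ·∂q/∂w. To keep the example self-contained, the critic here is a linear function of the concatenated state and action, which is a simplification and not the network from the original figures. The names `q_value` and `td_update` are hypothetical.

```python
def q_value(w, s, a):
    """Toy linear critic: q(s, a; w) = w . [s, a]."""
    x = s + a  # concatenate the state and action lists
    return sum(wi * xi for wi, xi in zip(w, x))

def td_update(w, s, a, r, s_next, a_next, gamma=0.9, lr=0.1):
    """One TD step. a_next is assumed to be pi(s_next; theta)."""
    q_t = q_value(w, s, a)
    y = r + gamma * q_value(w, s_next, a_next)  # TD target, held constant
    delta = q_t - y                             # TD error
    grad = s + a                                # dq/dw for the linear critic
    new_w = [wi - lr * delta * gi for wi, gi in zip(w, grad)]
    return new_w, delta

w = [0.0, 0.0, 0.0]
w, delta = td_update(w, s=[1.0, 0.0], a=[1.0], r=1.0,
                     s_next=[0.0, 1.0], a_next=[0.5])
print(delta)  # -1.0, because q(s, a) = 0 and the TD target is 1
print(w)
```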

Updating Policy Network by DPG

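The deterministic policy gradient (DPG) follows from the chain rule: g = ∂a/∂θ · ∂q(s, a; w)/∂a with a = π(s; θ), and θ is updated by gradient ascent. The sketch below uses a linear policy with a scalar action and a toy critic q(s, a) = −(a − 2)², both chosen only so that the gradients can be written by hand. The function names and the constant 2 are assumptions for illustration.

```python
def policy(theta, s):
    """Linear deterministic policy with a scalar action: a = theta . s."""
    return sum(t * x for t, x in zip(theta, s))

def dq_da(a, target=2.0):
    """Gradient of the toy critic q(s, a) = -(a - target)^2 w.r.t. a."""
    return -2.0 * (a - target)

def dpg_step(theta, s, lr=0.1):
    """One gradient-ascent step: theta += lr * (da/dtheta) * (dq/da)."""
    a = policy(theta, s)
    da_dtheta = s  # for a linear policy, da/dtheta is the state itself
    g = [x * dq_da(a) for x in da_dtheta]
    return [t + lr * gi for t, gi in zip(theta, g)]

theta = [0.0, 0.0]
s = [1.0, 0.5]
for _ in range(100):
    theta = dpg_step(theta, s)
print(policy(theta, s))  # approaches 2.0, the action the toy critic prefers
```

The critic's parameters are not changed in this step; only the policy parameters θ move, in the direction that increases the critic's score.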

Improvement: Using Target Networks

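Target networks are commonly kept close to the online networks by weighted averaging, w⁻ ← τ·w + (1 − τ)·w⁻ with a hyper-parameter τ in (0, 1). The sketch below assumes that update rule and represents parameters as flat lists. The name `soft_update` and the values of τ are illustrative choices.

```python
def soft_update(online, target, tau=0.01):
    """Return target parameters moved a fraction tau toward the online ones."""
    return [tau * w + (1.0 - tau) * w_t for w, w_t in zip(online, target)]

w = [1.0, 2.0]         # online network parameters (toy values)
w_target = [0.0, 0.0]  # target network parameters
w_target = soft_update(w, w_target, tau=0.5)
print(w_target)  # [0.5, 1.0]
```

The same rule would be applied to both the target value network and the target policy network.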
How to improve

Stochastic Policy for Continuous Control


Policy Network

Univariate Normal Distribution
Multivariate Normal Distribution
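For reference, the two densities can be written out as code. The multivariate case below assumes the action dimensions are independent, so the density is a product of univariate densities. That is a simplifying assumption commonly made for policy networks, and the function names are hypothetical.

```python
import math

def normal_pdf(a, mu, sigma):
    """Univariate normal density with mean mu and standard deviation sigma."""
    z = (a - mu) ** 2 / (2.0 * sigma ** 2)
    return math.exp(-z) / (math.sqrt(2.0 * math.pi) * sigma)

def diag_normal_pdf(a, mu, sigma):
    """Multivariate normal density with independent dimensions."""
    p = 1.0
    for a_i, mu_i, sigma_i in zip(a, mu, sigma):
        p *= normal_pdf(a_i, mu_i, sigma_i)
    return p

print(normal_pdf(0.0, 0.0, 1.0))  # about 0.3989, i.e. 1 / sqrt(2 * pi)
print(diag_normal_pdf([0.0, 0.0], [0.0, 0.0], [1.0, 1.0]))  # about 0.1592
```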
Function Approximation
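One common way to approximate the policy is to let one network output the mean μ(s) and another output the log-variance ρ(s) = ln σ²(s), then sample each action dimension from N(μ_i, exp(ρ_i)). The sketch below assumes that parameterization and takes μ and ρ as given lists rather than computing them with real networks. The name `sample_action` and the example values are hypothetical.

```python
import math
import random

def sample_action(mu, rho, rng):
    """Draw a_i ~ N(mu_i, sigma_i^2) with sigma_i^2 = exp(rho_i)."""
    action = []
    for mu_i, rho_i in zip(mu, rho):
        sigma_i = math.exp(0.5 * rho_i)  # standard deviation from log-variance
        action.append(rng.gauss(mu_i, sigma_i))
    return action

rng = random.Random(0)
mu = [0.5, -1.0]   # would come from the mean network
rho = [0.0, -2.0]  # would come from the log-variance network
print(sample_action(mu, rho, rng))
```

Predicting the log-variance rather than the variance keeps the output unconstrained, since exp(ρ) is positive for any real ρ.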

Training Policy Network


Auxiliary Network

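A sketch of an auxiliary network, assuming it is defined as f(s, a; θ) = Σ_i [ −ρ_i/2 − (a_i − μ_i)² / (2·exp(ρ_i)) ] with ρ_i = ln σ_i². Under that assumption f equals ln π(a|s; θ) up to an additive constant, so both have the same gradient with respect to θ. The code below checks the constant numerically and gives the gradients of f with respect to μ and ρ; all names and example values are illustrative.

```python
import math

def aux_f(a, mu, rho):
    """Auxiliary function: log-density without the constant term."""
    return sum(-0.5 * r - (x - m) ** 2 / (2.0 * math.exp(r))
               for x, m, r in zip(a, mu, rho))

def log_pdf(a, mu, rho):
    """Log-density of independent normals with variances exp(rho_i)."""
    total = 0.0
    for x, m, r in zip(a, mu, rho):
        var = math.exp(r)
        total += -0.5 * math.log(2.0 * math.pi * var) - (x - m) ** 2 / (2.0 * var)
    return total

def grad_f(a, mu, rho):
    """Gradients of aux_f with respect to mu and rho."""
    d_mu = [(x - m) / math.exp(r) for x, m, r in zip(a, mu, rho)]
    d_rho = [-0.5 + (x - m) ** 2 / (2.0 * math.exp(r))
             for x, m, r in zip(a, mu, rho)]
    return d_mu, d_rho

a, mu, rho = [0.3, -1.2], [0.0, 0.5], [0.2, -0.4]
const = -0.5 * len(a) * math.log(2.0 * math.pi)
print(log_pdf(a, mu, rho) - aux_f(a, mu, rho), const)  # the two values agree
print(grad_f(a, mu, rho))
```

In a real implementation these gradients would be backpropagated through the mean and log-variance networks to obtain ∂f/∂θ.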

Policy Gradient Methods

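As one concrete instance of a policy gradient method, a REINFORCE-style update replaces the action value Q(s_t, a_t) in the policy gradient with the observed discounted return u_t = r_t + γ·r_{t+1} + γ²·r_{t+2} + …. An actor-critic method would use a value network's estimate instead. The sketch below only computes the returns from one recorded episode; the name `discounted_returns` and the reward values are assumptions for illustration.

```python
def discounted_returns(rewards, gamma):
    """Return u_t for every step t of one episode, computed backwards."""
    returns = [0.0] * len(rewards)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns

print(discounted_returns([1.0, 1.0, 1.0], gamma=0.5))  # [1.75, 1.5, 1.0]
```

Each u_t would then scale the gradient of the log-probability of the action taken at step t.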

Original site

Copyright notice
This article was written by [C--G]. Please include a link to the original when reposting. Thank you.
https://yzsam.com/2022/189/202207072320355505.html