5. Discrete control and continuous control
2022-07-08 01:20:00 【C--G】
Discrete vs. Continuous Control
Discrete
Continuous
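DQN: the output has one dimension per action, so it cannot be used directly for continuous control, where there are infinitely many actions.
Policy network: the output likewise has one dimension per action, so it cannot be used directly for continuous control either.
To use DQN for continuous control anyway, the continuous action space must first be discretized.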
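As a rough illustration of the discretization mentioned above, the sketch below lays a uniform grid over a box-shaped action space. The function name `discretize`, the 2-joint arm and the choice of 5 points per joint are assumptions made for this example only.

```python
from itertools import product

def discretize(low, high, points_per_dim):
    """Return a uniform grid of discrete actions covering the box [low, high]."""
    axes = []
    for lo, hi in zip(low, high):
        step = (hi - lo) / (points_per_dim - 1)
        axes.append([lo + i * step for i in range(points_per_dim)])
    # Cartesian product of the per-dimension grids
    return list(product(*axes))

# Hypothetical 2-joint arm: each joint angle lies in [0, 360] degrees
actions = discretize([0.0, 0.0], [360.0, 360.0], 5)
print(len(actions))  # 25 discrete actions
```

The number of grid points is `points_per_dim ** d` for a d-dimensional action, so it grows exponentially with the number of dimensions. That is why discretization is only practical for low-dimensional action spaces.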

Better Approaches to Continuous Control
Deterministic policy network




Updating Value Network by TD
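A minimal sketch of one TD step for the value network might look as follows. The linear critic `q(s, a; w) = w[0]*s + w[1]*a`, the linear policy `a = theta * s`, and all numeric values are assumptions chosen to keep the example tiny. They are not the networks used in the original slides.

```python
def policy(s, theta):
    """Toy deterministic policy: a = theta * s (assumed form)."""
    return theta * s

def critic(s, a, w):
    """Toy linear value network: q(s, a; w) = w[0]*s + w[1]*a (assumed form)."""
    return w[0] * s + w[1] * a

def td_update(w, theta, transition, gamma=0.9, lr=0.1):
    """One gradient step on 0.5 * delta**2, treating the TD target as a constant."""
    s, a, r, s_next = transition
    q = critic(s, a, w)
    a_next = policy(s_next, theta)                # next action comes from the policy
    y = r + gamma * critic(s_next, a_next, w)     # TD target
    delta = q - y                                 # TD error
    grad_q = (s, a)                               # dq/dw for the linear critic
    new_w = [w[0] - lr * delta * grad_q[0], w[1] - lr * delta * grad_q[1]]
    return new_w, delta

w, delta = td_update([0.0, 0.0], theta=1.0, transition=(1.0, 1.0, 1.0, 1.0))
print(w, delta)
```

Note that the next action is not sampled. It is computed by the deterministic policy, and only then fed to the critic to form the target.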

Updating Policy Network by DPG
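The deterministic policy gradient follows from the chain rule: the gradient of `q(s, pi(s; theta))` with respect to `theta` is `da/dtheta * dq/da`, and the policy is updated by gradient ascent. The sketch below uses a made-up quadratic critic `q(s, a) = -(a - w*s)**2` and a linear policy `a = theta * s`. Both are assumptions for illustration only.

```python
def dpg_step(theta, w, s, lr=0.1):
    """One gradient-ascent step on q(s, pi(s; theta)) with the critic held fixed."""
    a = theta * s                    # action chosen by the toy policy
    dq_da = -2.0 * (a - w * s)       # derivative of the toy critic w.r.t. the action
    da_dtheta = s                    # derivative of the toy policy w.r.t. theta
    return theta + lr * da_dtheta * dq_da

theta = 0.0
for _ in range(50):
    theta = dpg_step(theta, w=2.0, s=1.0)
print(theta)  # approaches 2.0, the action scale this toy critic rates highest
```

Only the policy parameter changes here. The critic parameter `w` stays fixed during the policy update.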



Improvement: Using Target Networks
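With target networks, the TD target is computed from slowly moving copies of the policy and value networks rather than from the networks being trained. A common way to move the copies is a weighted average of the target and online parameters. The sketch below assumes that form. The name `soft_update` and the hyperparameter `tau` are labels chosen for this example.

```python
def soft_update(target, online, tau=0.01):
    """Move each target parameter a fraction tau toward the online parameter."""
    return [tau * o + (1.0 - tau) * t for t, o in zip(target, online)]

# Hypothetical parameter vectors
w_online = [1.0, 1.0]
w_target = [0.0, 1.0]
w_target = soft_update(w_target, w_online, tau=0.1)
print(w_target)
```

A small `tau` keeps the target nearly constant between updates, which reduces the bootstrapping feedback that arises when the same network produces both the prediction and its target.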





How to improve 

Stochastic Policy for Continuous Control



Policy Network
Univariate Normal Distribution
Multivariate Normal Distribution
Function Approximation
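A stochastic policy for continuous actions can be sketched as a normal distribution whose parameters are produced by a network. The sketch below assumes the action dimensions are independent and that the network outputs, per dimension, a mean `mu` and a log-variance `rho = ln(sigma**2)`. Here `mu` and `rho` are plain lists standing in for network outputs.

```python
import math
import random

def sample_action(mu, rho, rng=random):
    """Draw each action dimension from N(mu_i, sigma_i**2), with sigma_i = exp(rho_i / 2)."""
    return [rng.gauss(m, math.exp(0.5 * r)) for m, r in zip(mu, rho)]

def log_prob(action, mu, rho):
    """Log density of the action under the diagonal normal distribution."""
    total = 0.0
    for a, m, r in zip(action, mu, rho):
        total += -0.5 * (math.log(2.0 * math.pi) + r + (a - m) ** 2 / math.exp(r))
    return total

mu, rho = [0.0, 1.0], [0.0, -2.0]   # hypothetical network outputs
a = sample_action(mu, rho)
print(a, log_prob(a, mu, rho))
```

Working with the log-variance keeps the variance positive without constraining the network output, and the log density is the quantity a policy gradient method differentiates.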


Training Policy Network

Auxiliary Network









Policy Gradient Methods



