5. Discrete control and continuous control
2022-07-08 01:20:00 【C--G】
Discrete vs. Continuous Control
Discrete
Continuous
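DQN: the output layer has one dimension per action, so it cannot be applied directly to continuous control, where the action space contains infinitely many actions.
Policy network (discrete-action form): the output likewise has one dimension per action, so it cannot be applied directly to continuous control either.
To use DQN for continuous control anyway, the continuous action space must first be discretized.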
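A minimal sketch of the discretization step, assuming each action dimension is cut into an evenly spaced grid. The function name discretize_action_space and the 2-DoF arm bounds are hypothetical, chosen only for illustration. The number of discrete actions is bins_per_dim ** d, which is why this approach is only practical when the action dimension d is small.

```python
import itertools
import numpy as np

def discretize_action_space(low, high, bins_per_dim):
    """Return a (bins_per_dim ** d, d) array of grid actions covering [low, high]."""
    low = np.asarray(low, dtype=float)
    high = np.asarray(high, dtype=float)
    # One evenly spaced axis per action dimension.
    axes = [np.linspace(lo, hi, bins_per_dim) for lo, hi in zip(low, high)]
    # Cartesian product of the axes: every combination is one discrete action.
    return np.array(list(itertools.product(*axes)))

# Hypothetical 2-DoF arm with joint angles in [0, 360] and [0, 180] degrees.
actions = discretize_action_space([0.0, 0.0], [360.0, 180.0], bins_per_dim=5)
print(actions.shape)  # (25, 2)
```

With 5 bins per dimension, a 2-dimensional action space gives 25 actions, while a 6-dimensional one would already give 15625.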
Better Approaches to Continuous Control
Deterministic policy network
Updating the Value Network by TD
Updating the Policy Network by DPG
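The two updates named above can be written out as follows. This is a sketch in the standard deterministic policy gradient formulation, not necessarily the exact notation of the original slides: the symbols \pi(s;\theta) for the deterministic policy network, q(s,a;w) for the value network, \gamma for the discount factor, and \alpha, \beta for the learning rates are all assumptions.

```latex
% Assumed notation: deterministic policy a = \pi(s;\theta), value network q(s,a;w),
% discount factor \gamma, learning rates \alpha and \beta.
\begin{aligned}
a'_{t+1} &= \pi(s_{t+1};\theta) \\
y_t &= r_t + \gamma\, q(s_{t+1}, a'_{t+1}; w) \\
\delta_t &= q(s_t, a_t; w) - y_t \\
w &\leftarrow w - \alpha\, \delta_t\, \frac{\partial q(s_t, a_t; w)}{\partial w} \\
g &= \frac{\partial \pi(s_t;\theta)}{\partial \theta}\,
     \left.\frac{\partial q(s_t, a; w)}{\partial a}\right|_{a=\pi(s_t;\theta)} \\
\theta &\leftarrow \theta + \beta\, g
\end{aligned}
```

The value network is updated by gradient descent on the TD error, while the policy network is updated by gradient ascent, since its goal is to pick actions that the value network scores highly.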
Improvement: Using Target Networks
How to improve
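One common way to use target networks is to compute the TD target from slowly moving copies of the policy and value networks, and to let those copies track the online parameters by a weighted average. The sketch below shows only that averaging step, assuming parameters are stored as a dict of NumPy arrays. The function name soft_update and the rate tau are illustrative assumptions, not names from the original post.

```python
import numpy as np

def soft_update(target_params, online_params, tau=0.01):
    """Move each target parameter a fraction tau toward its online counterpart."""
    return {
        name: tau * online_params[name] + (1.0 - tau) * target_params[name]
        for name in target_params
    }

# Toy parameters standing in for network weights.
online = {"w": np.array([1.0, 2.0])}
target = {"w": np.array([0.0, 0.0])}
target = soft_update(target, online, tau=0.1)
print(target["w"])  # [0.1 0.2]
```

A small tau keeps the TD target stable between updates, which is meant to reduce the bias that arises when a network is trained toward targets computed from its own current estimates.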
Stochastic Policy for Continuous Control
Policy Network
Univariate Normal Distribution
Multivariate Normal Distribution
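As an illustration of a stochastic policy built on these distributions, the sketch below assumes a multivariate normal with a diagonal covariance, so the density factorizes into independent univariate normals, one per action dimension. The helper names gaussian_log_prob and sample_action, and the choice to parameterize by log_std, are assumptions made for this example.

```python
import numpy as np

def gaussian_log_prob(action, mean, log_std):
    """Log-density of a diagonal Gaussian, summed over action dimensions."""
    action = np.asarray(action, dtype=float)
    mean = np.asarray(mean, dtype=float)
    log_std = np.asarray(log_std, dtype=float)
    var = np.exp(2.0 * log_std)
    # Per-dimension log N(a_i | mu_i, sigma_i^2).
    per_dim = -0.5 * np.log(2.0 * np.pi) - log_std - (action - mean) ** 2 / (2.0 * var)
    return float(np.sum(per_dim))

def sample_action(mean, log_std, rng):
    """Draw a = mean + std * noise with independent standard normal noise."""
    mean = np.asarray(mean, dtype=float)
    std = np.exp(np.asarray(log_std, dtype=float))
    return mean + std * rng.standard_normal(mean.shape)

rng = np.random.default_rng(0)
mean, log_std = np.array([0.5, -1.0]), np.array([0.0, -0.5])
action = sample_action(mean, log_std, rng)
print(action.shape, gaussian_log_prob(action, mean, log_std))
```

In a full policy network, mean and log_std would be outputs of a neural network evaluated at the current state rather than fixed arrays.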
Function Approximation
Training Policy Network
Auxiliary Network
Policy Gradient Methods