当前位置:网站首页>5、離散控制與連續控制
5、離散控制與連續控制
2022-07-08 01:19:00 【C--G】
Discrete VS Continuous Control
Discrete
Continuous
DQN一個動作一個維度,不能用於連續控制
Policy Network一個動作一個維度,不能用於連續控制
非要用DQN做連續控制,就要將連續空間離散化
Better Approaches to Continuous Control
Deterministic policy network
updating Value Network by TD
Updating Policy Network by DPG
improvement:Using Target Networks
提昇方法
Stochastic Policy for Continuous Control
Policy Network
Univariate Normal Distribution
Multivariate Normal Distribution
Function Approximation
Training Policy Network
Auxiliary Network
Policy Gradient Methods
边栏推荐
- Implementation of adjacency table of SQLite database storage directory structure 2-construction of directory tree
- 4.交叉熵
- 14. Draw network model structure
- Leetcode notes No.7
- String usage in C #
- Four digit nixie tube display multi digit timing
- Leetcode notes No.21
- A speed Limited large file transmission tool for every major network disk
- Frrouting BGP protocol learning
- How to transfer Netease cloud music /qq music to Apple Music
猜你喜欢
133. Clone map
Prediction of the victory or defeat of the League of heroes -- simple KFC Colonel
Application of state mode in JSF source code
How to transfer Netease cloud music /qq music to Apple Music
From starfish OS' continued deflationary consumption of SFO, the value of SFO in the long run
14. Draw network model structure
Common configurations in rectangular coordinate system
Kuntai ch7511b scheme design | ch7511b design EDP to LVDS data | pin to pin replaces ch7511b circuit design
How to write mark down on vscode
130. Zones environnantes
随机推荐
9. Introduction to convolutional neural network
10. CNN applied to handwritten digit recognition
2022-07-07: the original array is a monotonic array with numbers greater than 0 and less than or equal to K. there may be equal numbers in it, and the overall trend is increasing. However, the number
Common configurations in rectangular coordinate system
Design method and application of ag9311maq and ag9311mcq in USB type-C docking station or converter
EDP to LVDS conversion design circuit | EDP to LVDS adapter board circuit | capstone/cs5211 chip circuit schematic reference
Definition and classification of energy
13.模型的保存和载入
Micro rabbit gets a field of API interface JSON
The Ministry of housing and urban rural development officially issued the technical standard for urban information model (CIM) basic platform, which will be implemented from June 1
Ag9310 same function alternative | cs5261 replaces ag9310type-c to HDMI single switch screen alternative | low BOM replaces ag9310 design
4、策略學習
6.Dropout应用
Scheme selection and scheme design of multifunctional docking station for type C to VGA HDMI audio and video launched by ange in Taiwan | scheme selection and scheme explanation of usb-c to VGA HDMI c
Redis 主从复制
11. Recurrent neural network RNN
13.模型的保存和載入
Su embedded training - Day8
Introduction to ML regression analysis of AI zhetianchuan
Parade ps8625 | replace ps8625 | EDP to LVDS screen adapter or screen drive board