当前位置:网站首页>Hong Kong Polytechnic University | data efficient reinforcement learning and adaptive optimal perimeter control of network traffic dynamics
Hong Kong Polytechnic University | data efficient reinforcement learning and adaptive optimal perimeter control of network traffic dynamics
2022-07-03 16:27:00 【Zhiyuan community】
【 title 】Data efficient reinforcement learning and adaptive optimal perimeter control of network traffic dynamics
【 The author team 】C. Chen, Y.P. Huang, W.H.K. Lam, T.L. Pan, S.C. Hsu, A. Sumalee, R.X. Zhong
【 Date of publication 】2022.6.28
【 Thesis link 】https://www.sciencedirect.com/sdfe/reader/pii/S0968090X22001929/pdf
【 Recommended reasons 】 The existing data-driven and feedback flow control strategies do not consider the heterogeneity of real-time data measurement . Besides , Traditional traffic control reinforcement learning (RL) Due to the lack of data efficiency , Usually slow convergence . Moreover, the traditional optimal perimeter control scheme needs to accurately understand the system dynamics , Therefore, they are vulnerable to endogenous uncertainty . In this paper, we propose a holistic reinforcement learning (IRL) To learn macro traffic dynamics , To achieve adaptive optimal perimeter control . The main contribution of this paper is :(a) Continuous time control with discrete gain update is developed , To adapt to discrete-time sensor data .(b) In order to reduce sampling complexity and use available data more effectively , Replay experience (ER) Technology introduction IRL Algorithm .(c) The proposed method is based on “ No model ” The method relaxes the requirements for model calibration , Through data-driven RL The algorithm achieves robustness to modeling uncertainty and improves real-time performance .(d) be based on IRL The convergence of the algorithm and the stability of the controlled traffic dynamics are proved theoretically . The optimal control law is parameterized , Then through neural network (NN) Approaching , This reduces the computational complexity .
边栏推荐
- NFT新的契机,多媒体NFT聚合平台OKALEIDO即将上线
- How to set up SVN server on this machine
- The difference between calling by value and simulating calling by reference
- 中南大学|通过探索理解: 发现具有深度强化学习的可解释特征
- Explore Netease's large-scale automated testing solutions see here see here
- (补)双指针专题
- [web security] - [SQL injection] - error detection injection
- 跟我学企业级flutter项目:简化框架demo参考
- Deep understanding of grouping sets statements in SQL
- How can technology managers quickly improve leadership?
猜你喜欢

一台服务器最大并发 tcp 连接数多少?65535?

First knowledge of database

Slam learning notes - build a complete gazebo multi machine simulation slam from scratch (4)

Low level version of drawing interface (explain each step in detail)

Learn from me about the enterprise flutter project: simplified framework demo reference

深入理解 SQL 中的 Grouping Sets 语句

斑马识别成狗,AI犯错的原因被斯坦福找到了

消息队列消息丢失和消息重复发送的处理策略

From "zero sum game" to "positive sum game", PAAS triggered the third wave of cloud computing

Embedded development: seven reasons to avoid open source software
随机推荐
NFT new opportunity, multimedia NFT aggregation platform okaleido will be launched soon
Netease UI automation test exploration: airtest+poco
【Proteus仿真】74HC595+74LS154驱动显示16X16点阵
Thinking about telecommuting under the background of normalization of epidemic | community essay solicitation
Top k questions of interview
Pyinstaller is not an internal or external command, nor is it a runnable program or batch file
June to - -------
Batch files: list all files in a directory with relative paths - batch files: list all files in a directory with relative paths
Record a jar package conflict resolution process
架构实战营 - 第 6 期 毕业总结
【LeetCode】94. Middle order traversal of binary tree
[statement] about searching sogk1997 and finding many web crawler results
Visual SLAM algorithms: a survey from 2010 to 2016
Interviewer: how does the JVM allocate and recycle off heap memory
记一次jar包冲突解决过程
[proteus simulation] 8 × 8LED dot matrix screen imitates elevator digital scrolling display
用同花顺炒股开户安全吗?
无心剑中译泰戈尔《漂鸟集(1~10)》
Mysql 将逗号隔开的属性字段数据由列转行
NSQ source code installation and operation process