当前位置:网站首页>《强化学习周刊》第51期:PAC、ILQL、RRL&无模型强化学习集成于微电网络格控制:综述与启示
《强化学习周刊》第51期:PAC、ILQL、RRL&无模型强化学习集成于微电网络格控制:综述与启示
2022-06-29 20:40:00 【智源社区】
告诉大家一个好消息,《强化学习周刊》开启“订阅功能”,以后我们会向您自动推送最新版的《强化学习周刊》。订阅方法:
1,注册智源社区账号
2,点击周刊界面左上角的作者栏部分“强化学习周刊”(如下图),进入“强化学习周刊”主页。

3,点击“关注TA”(如下图)

4,您已经完成《强化学习周刊》订阅啦,以后智源社区会自动向您推送最新版的《强化学习周刊》!
论文推荐
。
标题:Offline RL for Natural Language Generation with Implicit Language Q Learning(UC Berkeley:Charlie Snell | 基于隐式语言Q学习的自然语言生成离线RL)
简介:
https://arxiv.org/pdf/2206.11871.pdf
标题:Multi-Access Point Coordination for Next-Gen Wi-Fi Networks Aided by Deep Reinforcement Learning(University of Washington :Hao Yin | 深度强化学习辅助下一代Wi-Fi网络的多接入点协调)
简介:
https://arxiv.org/pdf/2206.11378.pdf
标题:PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning(乔治·华盛顿大学 : Hanhan Zhou | PAC:多智能体强化学习中具有反事实预测的辅助价值因子分解)
简介:
https://arxiv.org/pdf/2206.11420.pdf
标题:Recursive Reinforcement Learning(University of Colorado Boulder : Mateo Perez | 循环强化学习)
简介:
https://arxiv.org/pdf/2206.11430.pdf
标题:Reinforcement Learning under Partial Observability Guided by Learned Environment Models(Silicon Austria Labs (SAL):Edi Muˇskardin | 学习环境模型引导下的部分可观察性强化学习)
简介:
https://arxiv.org/pdf/2206.11708.pdf
标题:Deep Reinforcement Learning-Assisted Federated Learning for Robust Short-term Utility Demand Forecasting in Electricity Wholesale Markets(电子科大 : Chenghao Huang | 电力批发市场短期电力需求预测的深度强化学习辅助联合学习)
简介:
https://arxiv.org/pdf/2206.11715.pdf
标题:AnyMorph: Learning Transferable Polices By Inferring Agent Morphology(卡内基梅隆大学: Brandon Trabucco| ICML 2022:通过推断智能体形态来学习可转移策略)
简介:
https://arxiv.org/pdf/2206.12279.pdf
标题:Multi-Agent Deep Reinforcement Learning for Cost- and Delay-Sensitive Virtual Network Function Placement and Routing(北京邮电大学: Shaoyang Wang|用于成本和延迟敏感的虚拟网络功能放置和路由的多智能体深度强化学习)
简介:
https://arxiv.org/pdf/2206.12146.pdf
标题:Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning(清华大学: Yunfei Li| ICML 2022:稀疏奖励目标条件强化学习的阶段性自我模仿减少)
简介:
https://arxiv.org/pdf/2206.12030.pdf
标题:World Value Functions: Knowledge Representation for Learning and Planning(金山大学: Geraud Nangue Tasse|世界价值函数:学习和规划的知识表示)
简介:
https://arxiv.org/pdf/2206.11940.pdf
标题:Improving de novo molecular design with curriculum learning(查尔姆斯理工大学: Jeff Guo|利用课程式学习改进新型分子设计)
简介:
标题:Brain-inspired meta-reinforcement learning cognitive control in conflictual inhibition decision-making task for artificial agents(SSSUP: Federica Robertazzi|人工智能体冲突抑制决策任务中的脑启发元强化学习认知控制)
简介:
https://www.sciencedirect.com/science/article/pii/S0893608022002350
标题:Energy saving evaluation of an energy efficient data center using a model-free reinforcement learning approach(新加坡国立大学: Muhammad Haiqal Bin Mahbod|采用无模型强化学习方法评估能源效益数据中心的节能情况)
简介:
https://www.sciencedirect.com/sdfe/reader/pii/S0306261922007309/pdf
标题:Gait switching and targeted navigation of microswimmers via deep reinforcement learning(SCU: Zonghao Zou|通过深度强化学习进行微型游泳机器人的步态切换和目标导航)
简介:
https://www.nature.com/articles/s42005-022-00935-x.pdf
标题:Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer(UFRGS : Lucas N. Alegre|ICML 2022: 作为最优策略转移基础的乐观线性支持和后继特征)
简介:
https://arxiv.org/pdf/2206.11326.pdf
标题:无模型强化学习与微电网络控制的融合:综述与启示
简介:
https://arxiv.org/pdf/2206.11398.pdf
边栏推荐
- Real time tracking of bug handling progress of the project through metersphere and dataease
- Hangfire详解
- Several policies of Shenzhen Futian District to support investment attraction in 2022
- Sentinel's quick start takes you through flow control in three minutes
- Mysql Json 数据类型&函数
- Fastadmin background setting radio button
- Analysis of the underlying architecture of spark storage system - spark business environment practice
- [notes] take notes again -- learn by doing Verilog HDL – 014
- Startservice() procedure
- PHP implementation extracts non repeated integers (programming topics can be the fastest familiar functions)
猜你喜欢

Comparable比较器写法&ClassCastExcption类转换异常

如何评价科大讯飞AI翻译笔P20系列,值得买吗?

data link layer

导航【微机原理】

"Operation and maintenance department has Xiao Deng" to review and analyze file and folder access rights

18. `bs object Node name next_ sibling` previous_ Sibling get sibling node

如何审核 Active Directory 用户账户更改?

At least 3 years for learning amplifier?

liunx指令

「运维有小邓」Active Directory批量用户创建
随机推荐
Flume-ng配置
High energy live broadcast, a gathering of celebrities! We invite you to explore bizdevops.
Summary of swift optional values
0/1分数规划专题
mysql中explain语句查询sql是否走索引,extra中的几种类型整理汇总
[today in history] June 29: SGI and MIPS merged; Microsoft acquires PowerPoint developer; News corporation sells MySpace
The reason why the log analysis tool of "operation and maintenance" is used more and more frequently
Win7 easy connect 提示:选路连接失败,可能当前连接网络异常,请稍后重试
At least 3 years for learning amplifier?
Nutch2.1 distributed fetching
Startservice() procedure
0/1 score planning topic
「运维有小邓」Active Directory批量用户创建
go: 如何编写一个正确的udp服务端
Codeforces Global Round 21 C D E
分析影响导电滑环传输信号的因素
LSF-bsub命令
Analysis of the underlying architecture of spark storage system - spark business environment practice
社区访谈丨一个IT新人眼中的JumpServer开源堡垒机
Win10 sets automatic dial-up networking task to realize automatic reconnection after startup and disconnection

