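Reinforcement Learning Weekly, Issue 51: PAC, ILQL, RRL & Integrating Model-Free Reinforcement Learning with Microgrid Control: Review and Implications
2022-06-29 20:40:00 [BAAI Community]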
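Good news: Reinforcement Learning Weekly now offers a subscription feature, so new issues will be pushed to you automatically. To subscribe:
1. Register a BAAI Community account.
2. Click "Reinforcement Learning Weekly" in the author bar at the top left of the issue page (see the figure below) to open the newsletter's home page.

3. Click "Follow" (see the figure below).

4. You are now subscribed, and BAAI Community will automatically push each new issue of Reinforcement Learning Weekly to you.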
Recommended Papers
Title: Offline RL for Natural Language Generation with Implicit Language Q Learning (UC Berkeley: Charlie Snell)
Summary:
https://arxiv.org/pdf/2206.11871.pdf
Title: Multi-Access Point Coordination for Next-Gen Wi-Fi Networks Aided by Deep Reinforcement Learning (University of Washington: Hao Yin)
Summary:
https://arxiv.org/pdf/2206.11378.pdf
Title: PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning (George Washington University: Hanhan Zhou)
Summary:
https://arxiv.org/pdf/2206.11420.pdf
Title: Recursive Reinforcement Learning (University of Colorado Boulder: Mateo Perez)
Summary:
https://arxiv.org/pdf/2206.11430.pdf
Title: Reinforcement Learning under Partial Observability Guided by Learned Environment Models (Silicon Austria Labs (SAL): Edi Muškardin)
Summary:
https://arxiv.org/pdf/2206.11708.pdf
Title: Deep Reinforcement Learning-Assisted Federated Learning for Robust Short-term Utility Demand Forecasting in Electricity Wholesale Markets (University of Electronic Science and Technology of China: Chenghao Huang)
Summary:
https://arxiv.org/pdf/2206.11715.pdf
Title: AnyMorph: Learning Transferable Polices By Inferring Agent Morphology (Carnegie Mellon University: Brandon Trabucco | ICML 2022)
Summary:
https://arxiv.org/pdf/2206.12279.pdf
Title: Multi-Agent Deep Reinforcement Learning for Cost- and Delay-Sensitive Virtual Network Function Placement and Routing (Beijing University of Posts and Telecommunications: Shaoyang Wang)
Summary:
https://arxiv.org/pdf/2206.12146.pdf
Title: Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning (Tsinghua University: Yunfei Li | ICML 2022)
Summary:
https://arxiv.org/pdf/2206.12030.pdf
Title: World Value Functions: Knowledge Representation for Learning and Planning (University of the Witwatersrand: Geraud Nangue Tasse)
Summary:
https://arxiv.org/pdf/2206.11940.pdf
Title: Improving de novo molecular design with curriculum learning (Chalmers University of Technology: Jeff Guo)
Summary:
Title: Brain-inspired meta-reinforcement learning cognitive control in conflictual inhibition decision-making task for artificial agents (SSSUP: Federica Robertazzi)
Summary:
https://www.sciencedirect.com/science/article/pii/S0893608022002350
Title: Energy saving evaluation of an energy efficient data center using a model-free reinforcement learning approach (National University of Singapore: Muhammad Haiqal Bin Mahbod)
Summary:
https://www.sciencedirect.com/sdfe/reader/pii/S0306261922007309/pdf
Title: Gait switching and targeted navigation of microswimmers via deep reinforcement learning (SCU: Zonghao Zou)
Summary:
https://www.nature.com/articles/s42005-022-00935-x.pdf
Title: Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer (UFRGS: Lucas N. Alegre | ICML 2022)
Summary:
https://arxiv.org/pdf/2206.11326.pdf
Title: Integrating Model-Free Reinforcement Learning with Microgrid Control: Review and Implications
Summary:
https://arxiv.org/pdf/2206.11398.pdf

