当前位置:网站首页>University of Calgary | recommendation system based on Reinforcement Learning
University of Calgary | recommendation system based on Reinforcement Learning
2022-06-22 19:54:00 【Zhiyuan community】
【 title 】Reinforcement Learning based Recommender Systems: A Survey
【 The author team 】M. Mehdi Afsar, Trafford Crump, Behrouz Far
【 Date of publication 】2022.6.15
【 Thesis link 】https://dl.acm.org/doi/pdf/10.1145/3543846
【 Recommended reasons 】 Recommendation system (RS) Has become an integral part of daily life . Traditionally , A recommendation question is considered a classification or prediction question , But now it is generally believed that , Expressing it as a sequential decision problem can better reflect the users - System interaction . therefore , It can be expressed as a Markov decision process (MDP) And through reinforcement learning (RL) Algorithm to solve . And traditional recommendation methods ( Including collaborative filtering and content-based filtering ) Different ,RL Able to process sequentially 、 Dynamic user system interaction , And take into account the long-term user participation . This paper introduces a recommendation system based on reinforcement learning (RLRS) The study of . First recognize and explain RLRS Usually it can be divided into based on RL and DRL Methods . then , A four part RLRS frame , I.e. status representation 、 Strategy optimization 、 Reward formulation and environmental construction , And summarize accordingly RLRS Algorithm . This article uses a variety of charts to highlight emerging themes and depict important trends . Last , Important aspects and challenges that can be solved in the future were discussed .
边栏推荐
- 自定义控件AutoScaleMode为Font造成宽度增加的问题
- MySQL多表操作
- Activereports report practical application tutorial (19) -- multi data source binding
- Calendar control programming
- Online generation of placeholder pictures
- 小甲鱼老师《带你学C带你飞》的后续课程补充
- 树和森林的遍历
- Interface development component devaxpress asp Net core v21.2 - UI component enhancements
- MySQL多表操作练习题
- 0816飞达的缺点(改进方向)
猜你喜欢

matplotlib设置坐标轴刻度间隔

如何用银灿IS903主控DIY自己的U盘?(练习BGA焊接的好项目)

ABAQUS 使用RSG绘制插件初体验

树、森林及二叉树的相互转换

Some problem records of openpnp using process

84. (cesium chapter) movement of cesium model on terrain

Openpnp调试 ------ 0816飞达推0402编带

Altium Designer中off grid pin解决方法

0.1----- process of drawing PCB with AD

详解openGauss多线程架构启动过程
随机推荐
C WinForm embedded flash
0816 shortcomings of Feida (improvement direction)
希尔排序
Calendar control programming
如何用银灿IS903主控DIY自己的U盘?(练习BGA焊接的好项目)
MySQL约束
[compréhension approfondie de la technologie tcaplusdb] exploitation et entretien de tcaplusdb - inspection quotidienne des patrouilles
Adapter mode of structural mode
Nlp-d57-nlp competition D26 & skimming questions D13 & reading papers & finding bugs for more than an hour
[deeply understand tcapulusdb technology] how to initialize and launch tcapulusdb machine
1.4-----PCB设计?(电路设计)确定方案
一文带你读懂内存泄露
【深入理解TcaplusDB技术】TcaplusDB运维单据
Human pose estimation
0.0 - how can SolidWorks be uninstalled cleanly?
K8s deploy MySQL
vim中快速缩进用法
使用 qrcodejs2 生成二维码详细API和参数
Solution de pin hors grille dans altium designer
MySQL多表操作