当前位置:网站首页>Cold start problem of recommended system
Cold start problem of recommended system
2022-06-12 07:35:00 【bugmaker.】
The recommendation system needs to predict the user's future behavior and interest according to the user's historical behavior and interest , So a lot of user behavior Data becomes an important part and prerequisite of recommendation system . How to design a personalized recommendation system without a large amount of user data and make users satisfied with the recommendation results so that they are willing to use the recommendation system , It is the problem of cold start . There are three main types of cold start problems , User cold start , Item cold start , System cold start .
User cold start
User cold start mainly solves the problem of how to make personalized recommendation for new users . When new users arrive , We don't have data on his behavior , So it's impossible to predict his interest based on his historical behavior , So I can't make personalized recommendation for him .
Use user registration information
The personalized recommendation process based on registration information is basically as follows :
(1) Get the user's registration information ;
(2) Classify users according to their registration information ;
(3) Recommend to users the items they like in their category .
Choose the right item to start the user's interest
Another way to solve the user cold start problem is when a new user first accesses the recommended system , Don't show the user the tweet immediately Recommendation results , But to provide users with some items , Let users feedback their interest in these items , Then, according to the feedback from users, I will give you
Item cold start
Item cold start mainly solves the problem of how to recommend new items to users who may be interested in it for personalized recommendation . Item cold start is very important in news websites and other websites with strong timeliness , Because there are new people's items in those websites all the time , And every Items must be able to be displayed to users at the first time , Otherwise, after a period of time , The value of the goods is greatly reduced .
about UserCF Algorithm
about UserCF The algorithm needs to solve the problem of the first driving force , Where the first user finds new items . As long as there is a small part People can find and like new things ,UserCF The algorithm can spread these items to more users . The easiest way to solve the first driving force is to randomly display new items to users , But it's obviously not very personal , Therefore, we can consider using the Content information , Put the new item first to users who have liked other items similar to its content .
about ItemCF Algorithm
about ItemCF Algorithm , Cold starting is a serious problem . because ItemCF The principle of the algorithm is to recommend items similar to the items he likes before ,ItemCF Every once in a while, the algorithm uses user behavior to calculate the item similarity table ( It is usually calculated once a day ). therefore , When a new item is added , This item will not exist in the item related table in memory , If new items are not displayed to users , Users can't act on it , thus ItemCF Unable to recommend new products . So , We can only use the content information of the item to calculate the item related table , And update related tables frequently .
System cold start
System cold start mainly solves how to build a new website ( No users yet , It doesn't work Household behavior , There's only some information about the items ) Design personalized recommendation system , Thus, when the website is just released, the user body Experience the problem of personalized recommendation service .
Play an expert role
A lot of recommendation systems are built , There is no user behavior data , There is not enough content information to calculate the exact similarity of items . that , In order to let users get a better experience when the recommendation system is established , Many systems use expert tagging .
as everyone knows , It is difficult to calculate the similarity between music . First , Music is multimedia , If we calculate the similarity between songs from the audio analysis , The technical threshold is very high , And it is difficult to calculate satisfactorily . secondly , Just use the album of songs 、 It is difficult to obtain a satisfactory song similarity table from attribute information such as singers , Because a singer 、 An album often has only oneortwo good songs . To solve this problem ,Pandora hire
A group of computer literate musicians were used to carry out a project called music gene . They listened to the songs of tens of thousands of singers , And annotate each dimension of these songs . Final , They used 400 Multiple features Pandora Call these traits genes ). After marking all the songs , Each song can be expressed as a 400 Dimension vector , Then the similarity of songs can be calculated by the common vector similarity algorithm .
Jinni In the film genetic engineering, semi artificial 、 Semi automatic mode . First , It allows experts to mark films , Every movie has about 50 A gene , These genes come from about 1000 A gene bank . then , After experts mark certain samples ,Jinni Be able to use natural language understanding and machine learning technology , By analyzing the user's comments on the movie and some content attributes of the movie ( Especially new movies ) Make your own mark . All in all ,Jinn The cold start problem of the system is solved by combining expert and machine learning .
边栏推荐
- There is no solid line connection between many devices in Proteus circuit simulation design diagram. How are they realized?
- Detailed explanation of coordinate tracking of TF2 operation in ROS (example + code)
- Velocity autocorrelation function lammps v.s MATALB
- Node, topic, parameter renaming and global, relative and private namespaces in ROS (example + code)
- Set up a remote Jupiter notebook
- Chapter V - message authentication and digital signature
- [college entrance examination] prospective college students look at it, choose the direction and future, and grasp it by themselves
- ROS dynamic parameter configuration: use of dynparam command line tool (example + code)
- The first demand in my life - batch uploading of Excel data to the database
- The function of C language string Terminator
猜你喜欢

Stm32cubemx learning (I) USB HID bidirectional communication

Missing getting in online continuous learning with neuron calibration thesis analysis + code reading

Dynamic coordinate transformation in ROS (dynamic parameter adjustment + dynamic coordinate transformation)

Modelarts培训任务1

Complete set of typescript Basics

Chapter 4 - key management and distribution

Chapter 3 - Fundamentals of cryptography

Test left shift real introduction

Detailed explanation of multi coordinate transformation in ROS (example + code)

Voice assistant - Introduction and interaction process
随机推荐
Voice assistant - Multi round conversation (theory and concept)
Day 6 of pyhon
LED lighting experiment with simulation software proteus
Velocity autocorrelation function lammps v.s MATALB
[wax chain tour] release a free and open source alien worlds script TLM
Vs2019 MFC IP address control control inherits cipaddressctrl class redrawing
Thoroughly understand the "rotation matrix / Euler angle / quaternion" and let you experience the beauty of three-dimensional rotation
R语言glm函数构建泊松回归模型(possion)、epiDisplay包的poisgof函数对拟合的泊松回归模型进行拟合优度检验、即模型拟合的效果、验证模型是否有过度离散overdispersion
BI技巧丨当月期初
ROS dynamic parameter configuration: use of dynparam command line tool (example + code)
Knife4j first use
RT thread studio learning (I) new project
Leetcode34. find the first and last positions of elements in a sorted array
Detailed explanation of 8086/8088 system bus (sequence analysis + bus related knowledge)
鸿蒙os-第一次培训
8086/8088 instruction execution pipeline disconnection reason
C language queue implementation
Learning to continuously learn paper notes + code interpretation
2022电工(初级)考试题库及模拟考试
Chapter V - message authentication and digital signature