当前位置:网站首页>[paper reading] kgat: knowledge graph attention network for recommendation
[paper reading] kgat: knowledge graph attention network for recommendation
2022-07-05 10:19:00 【Let me be quiet for a while】
original text :https://arxiv.org/pdf/1905.07854.pdf
Code :https://github.com/xiangwang1223/knowledge_graph_attention_network
What: To provide more accuracy 、 Diverse and interpretable suggestions , Must go beyond the user - Modeling of object interaction , And consider Additional information ( such as KG). In this paper , We studied Knowledge map (KG) The utility of , It does this by associating items with their properties , To break the Independent interaction hypothesis .( By way of KG and user-item graph Combine , Capture high-dimensional connections )【 High dimensional connection :which connect two items with one or multiple linked attributes】
Why: Traditional approach , Such as factoring machine (FM) Think of it as a supervised learning problem , It assumes that each interaction is an independent instance , And encode the edge information . Because the relationship between instances or objects is ignored ( for example , The director of one film is also the actor of another film ), These methods are not enough to extract from the collective behavior of users Cooperative signal .(collaborative signal)
How: KGAT Recursively from the node's neighbors ( It can be a user 、 Item or attribute ) Propagate embed and aggregate , To refine the embedding of nodes , And use the attention mechanism to distinguish the importance of neighbors .
Result
Conclusion
This article emphasizes the importance of collaboration signals , And it aims to surpass users - Modeling object interaction , So consider adding additional information ; By way of 【 Knowledge map 】 and 【 user - Product interaction diagram 】 Combine , obtain 【 Collaborative knowledge map CKG】, To capture high-dimensional connections 、 Cooperative signal . The way to do it is , Through multi-layer embedding aggregation including attention mechanism, it comes from a node neighbor ( It can be user、item、attribute), To update 、 Optimize embedded representation .
Model
The author begins with an example , Explain adding KG Then capture High dimensional connection 、 Cooperative signal Importance .
Collaborative filtering method , Focus on finding similar users ( Interact with a item), So by i1,u1 Can find u4 and u5; Monitor learning methods , Emphasize looking for similar items ( Have the same properties ), So by e1,i1 find i2.
The above method cannot be found u1 and u2、u3 of , Unable to find ,u1 and i3、i4 of .
1. CKG Embedding Layer
Energy fraction of triples (TransR):( The lower the score, the higher the confidence 【 There is more relationship between two entities 】) among ,
Optimize through pairwise sorting losses :
among ,
By randomly replacing real relationships , Training .( Equivalent to negative sampling )
【 The purpose of this layer is to learn optimization Entity 、 Embedded representation of relationships 】 It is better than direct random embedded representation .
2. Attentive Embedding Propagation Layers
This layer is based on GCN Recursive transfer embedded representation ; according to GAT Get the embedded representation of the joint level , Emphasize the importance of this high-dimensional connection .
Each layer contains 3 A component :information propagation, knowledge-aware attention, and information aggregation( Information dissemination 、 Knowledge perception, attention mechanism and information aggregation )
Information dissemination
eh Self aggregation (ego-network):
Spread information before entities and their neighbors .
Aggregate entities with their neighbors , Through the linear attenuation coefficient .
Knowledge perception attention mechanism
The attenuation coefficient is :
Make in relation r In the space of , The fraction of the attenuation coefficient obtained , From entities h And entities t The relationship between , For the closer entities , The larger the attenuation coefficient obtained , stay 【 Information dissemination 】 in , The more information you keep .
For the sake of simplicity , Use the inner product to get the fraction of the attenuation coefficient .
Re pass softmax Normalize all scores :
From this we can get , Capturing cooperative signals 、 When aggregating entity neighbors , Which entities are assigned higher scores .
In the above formula , There is more than just entity representation , There is also relationship . In order to get more information in communication , Not only the node representation as input , Also get entities h And entities t Relationship representation between .
Information aggregation
This floor is a 【eh】 and 【eh Self aggregation (ego-network)】 polymerization , Get a new entity h It means .
There are many ways :
High dimensional propagation
The above process is a complete process of one-layer aggregation , You can stack multiple levels of results , Capture high-dimensional connections and higher-order neighbor :
Each level eh Express , adopt 【 On the upper floor eh Express 】 and 【eh Self expression 】 obtain .
3. Prediction Layer
Go through the above steps , You can get user representation and item representation after multi-layer aggregation , Connect multiple representations to form a user representation and an item representation :
thus , It enriches user representation and item representation .
Reference resources :
- KGAT:Knowledge Graph Attention Network for Recommendation:https://hikg.net/archives/123/
边栏推荐
- Uni app running to wechat development tool cannot Preview
- MySQL character type learning notes
- How to judge that the thread pool has completed all tasks?
- 各位大佬,我测试起了3条线程同时往3个mysql表中写入,每条线程分别写入100000条数据,用了f
- 自动化规范检查软件如何发展而来?
- 请问postgresql cdc 怎么设置单独的增量模式呀,debezium.snapshot.mo
- 把欧拉的创新带向世界 SUSE 要做那个引路人
- Coffeescript Chinese character to pinyin code
- Design of stepping motor controller based on single chip microcomputer (forward rotation and reverse rotation indicator gear)
- 【小技巧】獲取matlab中cdfplot函數的x軸,y軸的數值
猜你喜欢
Cerebral cortex: directed brain connection recognition widespread functional network abnormalities in Parkinson's disease
学习笔记4--高精度地图关键技术(下)
Apple 5g chip research and development failure? It's too early to get rid of Qualcomm
Today in history: the first e-book came out; The inventor of magnetic stripe card was born; The pioneer of handheld computer was born
What is the origin of the domain knowledge network that drives the new idea of manufacturing industry upgrading?
把欧拉的创新带向世界 SUSE 要做那个引路人
WorkManager的学习二
驱动制造业产业升级新思路的领域知识网络,什么来头?
最全是一次I2C总结
[论文阅读] KGAT: Knowledge Graph Attention Network for Recommendation
随机推荐
AtCoder Beginner Contest 254「E bfs」「F st表维护差分数组gcd」
AtCoder Beginner Contest 258「ABCDEFG」
Zblogphp breadcrumb navigation code
Click the picture in the mobile browser and the picture will not pop up
Energy momentum: how to achieve carbon neutralization in the power industry?
php解决redis的缓存雪崩,缓存穿透,缓存击穿的问题
能源势动:电力行业的碳中和该如何实现?
Cut off 20% of Imagenet data volume, and the performance of the model will not decline! Meta Stanford et al. Proposed a new method, using knowledge distillation to slim down the data set
MySQL数字类型学习笔记
How did automated specification inspection software develop?
ByteDance Interviewer: how to calculate the memory size occupied by a picture
isEmpty 和 isBlank 的用法区别
Activity jump encapsulation
最全是一次I2C总结
Detailed explanation of the use of staticlayout
天龙八部TLBB系列 - 关于技能冷却和攻击范围数量的问题
高级 OpenCV:BGR 像素强度图
Atcoder beginer contest 254 "e BFS" f st table maintenance differential array GCD "
Window下线程与线程同步总结
驱动制造业产业升级新思路的领域知识网络,什么来头?