当前位置:网站首页>Raki's notes on reading paper: neighborhood matching network for entity alignment
Raki's notes on reading paper: neighborhood matching network for entity alignment
2022-06-30 02:39:00 【Sleepy Raki】
Abstract & Introduction & Related Work
If you are not a knowledge map player , Please read the popular science of entity alignment first : Introduction to entity alignment
Research tasks
Entity alignedExisting methods and related work
- embedding-based methods
Facing the challenge
- Entity alignment is not easy , Because the knowledge map in real life is usually incomplete , And different knowledge maps usually have heterogeneous patterns . therefore , Equivalent entities from two knowledge maps may have different surface forms or different adjacency structures .
- The problem of different adjacency relations between equal entities is ubiquitous
- The difference of neighborhood size and topology brings great challenges to entity alignment methods

Innovative ideas
- The map sampling method extracts a neighborhood with identifiability for each entity
- Cross graph neighborhood matching module , Joint coding of neighborhood differences for a given entity pair
The experimental conclusion
sota
Our Approach
The two graphs are based on a pre aligned set of peer entities , The goal is to find the peer entity pairs in the two graphs 
Overview of NMN
- NMN The first use of GCNs Modeling the neighborhood topology information
- Neighborhood sampling is used to select neighbors with more information
- It uses a cross map matching module to capture the differences between neighbors

KG Structure Embedding
First use GCN To aggregate higher-order entity neighbor structure information , Use pre training word embedding to initialize GCN
Enter two graphs as a large graph into GCN Inside , Every GCN The layer takes a set of node features as input , And update the node representation to :
h i l h_i^{l} hil It's No L Node output characteristics of layer 
In order to control the accumulated noise , stay GCN A high-speed network is used between layers to effectively control noise propagation 
Neighborhood Sampling
One hop neighbor of an entity is the key to determine whether the entity should be aligned with other entities , But not all one hop neighbors contribute positively to entity alignment , Therefore, a down sampling process is used to select a neighbor with the largest amount of Central entity information
GCN The learned entities are embedded with rich context information ,for Neighborhood structure and entity semantics , The more context sensitive the central entity, the easier it is to sample

In essence, a discriminant neighborhood subgraph is constructed for each entity , This can achieve more accurate alignment through neighborhood matching 
Neighborhood Matching
The neighborhood subgraph generated by the sampling process determines which neighborhoods of the target entity should be considered in the later stage . let me put it another way ,NMN Handle pipeline The later stage of the will only operate on the neighbors in the subgraph . In the neighborhood matching stage , We want to be the corresponding KG For each candidate entity, find out which neighborhood of the entity is closely related to a neighborhood node in the subgraph of the target entity . This information is important for determining two entities ( From two KG) Whether it should be aligned is crucial 
Candidate selection
In order to reduce the computational overhead ,NMN First of all to E 1 E_1 E1 Medium e i e_i ei Sample an alignment candidate set C i = { c i 1 , c i 2 , . . . , c i t ∣ c i k ∈ E 2 } C_i = \{c_{i1}, c_{i2}, ..., c_{it} |c_{ik}∈E_2\} Ci={ ci1,ci2,...,cit∣cik∈E2}, And then calculate e i e_i ei And the subgraph similarity of these candidate sets . This is based on E 2 E_2 E2 In the embedded space e i e_i ei Closer entities are more likely to be associated with e i e_i ei Aligned observations . therefore , about E 2 E_2 E2 One of the entities in e j e_j ej, It is sampled as e i e_i ei The probability of a candidate can be calculated as 
Cross-graph neighborhood matching
p and q They are given E1 and E2 The neighbors of the nodes in the graph , Calculate an attention 
And then put them together 
For each target neighbor in the neighborhood subgraph , The attention mechanism in the matching module can accurately detect another KG Which neighbor in the subgraph of is most likely to match the target neighbor . Intuitively speaking , Matching vectors mp Captured the difference between the two closest neighbors . When the representations of two neighbors are similar , Matching vectors tend to be zero vectors , So their characterization remains similar . When the representations of neighbors are different , The matching vector will be amplified by propagation
Neighborhood Aggregation

My question is , Can you explain each parameter clearly ??? W g a t e W_{gate} Wgate What did you say ?
Experimental Setup


Experimental Results




Conclusion
NMN It's solved KG The ubiquitous neighborhood heterogeneity problem in . We achieve this goal by using a new sampling based method to select the most informative neighbor for each entity
NMN By considering topological structure and neighborhood similarity , Estimate the similarity of two entities at the same time . We have conducted extensive experiments on real-world data sets , And will NMN With the latest 12 Two embedding based methods are compared . Experimental results show that ,NMN Got the best 、 More robust performance , In different data sets and evaluation indicators, it is better than the competitive method
Remark
Can you make the symbol clear ?trainable Let's at least say the parameters 
Reading a little wordless paper
边栏推荐
- 堆排序
- CA数字证书包含哪些文件?如何查看SSL证书信息?
- PMP考生如何应对新考纲?看过来!
- 重看《Redis设计与实现》后记录几个要点
- 什么是自签名证书?自签名SSL证书的优缺点?
- Pytorch学习(二)
- 如何使用SMS向客户传递服务信息?指南在这里!
- Summary of knowledge points about eigenvalues and eigenvectors of matrices in Chapter 5 of Linear Algebra (Jeff's self perception)
- JS advanced -es6 syntax
- threejs 镜子案例Reflector 创建镜子+房子搭建+小球移动
猜你喜欢

如何预防钓鱼邮件?S/MIME邮件证书来支招

微信小程序页面跳转以及参数传递

福利抽奖 | 开源企业级监控Zabbix6.0都有哪些亮点

Jupyter notebook displays a collection of K-line graphs

Digicert、Sectigo、Globalsign代码签名证书的区别

什么是X.509证书?X.509证书工作原理及应用?

CMake教程系列-02-使用cmake代碼生成二進制

IBM websphere通道联通搭建和测试

Seven common errors of SSL certificate and their solutions

Merge sort
随机推荐
What is a dangling pointer- What is a dangling pointer?
如何使用SMS向客户传递服务信息?指南在这里!
2.8 【 weight of complete binary tree 】
What about punctuation in the first column of unity text
Two methods of SSL certificate format conversion
What should academic presentation /ppt do?
Global and Chinese markets of liquid optical waveguides 2022-2028: Research Report on technology, participants, trends, market size and share
IBM websphere通道联通搭建和测试
PR second training notes
什么是自签名证书?自签名SSL证书的优缺点?
Créer des compétences exquises dans l'éducation des créateurs
CMake教程系列-02-使用cmake代码生成二进制
day33
Pytoch learning (II)
NPDP产品经理国际认证考试报名有什么要求?
IBM WebSphere channel connectivity setup and testing
PMP考生如何应对新考纲?看过来!
What is the difference between a layer 3 switch and a layer 2 switch
FDA邮件安全解决方案
Le Code autojs peut - il être chiffré? Oui, présentation des techniques de chiffrement autojs