Deep learning: GAT
2022-07-27 18:21:00 【sky_ Zhe】
Original paper
Detailed explanation: E:\ The group will report \2021.11.5 report -GAT\20211105 draft
See also: Detailed introduction to gac and its improvement process
Source: improvements on GCN
Attention mechanism
The attention mechanism in deep learning is essentially similar to the selective visual attention of humans: its core goal is to pick out, from a large amount of information, the information that is most critical to the current task.
Limitations of GCN
- It is difficult to assign different weights to different neighbors.
This limits the model's ability to capture the relevance of spatial information, and it is the root cause of GCN being inferior to GAT.
In GAT, each node in the graph can assign different weights to its adjacent nodes based on their features.
- The way GCN combines the features of adjacent nodes depends on the structure of the graph.
This limits how well the trained model generalizes to other graph structures.
In summary, the limitations of GCN are that it is hard to handle dynamic graphs and hard to assign different weights to different neighbors. GAT is in fact an improvement within the GNN family: it is similar to GCN, except that it is a graph model based on self-attention.
Difference
The core difference lies in how the feature representations of distance-1 neighbor nodes are collected and aggregated.
The graph attention model GAT replaces the fixed normalization operation in GCN with an attention mechanism.
Essentially, GAT just replaces the normalization function of the original GCN with a neighbor feature aggregation function that uses attention weights.
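To make the "fixed normalization" concrete, here is a minimal sketch of the weights a GCN layer puts on each edge, assuming the usual symmetric normalization 1/sqrt(d_i * d_j) with self-loops. The function name `gcn_weights` and the toy graph are hypothetical and only serve as an illustration.

```python
import math

def gcn_weights(neighbors):
    """Return the fixed GCN weight 1/sqrt(d_i * d_j) for every edge (i, j).

    `neighbors` maps each node to its neighbor list. A self-loop is added
    to every node before the degrees are counted.
    """
    with_self = {i: set(ns) | {i} for i, ns in neighbors.items()}
    degree = {i: len(ns) for i, ns in with_self.items()}
    return {
        (i, j): 1.0 / math.sqrt(degree[i] * degree[j])
        for i, ns in with_self.items()
        for j in sorted(ns)
    }

# Toy undirected graph (hypothetical): node 0 is connected to nodes 1 and 2.
toy_graph = {0: [1, 2], 1: [0], 2: [0]}
weights = gcn_weights(toy_graph)
print(weights[(0, 1)], weights[(0, 2)])
```

The weights depend only on node degrees, never on node features, so two neighbors with the same degree always get the same weight. This is the part that GAT swaps for learned attention coefficients.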
Purpose
Like GCN, GAT is a feature extractor: for N nodes, it predicts and outputs new node features from the input node features.
GAT characteristics
- 1 Each node in the graph can assign different weights to its adjacent nodes based on their features.
- 2 It depends only on adjacent nodes, that is, nodes that share an edge, and does not need information about the whole graph:
(1) The graph does not need to be undirected (if the edge j → i does not exist, we can simply skip the computation of α_ij);
(2) It makes the technique directly applicable to inductive learning, including tasks where the model is evaluated on graphs that are completely unseen during training.
The basic idea
Update each node's representation according to the attention it pays to its adjacent nodes.
The process
This paper: assign corresponding weights to different adjacent nodes, using a multi-head attention structure.
Calculate the attention coefficient
For a vertex i, compute the similarity coefficient between i and each of its neighbors, one by one:
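In the notation of the GAT paper, where h_i is the input feature vector of node i and N_i is its set of neighbors, this coefficient can be written as below. The symbol g for the scoring function follows the wording used here; the paper itself denotes it a.

```latex
e_{ij} = g\left(\left[\, W h_i \,\|\, W h_j \,\right]\right), \qquad j \in \mathcal{N}_i
```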
That is, a shared parameter W first raises the dimension of the vertex features, the two transformed features are then concatenated, and finally a mapping function g(·) maps the concatenated high-dimensional feature to a single real number, the attention score.
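The three steps just described (shared linear map, concatenation, mapping to a real number) can be sketched as follows. This assumes the choice made in the GAT paper for the mapping function: a dot product with a learnable vector followed by LeakyReLU with negative slope 0.2. All names and the toy values are hypothetical.

```python
def matvec(W, h):
    """Multiply matrix W (a list of rows) by vector h."""
    return [sum(w * x for w, x in zip(row, h)) for row in W]

def leaky_relu(x, negative_slope=0.2):
    return x if x > 0 else negative_slope * x

def attention_score(W, a, h_i, h_j):
    """Unnormalized attention score of neighbor j for node i."""
    # Shared linear map on both nodes, then concatenation of the results.
    z = matvec(W, h_i) + matvec(W, h_j)
    # Map the concatenated feature to one real number.
    return leaky_relu(sum(a_k * z_k for a_k, z_k in zip(a, z)))

# Toy values (hypothetical): W lifts 2-d features to 3-d.
W = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
a = [0.5, -0.5, 1.0, 0.5, 0.5, -1.0]
h_i = [1.0, 2.0]
h_j = [0.0, 1.0]
score = attention_score(W, a, h_i, h_j)
print(score)
```

Because the two transformed features are concatenated rather than summed, the score is not symmetric: swapping `h_i` and `h_j` generally gives a different value.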
Applying softmax over the neighbors of i then gives the (learned) relationship coefficient between nodes:
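With e_ij denoting the unnormalized score of neighbor j for node i and N_i the neighbor set of i (notation assumed from the GAT paper), the softmax normalization can be written as:

```latex
\alpha_{ij} = \operatorname{softmax}_j(e_{ij})
            = \frac{\exp(e_{ij})}{\sum_{k \in \mathcal{N}_i} \exp(e_{ik})}
```

The coefficients α_ij of one node therefore sum to 1 over its neighbors.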
Weighted-sum aggregation
Generally speaking, aggregation takes a weighted sum of the features passed in from the neighbors, which updates the feature of the node itself:
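Written out with α_ij as the normalized attention coefficients, W as the shared weight matrix, h_j as the input feature of neighbor j, and σ as a nonlinearity (notation assumed from the GAT paper), the update reads:

```latex
h_i' = \sigma\left( \sum_{j \in \mathcal{N}_i} \alpha_{ij} \, W h_j \right)
```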
This paper strengthens the aggregation step: it uses K attention mechanisms, that is, K different neighbor weightings, to update the feature of the node. This is the multi-head idea from attention:
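A minimal pure-Python sketch of the whole procedure with K heads is given below. It assumes the choices made in the GAT paper: LeakyReLU scoring, ELU as the nonlinearity, and concatenation of the head outputs (the paper averages them instead in the final layer). The function names, the toy graph, and the parameter values are hypothetical, and the parameters are fixed by hand rather than learned.

```python
import math

def matvec(W, h):
    """Multiply matrix W (a list of rows) by vector h."""
    return [sum(w * x for w, x in zip(row, h)) for row in W]

def leaky_relu(x, negative_slope=0.2):
    return x if x > 0 else negative_slope * x

def elu(x):
    return x if x > 0 else math.exp(x) - 1.0

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def gat_head(features, neighbors, W, a):
    """One attention head: return the updated feature of every node."""
    z = {i: matvec(W, h) for i, h in features.items()}
    out = {}
    for i, nbrs in neighbors.items():
        # Unnormalized scores e_ij, then softmax over the neighbors of i.
        scores = [
            leaky_relu(sum(a_k * x for a_k, x in zip(a, z[i] + z[j])))
            for j in nbrs
        ]
        alpha = softmax(scores)
        # Attention-weighted sum of the transformed neighbor features.
        agg = [
            sum(al * z[j][d] for al, j in zip(alpha, nbrs))
            for d in range(len(z[i]))
        ]
        out[i] = [elu(v) for v in agg]
    return out

def multi_head_gat(features, neighbors, heads):
    """Run K independent heads and concatenate their outputs per node."""
    per_head = [gat_head(features, neighbors, W, a) for W, a in heads]
    return {i: [v for head in per_head for v in head[i]] for i in features}

# Toy graph (hypothetical); each neighbor list includes the node itself.
features = {0: [1.0, 0.0], 1: [0.0, 1.0], 2: [1.0, 1.0]}
neighbors = {0: [0, 1, 2], 1: [0, 1], 2: [0, 2]}
heads = [
    ([[1.0, 0.0], [0.0, 1.0]], [0.3, -0.1, 0.2, 0.4]),
    ([[0.5, 0.5], [1.0, -1.0]], [-0.2, 0.1, 0.6, 0.0]),
]
out = multi_head_gat(features, neighbors, heads)
print({i: len(v) for i, v in out.items()})
```

With K = 2 heads that each produce 2 output features, every node ends up with a 4-dimensional representation. A head whose scoring vector is all zeros gives uniform attention, so it reduces to a plain mean over the neighbors.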
Model comparison
Case study: the Cora paper dataset
GCN-dgl
Code location: E:\ Project routines \GCN\ The paper \ Code location
Data presentation

Result

Official website:
Code: E:\ Project routines \GCN\ The paper \gcn_dgl_ Official website
GAT-dgl
Code:
Result
