当前位置:网站首页>Deep learning: gat
Deep learning: gat
2022-07-27 18:21:00 【sky_ Zhe】
Here's the catalog title
Original thesis
Explain in detail :E:\ The group will report \2021.11.5 report -GAT\20211105 draft
May refer to : Detailed introduction gac And improvement process
source : about GCN Improvement
Attention mechanism
The attention mechanism in deep learning is essentially similar to the selective visual attention mechanism of human beings , The core goal is to select the information that is more critical to the current task goal from a large number of information .
GCN limited
- It is difficult to assign different weights to different neighbor.
This limits the ability of the model to capture the relevance of spatial information , This is also inferior to GAT Root cause of .
stay GAT in , Each node in the graph can be based on the characteristics of adjacent nodes , Assign different weights to it
- GCN The way of combining the characteristics of adjacent nodes is related to the structure of the graph
This limits the generalization ability of the trained model on other graph structures .
GCN The limitations are : Hard to handle dynamic graphs ; It is difficult to assign different weights to different neighbor. Actually GAT yes GNN Improvement , And GCN similar , It's just based on self-attention The graph model of .
difference
The core difference lies in how to collect and accumulate distance 1 Characteristic representation of neighbor nodes of .
Figure attention model GAT Replace with attention mechanism GCN Fixed standardized operation in .
Essentially ,GAT Just put the original GCN The standardized function of is replaced by the neighbor node feature aggregation function using attention weight .
Purpose
GAT And GCN It is also a feature extractor , Aiming at N Nodes , Predict and output the characteristics of new nodes according to the input node characteristics .
GAT characteristic
- 1 Each node in the graph can be based on the characteristics of adjacent nodes , Assign different weights to it
- 2 Only related to adjacent nodes , That is, the nodes of the shared edge , There is no need to get the information of the whole picture :
(1) The graph does not need to be undirected ( If the edge j → i non-existent , We can simply omit the calculation α i j;
(2) It makes our technology directly applicable to inductive learning—— Include the task of evaluating models on graphics that are completely invisible during training .
The basic idea
According to the position of each node on its adjacent nodes attention, To update the node representation .
The process
this paper : Assign corresponding weights to different adjacent nodes , multi-head Bulls Attention structure , Calculate the attention coefficient
For vertices i , Calculate its neighbors and... One by one i The similarity coefficient between :
That is, first use the shared parameters W Add dimension to vertices , Post stitching (concatenate) Two characteristics , By mapping functions g(.) Map high-dimensional features to a Attention In real numbers .
Through to i Our neighbors softmax, You can get ( Learning out ) The relationship coefficient between nodes :
To sum by weight aggregate
Generally speaking , The aggregation method generally weights and sums the features transmitted by neighbors , You can update the characteristics of this node :
In this article, the way of collection is enhanced , use K An attention mechanism , That's it K A neighbor weighting method , To update the characteristics of this node . namely Attention Medium multi-head thought :
Model comparison
Case study : The paper cora
GCN-dgl
Code location :E:\ Project routines \GCN\ The paper \ Code location
Data presentation

result

Official website :
Code :E:\ Project routines \GCN\ The paper \gcn_dgl_ Official website 
GAT-dgl
Code :
result

边栏推荐
- ts学习笔记-interface
- 1. opencv图片基础操作
- How to solve the error of ora-00955 when Oracle modifies the primary key
- 深度识别:论文阅读_2S-AGCN CVPR2019(基于骨架的动作识别的两流自适应图卷积网络)
- 施耐德电气、欧莱雅等企业巨头如何开放式创新?DEMO WORLD世界创新峰会揭秘
- On recommend mechanism in distributed training
- "Who is Huawei" documentary film series landing on BBC: exposing a large number of Ren Zhengfei's unknown experience
- Backup table recovery table
- 深度学习:GAN优化方法-DCGAN案例
- Salesforce runs all test classes and gets coverage reports
猜你喜欢

2. 改变颜色空间及颜色检测

Introduction to ef framework

数据库的常用命令2

NowCoder(5选)

浅论分布式训练中的recompute机制

1. opencv图片基础操作

Three consecutive high-frequency interview questions of redis online celebrity: cache penetration? Cache breakdown? Cache avalanche?

Class not found: “com.parkManagement.dao.DaoTest 测试找不到测试类

WPF makes login interface

快速获取网站媒体资源方法
随机推荐
Publish your own NPM component library
【学习笔记】MySQL数据库高级版 - 索引优化、慢查询、锁机制等
Class not found: “com.parkManagement.dao.DaoTest 测试找不到测试类
What every Salesforce developer should know about Dates and Times in Apex
力压谷歌、英伟达!阿里含光800芯片再获权威测试世界第一
Bug records using distributed framework WCF
小米CC9 Pro拆解:后置五摄成本是骁龙855数倍!
深度学习:GAN优化方法-DCGAN案例
贴牌“美国制造”,国产安防设备竟被装上了美航母!
深度学习:GAT
zabbix6.0的安装部署
深度学习:GCN案例
类的六大关系——依赖和关联的区别
展锐鲜苗:赋能全场景应用,海量数据需要AI与IoT融合
What's the use of games| Game application value research case collection
The end of another era!
1542. 找出最长的超赞子字符串 哈希+状态压缩
GIS数据漫谈(五)— 地理坐标系统
org.apache.catalina.core.StandardContext.startInternal Context [] startup failed due to previous err
【云图说】 第249期 移动应用安全服务—App的体检中心,全面检测,安全上路!