当前位置:网站首页>Paper notes: e (n) equivariant graph neural networks
Paper notes: e (n) equivariant graph neural networks
2022-06-29 16:34:00 【UQI-LIUWJ】
0 Introduce
This paper introduces a new model to learn and rotate 、 translation 、 A graph neural network with equivariant reflection and arrangement , be called E(n)- Equivariant graph neural network (EGNN).
Compared with the existing method ,EGNN There is no need to compute expensive high-order representations in the middle layer , While still achieving competitive or better performance . Besides , Although the existing methods are limited to 3 Equivariant of dimensional space , but EGNN It is easy to expand to higher dimensional space .
1 introduction
Although deep learning has largely replaced handmade features , But much progress has been made It depends on the inductive bias in the deep neural network .(inductive bias)
An effective way to limit neural network to a function related to a problem is , Use the symmetry of the problem 、 Transformation equivariant (equivariance), Simplify the calculation of the current problem by studying a symmetric group .【eg,CNN The convolution of is equivariant 、 Pooling is approximately invariant ;GNN The order of the points of is equivariant ( The arrangement of different points corresponds to different adjacency matrices , But in the end this one graph The message is the same )】
Many problems show that 3D Translational and rotational symmetry . The set of these symmetric operations is written as SE(3) , If reflection is included , Then the set is written as E(3). It is usually desirable to predict these tasks relative to E(3) Transformation is equivariant or invariant .、
lately , Equal changes have been proposed E(3) or SE(3) Various forms and methods of . Many of these works have achieved innovation in the study of high-order representation types in the intermediate network layer . However , The conversion of these higher-order representations requires the computation of costly coefficients or approximations . Besides , In practice , For many types of data , Input and output are limited to scalar values ( Such as temperature or energy , It is called in the literature type-0) and 3d vector ( Such as velocity or momentum , It is called in the literature type-1).
This article paper A new architecture is proposed , It is translation 、 Rotation and reflection etc (E(n)), And the permutation of the input point set . The model is simpler than the previous method , At the same time, the equivariant in the model is not limited to 3 Dimensional space , And it can be extended to a larger dimensional space , Without significantly increasing the amount of computation .
2 Background knowledge
2.1 Equivariant
Definition :
In layman's terms , Pan first / rotate / Permutation remapping , And mapping before translating / rotate / array The effect is the same
2.2 GNN
GNN note : Message propagation model _UQI-LIUWJ The blog of -CSDN Blog
3 EGNN
- Consider a graph
, among
, - The characteristics of each point embedding yes
( and 2.2 Of GNN equally )[ This nf refer to node feature, Not at all n ride f]【 Physical information without direction , Scalar 】 - But on this basis ,EGNN Add one for each point n Coordinates of dimensions
【 Geometric information with direction , vector 】
GNN Will be maintained with these coordinates xi Equivariant of rotation and translation , And it will be linked to GNN The same way as the node set V The equivariant of the permutation .

Use the formula to express the l layer EGNN,Equivariant Graph Convolutional Layer (EGCL) Yes :
And traditional GNN Different places are drawn with green frames
- In the equation 3 in ,EGNN The distance between two coordinates is increased
As a parameter
- In the equation 4 in , According to the meaning of the paper ,xi The position of is updated to the radial vector field , Owned by
Joint decision 【 I don't quite understand here , Why is it radial , Because it's different xi, Its direction is different ,
Subtract as a vector , Not necessarily along the radial direction 】
- equation 4 The weighting coefficient of each term is determined by the function
Calculated
- there C=1/(M-1)
- ——> equation 3&4 It can guarantee equidenaturation
- ——> meanwhile , Another difference is , All of these are considered here (i,j) Yes , Not just between neighboring points pair, in other words ,embedding mij Can contain information about the full graph
3.1 Translational equivariant
g Is a translation vector ,x(type-1 vector ) Is translational equivariant (equivariant),h(type-0 vector ) Is translation invariance (invariant)

——> It's not hard to find out ,EGCL The combination of is also equivariant
3.2 expand EGNN
Here to the front EGNN Make minor modifications , So that we can explicitly track the momentum of the particles .
This can not only be used to obtain the estimated velocity of particles in each layer , Momentum can also be introduced
Expressed by formula , Will be 
Expressed as 
'
If
by 0, So the equation 4 And equation 7 It's about one thing
3.3 Get some information
In some cases , We may not always get an adjacency matrix . In these cases , We can assume a full connection graph , Where all nodes exchange messages with each other .
This full connection method may not be well extended to large graphs , We want to consider only the neighbor nodes N(i) Interaction between points of .
The paper uses the following method here 
Among them, if (i,j) There is a link , that eij by 1, Otherwise 0
The paper approximates by a function eij :
( Linear layer +sigmoid Activation function , The input side embedding, Output side value Of soft estimation)
边栏推荐
- A tour of grpc:02 - generate code from proto
- Sophon base 3.1 launches mlops function to provide wings for enterprise AI capability operation
- Sophon kg upgrade 3.1: break down barriers between data and liberate enterprise productivity
- Cortical traceability analysis of ERP
- 工具链赋能百家,地平线开启智能驾驶量产的“马太效应”
- Comprehensive analysis of Seata distributed transaction at and XA
- Which parameter is the partition information adjusted? The MySQL source stream API is used, not the table API
- 全面剖析Seata 分布式事务 AT 与XA
- 实践 | 脚本错误量极致优化-让脚本错误一目了然
- 真正的测试 =“半个产品+半个开发”?
猜你喜欢

MySQL基础——多表查询

贪婪的苹果计划提高iPhone14的价格,这将为中国手机提供机会

一个简单但是能上分的特征标准化方法

CV5200自组网远程WiFi模组,无人机无线图传应用,高清低时延方案

能够1年涨薪2次的软件测试工程师,他到底强在哪里?

破解湖+仓混合架构顽疾,星环科技推出自主可控云原生湖仓一体平台

Sophon Base 3.1 推出MLOps功能,为企业AI能力运营插上翅膀

Huaxia Fund: sharing of digital transformation practice achievements in the fund industry

如何利用OpenMesh实现不同格式的3D文件间的转换

Cortical traceability analysis of ERP
随机推荐
Sophon Base 3.1 推出MLOps功能,为企业AI能力运营插上翅膀
Cv5200 ad hoc network remote WiFi module, UAV wireless image transmission application, HD low delay scheme
The latest agenda of dtcc2022 China database technology conference was released
C winfrom chart chart control bar chart and line chart
数学知识:求组合数 II—求组合数
Which parameter is the partition information adjusted? The MySQL source stream API is used, not the table API
为防止被00后整顿,一公司招聘要求员工不能起诉公司
Key sprite fighting monsters - window binding skill
迪赛智慧数——其他图表(基本旭日图):毕业演讲高频词
GNN笔记:消息传播模型
After 3 years of testing experience, do you know the state transition method for use case design?
Picture and text show you how to thoroughly understand the atomicity of MySQL transaction undolog
leetcode:42. Rain water [double hands are elegant]
代码大全读后感
华夏基金:基金行业数字化转型实践成果分享
或许再过两年,ASML将可以自由给中国供应EUV光刻机
[proteus simulation] 8-bit nixie tube dynamic scanning display change data
To solve the stubborn problem of Lake + warehouse hybrid architecture, Star Ring Technology launched an independent controllable cloud native Lake warehouse integrated platform
Selenium 凭什么成为 Web 自动化测试的首选?(内附源码)
哪个版本的JVM最快?
, among
,
( and 2.2 Of GNN equally )[ This nf refer to node feature, Not at all n ride f]【 Physical information without direction , Scalar 】
【 Geometric information with direction , vector 】![h^{l+1},x^{l+1}=EGCL[h^l,x^l,\mathcal{E}]](http://img.inotgo.com/imagesLocal/202206/29/202206291549325189_2.gif)

As a parameter
Joint decision
Calculated