当前位置:网站首页>CaEGCN: Cross-Attention Fusion based Enhanced Graph Convolutional Network for Clustering 2021
CaEGCN: Cross-Attention Fusion based Enhanced Graph Convolutional Network for Clustering 2021
2022-07-27 23:43:00 【Oriental shrimp】
problem : Existing deep clustering methods often ignore the relationship between data .
This paper proposes a deep clustering framework based on cross attention —— Enhanced graph convolution network based on cross attention fusion (CaEGCN) , The network consists of four main modules : Cross attention fusion module , Innovatively self coding the content related to individual data (CAE) And a graphic convolutional self coding module related to the relationship between layer by layer data (GAE) Connect ; And self-monitoring model , The model highlights the identification information of clustering tasks . The cross attention fusion module integrates two heterogeneous representations ,CAE The module complements GAE Content information of the module , Avoided GCN Over smoothing problem . stay GAE Module , Two new loss functions are proposed , Reconstruct the content of data and the relationship between data respectively . Last , Self monitoring module constraints CAE and GAE The distribution represented by the middle layer is consistent .
1) A deep clustering framework based on end-to-end cross attention fusion is proposed , Among them, the cross attention fusion module creatively connects the automatic picture coding module and the automatic content coding module ;
2) A cross attention fusion model is proposed , Pay attention to the weight distribution of the fused heterogeneous representation ;
3) In the graph convolution automatic coding module , Propose to reconstruct the content and relationship of data at the same time , Effectively enhanced CaEGCN Clustering performance ;
CROSS-ATTENTION FUSION BASED ENHANCED GRAPH CONVOLUTIONAL NETWORK
The overall framework is as shown in the figure :
It consists of four main modules : Self encoder module for extracting content information ; be based on GCN Self encoder module , Used to exploit the relationship between data ; Cross attention module , Used to connect the above two modules , The multi-level adaptive fusion strategy supplements effective content information as much as possible in the transmission process ; And a self-monitoring module for constraining the distribution consistency of the middle layer representation .
A. Constructing the Graph
Construct raw data X The method of drawing is similar to SDCN equally ,KNN Method .
B. Content Auto-encoder Module (CAE)
Adopt the full connection layer of the foundation , Loss adopts reconstruction loss , restructure X.
C. Cross-Attention Fusion Module
Cross attention fusion mechanism is used to integrate CAE The content information learned and GAE Integrate the learned data relationships .
The cross attention fusion mechanism is defined as :
R = F a t t ( Q , K , V ) (1) R=F_{att}(Q,K,V) \tag{1} R=Fatt(Q,K,V)(1)
Q:query,K:key,V:value
The initial input of the cross attention fusion module Y Defined as :
Y = γ Z l + ( 1 − r ) H l (2) Y=\gamma Z_l + (1-r)H_l \tag{2} Y=γZl+(1−r)Hl(2)
Z l , H l Z_l,H_l Zl,Hl Respectively GAE And CAE The first l Layer output . γ \gamma γ It's a trade-off parameter , In this experiment, it is set to 0.5.
Attention mechanism calculation : First calculate the fusion query And fusion key The similarity between :
s a b = q a ∗ k b (3) s_{ab} = q_a * k_b \tag{3} sab=qa∗kb(3)
Then proceed on the above basis softmax Normalized acquisition a a b a_{ab} aab:
a a b = s o f t m a x ( s a b ) = e x p ( s a b ) ∑ a = 0 D a t t e x p ( s a b ) (4) a_{ab} = softmax(s_{ab}) = \frac{exp(s_{ab})}{\sum^{D_{att}}_{a=0}exp(s_{ab})}\tag{4} aab=softmax(sab)=∑a=0Dattexp(sab)exp(sab)(4)
Cross attention fusion mechanism R The final output is :
r a = ∑ b = 0 N a a b v b (5) r_a = \sum^N_{b=0}a_{ab}v_b \tag{5} ra=b=0∑Naabvb(5)
To further perceive different aspects of data , The multi head mechanism is introduced :
R m = F a t t ( Q m , K m , V m ) , m = 1 , 2 , 3 , . . . , M . (6) R^m=F_{att}(Q_m,K_m,V_m) \tag{6},m=1,2,3,...,M. Rm=Fatt(Qm,Km,Vm),m=1,2,3,...,M.(6)
among Q m = W m q Q Q_m = W_m^qQ Qm=WmqQ,K,V Empathy .
R = W ⋅ C o n c a t ( R 1 , . . . , R M ) (7) R = W\cdot Concat(R_1,...,R_M)\tag{7} R=W⋅Concat(R1,...,RM)(7)
among Concat(·) Represents the matrix concatenation operation . This is the so-called multi head mechanism and cross attention fusion module .
D. Graph Convolutional Auto-Encoder Module (GAE)
The output obtained from the previous cross attention fusion module is represented R As GAE The input of , Perform spectral convolution .
Z l = G A E ( R l − 1 , A ) = a l ( D ^ − 1 / 2 A ^ D ^ − 1 / 2 R l − 1 U L ) (8) Z_l = GAE(R_{l-1},A)=a_l(\hat{D}^{-1/2}\hat{A}\hat{D}^{-1/2}R_{l-1}U_L )\tag{8} Zl=GAE(Rl−1,A)=al(D^−1/2A^D^−1/2Rl−1UL)(8)
Loss function reconstruction A ~ = S i g m o d i ( Z L T Z L ) \tilde{A} = Sigmodi(Z_L^TZ_L) A~=Sigmodi(ZLTZL) and X ~ = Z L \tilde{X}=Z_L X~=ZL.
E. Self-Supervised Module
In the previous study, it is difficult to judge whether it is the optimal representation of clustering , We need to give an optimization goal of clustering .
Here and such SDCN Like the previous methods Student’s t-distribution As a real label , The second normalization is used as the target label . Yes CAE and GAE At the same time, optimize .
边栏推荐
- Can Siemens PLC collect analog data of multiple slave stations in real time and wirelessly?
- Ideas, methods and steps of making folding fans with 3DMAX
- 面试官:说一下网络数据传输的具体流程
- NDK 系列(6):说一下注册 JNI 函数的方式和时机
- 请求合并哪家强——接口请求合并的3种技巧,性能直接爆表
- Calling dht11/22 temperature and humidity sensor in Proteus simulation Arduino
- 进制转换方法
- 你的列表很卡?这4个优化能让你的列表丝般顺滑
- How do I shut down oracle?
- Using the optical fingerprint scheme under the huiding screen, Samsung Galaxy a71 5g is listed
猜你喜欢

Learn more about xxE injection

Flink怎么使用Savepoint

Process and planned task management

Common Taylor expansion

我年薪100万,全身上下没有超过100块的衣服:存钱,是最顶级的自律

Technical certification | Tupo software and Huawei cloud create a new situation of win-win cooperation

字符流学习14.3

The technology of applet container is very promising, which can greatly improve the efficiency of mobile R & D

Introduction to several common usage scenarios of message queue

用户画像在科技期刊微信公众号精准推送中的应用
随机推荐
Which one is better to request to merge -- three skills of interface request merging, and the performance directly explodes the table
Character stream learning 14.3
Blood spitting finishing nanny level series tutorial - playing Fiddler bag capturing tutorial (5) - detailed explanation of fiddler monitoring panel
低代码开发前景如何,大家都真的看好低代码开发吗?
The share price soared 180.46%! Shanghai silicon industry, the leader of domestic large silicon wafers, is listed: the cumulative net profit in recent four years is less than 60million
字符流学习14.3
Introduction to several common usage scenarios of message queue
重新定义分析 - EventBridge 实时事件分析平台发布
请求合并哪家强——接口请求合并的3种技巧,性能直接爆表
编辑复制粘贴判定问题(bug?),所见即所得显示符号问题反馈。
[soft test software evaluator] 2014 comprehensive knowledge over the years
Ideas, methods and steps of making folding fans with 3DMAX
【软考软件评测师】2014综合知识历年真题
Application of user portrait in precise push of wechat official account of scientific journals
What technology is RPA process automation robot? How to realize office automation?
How to use xshell Free Edition
Elk log analysis system installation and deployment
Disable caching with meta HTML tags in all browsers
Record the errors about formatc in R language
MySQL之数据查询(WHERE)