当前位置:网站首页>Single cell thesis record (part13) -- spagcn: integrating gene expression, spatial location and history to
Single cell thesis record (part13) -- spagcn: integrating gene expression, spatial location and history to
2022-06-22 06:05:00 【GoatGui】
Learning notes , For reference only , If there is a mistake, it must be corrected
Authors:Jian Hu,Xiangjie Li,Mingyao Li
Journal:Nature Methods
Year:2021
List of articles
SpaGCN: Integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network
abstract
Recent advances in spatially resolved transcriptomics (SRT) technologies have enabled comprehensive characterization of gene expression patterns in the context of tissue microenvironment. To elucidate the spatial variation of gene expression , We proposed SpaGCN, This is a graph convolution network method , stay SRT Gene expression was integrated into the data analysis 、 Spatial location and histology . Through graph convolution, SpaGCN aggregates gene expression of each spot from its neighboring spots, which enables the identification of spatial domains( Space domain ) with coherent expression and histology ( Consistent expression and histology ). The subsequent domain guided differential expression (DE) analysis then detects genes with enriched expression patterns in the identified domains ( Detect the genes with rich expression patterns in the identified field ). Analyzing seven SRT datasets using SpaGCN, we show it can detect genes with much more enriched spatial expression patterns than competing methods. Furthermore, genes detected by SpaGCN are transferrable and can be utilized to study spatial variation of gene expression in other datasets(SpaGCN The detected genes are transferable , It can be used to study the spatial changes of gene expression in other data sets ). SpaGCN is computationally fast, platform independent, making it a desirable tool for diverse SRT studies.
Overview of SpaGCN and evaluation.
We explain the workflow of SpaGCN using in situ capturing-based SRT data as an example, but the method can be easily modified to analyze other types of SRT data. As shown in Fig. 1a, SpaGCN first builds a graph to represent the relationship of all spots considering both spatial location and histology information. Next, SpaGCN utilizes a graph convolutional layer to aggregate gene expression information from neighboring spots. Then, SpaGCN uses the aggregated expression matrix to cluster spots using an unsupervised iterative clustering algorithm. Each cluster is considered as a spatial domain from which SpaGCN then detects SVGs that are enriched in a domain by DE analysis (Fig. 1b). When a single gene cannot mark the expression pattern of a domain, SpaGCN will construct a meta gene, formed by the combination of multiple genes, to represent the expression pattern of the domain.
To showcase the strength of SpaGCN, we applied it to seven publicly available datasets (Supplementary Table 1). The spatial domains identified by SpaGCN agree better with known tissue structures than Louvain, stLearn, and BayesSpace. We also compared SVGs detected by SpaGCN with those detected by SpatialDE and SPARK, and found that the SpaGCN-detected SVGs have more coherent expression patterns and better biological interpretability than the other two methods. The specificity of spatial expression patterns revealed by SpaGCN-detected SVGs were further confirmed by Moran’s I and Geary’s C statistics, two commonly used metrics for quantifying spatial autocorrelation of gene expression.


Fig. 1 | Workflow of SpaGCN. a, SpaGCN First use graph convolution network (GCN) Integration of gene expression 、 Spatial location and histological information , Then use unsupervised iterative clustering to spot Divided into different spatial domains . GCN Is based on an undirected weighted graph , Every two of them spot The edge weight between the two spot The Euclidean distance between ,spot By spatial coordinates (x,y) And the third dimension z Definition (z From the histological image RGB It's worth getting ). b, For each detected spatial domain ,SpaGCN adopt domain Guided DE Analysis and identification SVG or meta genes.
Methods








The network parameters and cluster centroids are simultaneously optimized by minimizing L using stochastic gradient descent with momentum. This unsupervised iterative clustering algorithm has previously been utilized for scRNA-seq analysis and showed superior performance over Louvain’s method.
After clustering, SpaGCN also provides an optional refinement step for the clustering result. In this step, SpaGCN examines the domain assignment of each spot and its surrounding spots. For a given spot, if more than half of its surrounding spots are assigned to a different domain, this spot will be relabeled to the same domain as the major label of its surrounding spots. As this refinement step only relabels a few spots, it has little impact on the downstream SVG detection. We performed cluster refinement only for the human dorsolateral prefrontal cortex 10x Visium data and the STARmap data when comparing to their manual annotations with clear domain boundaries.
边栏推荐
- Use of idea plug-in EASYCODE
- PID笔记
- Matlab system identification
- Test platform for combinational logic blocks
- Simple use of idea plug-in easy code
- 3D asset optimization and vertex data management for performance optimization
- Case analysis of terminal data leakage prevention
- An unordered array of N integers. Find the first number after each element that is larger than it. The time complexity is O (n)
- Adaboost
- 单细胞文献学习(part2)--stPlus: a reference-based method for the accurate enhancement of ST
猜你喜欢

I2C interface

单细胞论文记录(part13)--SpaGCN: Integrating gene expression, spatial location and histology to ...

小熊派BearPi-HM Micro正式合入OpenHarmony主干

I2C接口

401-字符串(344. 反转字符串、541. 反转字符串II、题目:剑指Offer 05.替换空格、151. 颠倒字符串中的单词)

Grabcut analysis

idea本地运行scope

DataBricks从开源到商业化踩过的坑

Single cell literature learning (Part2) -- stplus: a reference based method for the exact enhancement of St

MATLAB系统辨识
随机推荐
W800芯片平台进入OpenHarmony主干
Compréhension du pointeur C
格雷码与二进制的转换
Matlab system identification
RGB, sRGB and XYZ coordinate conversion
Use of idea plug-in EASYCODE
clickhouse对比两台机器数据
单细胞论文记录(part10)--Computational challenges and opportunities in SRT data
D3D10 截图功能 保存Texture到本地
D3D learning notes (1) - Introduction to the use conditions of autodraw at so stage
时序构成的测试平台
reduce_sum()中的reduction_indices
vcpkg:If you are sure you want to rebuild the above packages, run the command with the --recurse opt
Ethernet communication protocol
Viewing advanced numbers from the perspective of vector space (1) -- a series introduction
分页工具类 pageUtil<T>
System identification of automatic control principle
BinaryFormatter 保存和加载游戏数据 For Unity
Unity development - scene asynchronous loading
D3D学习笔记(1)—— SO 阶段介绍 AutoDraw使用条件