scSDSC: Self-supervised Deep Subspace Clustering for scRNA-seq Data

IF 2.9 3区生物学 Q3 BIOCHEMICAL RESEARCH METHODS

Current Bioinformatics Pub Date : 2023-08-16 DOI:10.2174/1574893618666230816090443

Jian-ping Zhao, Bo Yang, Hai-yun Wang, Chunhan Zheng

{"title":"scSDSC: Self-supervised Deep Subspace Clustering for scRNA-seq Data","authors":"Jian-ping Zhao, Bo Yang, Hai-yun Wang, Chunhan Zheng","doi":"10.2174/1574893618666230816090443","DOIUrl":null,"url":null,"abstract":"\n\nSingle-cell RNA sequencing(scRNA-seq) data can identify heterogeneity between cells, thereby identifying cell types and discovering rare cell types. Clustering is often used to identify cell types, but the high noise and high dimension of scRNA-seq lead to the degradation of clustering performance and impact downstream analysis. Deep learning is widely used in this field, which provides promising performance in feature learning.\n\n\n\nMost deep learning models only consider the relationship between genes, ignore the relationship between cells. We try to use the relationships between cells and the relationships between genes to construct clustering models.\n\n\n\nWe proposed scSDSC: a deep subspace cluster architecture that considers the relationships between genes and cells at the same time. Similar to deep subspace clustering (DSC), we added a fully connected layer after the embedding layer to obtain the self-expression matrix. In addition, we also added a fully connected SoftMax layer to generate the pseudo-label and used the information carried by the pseudo-label for model training. Finally, the affinity matrix is obtained for spectral clustering.\n\n\n\nExperimental results on eight real datasets show that scSDSC outperforms existing methods in downstream analysis.\n\n\n\nOur method plays an important role in improving clustering accuracy and downstream analysis.\n","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2174/1574893618666230816090443","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Single-cell RNA sequencing(scRNA-seq) data can identify heterogeneity between cells, thereby identifying cell types and discovering rare cell types. Clustering is often used to identify cell types, but the high noise and high dimension of scRNA-seq lead to the degradation of clustering performance and impact downstream analysis. Deep learning is widely used in this field, which provides promising performance in feature learning. Most deep learning models only consider the relationship between genes, ignore the relationship between cells. We try to use the relationships between cells and the relationships between genes to construct clustering models. We proposed scSDSC: a deep subspace cluster architecture that considers the relationships between genes and cells at the same time. Similar to deep subspace clustering (DSC), we added a fully connected layer after the embedding layer to obtain the self-expression matrix. In addition, we also added a fully connected SoftMax layer to generate the pseudo-label and used the information carried by the pseudo-label for model training. Finally, the affinity matrix is obtained for spectral clustering. Experimental results on eight real datasets show that scSDSC outperforms existing methods in downstream analysis. Our method plays an important role in improving clustering accuracy and downstream analysis.

查看原文本刊更多论文

scSDSC：scRNA-seq数据的自监督深子空间聚类

单细胞RNA测序(scRNA-seq)数据可以识别细胞间的异质性，从而鉴定细胞类型，发现罕见的细胞类型。聚类通常用于识别细胞类型，但scRNA-seq的高噪声和高维数导致聚类性能下降，影响下游分析。深度学习在这一领域得到了广泛的应用，在特征学习方面有很好的表现。大多数深度学习模型只考虑基因之间的关系，忽略了细胞之间的关系。我们尝试使用细胞之间的关系和基因之间的关系来构建聚类模型。我们提出了一种同时考虑基因和细胞之间关系的深层子空间簇结构——scSDSC。与深子空间聚类(deep subspace clustering, DSC)类似，我们在嵌入层之后增加一个完全连接层来获得自表达矩阵。此外，我们还增加了一个全连接的SoftMax层来生成伪标签，并利用伪标签携带的信息进行模型训练。最后，得到用于谱聚类的亲和矩阵。在8个真实数据集上的实验结果表明，scSDSC在下游分析方面优于现有方法。该方法在提高聚类精度和下游分析方面发挥了重要作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Current Bioinformatics 生物-生化研究方法

CiteScore

6.60

自引率

2.50%

发文量

审稿时长

>12 weeks

期刊介绍： Current Bioinformatics aims to publish all the latest and outstanding developments in bioinformatics. Each issue contains a series of timely, in-depth/mini-reviews, research papers and guest edited thematic issues written by leaders in the field, covering a wide range of the integration of biology with computer and information science. The journal focuses on advances in computational molecular/structural biology, encompassing areas such as computing in biomedicine and genomics, computational proteomics and systems biology, and metabolic pathway engineering. Developments in these fields have direct implications on key issues related to health care, medicine, genetic disorders, development of agricultural products, renewable energy, environmental protection, etc.