scGANSL:基于子空间学习的scRNA-seq数据聚类图注意网络。

IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL
Zhenqiu Shu, Yixuan Ren, Qinghan Long, Hongbin Wang, Zhengtao Yu
{"title":"scGANSL:基于子空间学习的scRNA-seq数据聚类图注意网络。","authors":"Zhenqiu Shu, Yixuan Ren, Qinghan Long, Hongbin Wang, Zhengtao Yu","doi":"10.1021/acs.jcim.5c00731","DOIUrl":null,"url":null,"abstract":"<p><p>Single-cell RNA sequencing (scRNA-seq) has become a crucial technology for analyzing cellular diversity at the single-cell level. Cell clustering is crucial in scRNA-seq data analysis as it accurately identifies distinct cell types and uncovers potential subpopulations. However, most existing scRNA-seq methods rely on a single view for analysis, leading to an incomplete interpretation of the scRNA-seq data. Furthermore, the high dimensionality of the scRNA-seq data and the inevitable noise pose significant challenges for clustering tasks. To address these challenges, in this study, we introduce a novel clustering method, called graph attention network with subspace learning (scGANSL), for scRNA-seq data clustering. Specifically, the proposed scGANSL method first constructs two views using highly variable genes (HVGs) screening and principal component analysis (PCA). They are then individually fed into a multiview shared graph autoencoder, where clustering labels guide the learning of latent representations and the coefficient matrix. Furthermore, the proposed method integrates a zero-inflated negative binomial (ZINB) model into a self-supervised graph attention autoencoder to learn latent representations more effectively. To preserve both local and global structures of scRNA-seq data in the latent representation space, we introduce a local learning and self-expression strategy to guide model training. Experimental results across various scRNA-seq data sets demonstrate that the proposed scGANSL model significantly outperforms other state-of-the-art scRNA-seq data clustering methods.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"6367-6381"},"PeriodicalIF":5.3000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"scGANSL: Graph Attention Network with Subspace Learning for scRNA-seq Data Clustering.\",\"authors\":\"Zhenqiu Shu, Yixuan Ren, Qinghan Long, Hongbin Wang, Zhengtao Yu\",\"doi\":\"10.1021/acs.jcim.5c00731\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Single-cell RNA sequencing (scRNA-seq) has become a crucial technology for analyzing cellular diversity at the single-cell level. Cell clustering is crucial in scRNA-seq data analysis as it accurately identifies distinct cell types and uncovers potential subpopulations. However, most existing scRNA-seq methods rely on a single view for analysis, leading to an incomplete interpretation of the scRNA-seq data. Furthermore, the high dimensionality of the scRNA-seq data and the inevitable noise pose significant challenges for clustering tasks. To address these challenges, in this study, we introduce a novel clustering method, called graph attention network with subspace learning (scGANSL), for scRNA-seq data clustering. Specifically, the proposed scGANSL method first constructs two views using highly variable genes (HVGs) screening and principal component analysis (PCA). They are then individually fed into a multiview shared graph autoencoder, where clustering labels guide the learning of latent representations and the coefficient matrix. Furthermore, the proposed method integrates a zero-inflated negative binomial (ZINB) model into a self-supervised graph attention autoencoder to learn latent representations more effectively. To preserve both local and global structures of scRNA-seq data in the latent representation space, we introduce a local learning and self-expression strategy to guide model training. Experimental results across various scRNA-seq data sets demonstrate that the proposed scGANSL model significantly outperforms other state-of-the-art scRNA-seq data clustering methods.</p>\",\"PeriodicalId\":44,\"journal\":{\"name\":\"Journal of Chemical Information and Modeling \",\"volume\":\" \",\"pages\":\"6367-6381\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Information and Modeling \",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jcim.5c00731\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/5 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.5c00731","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/5 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

摘要

单细胞RNA测序(scRNA-seq)已成为单细胞水平分析细胞多样性的关键技术。细胞聚类在scRNA-seq数据分析中至关重要,因为它能准确识别不同的细胞类型并揭示潜在的亚群。然而,大多数现有的scRNA-seq方法依赖于单一视图进行分析,导致对scRNA-seq数据的解释不完整。此外,scRNA-seq数据的高维性和不可避免的噪声给聚类任务带来了重大挑战。为了解决这些挑战,在本研究中,我们引入了一种新的聚类方法,称为带有子空间学习的图注意网络(scGANSL),用于scRNA-seq数据聚类。具体而言,所提出的scGANSL方法首先使用高变量基因(hvg)筛选和主成分分析(PCA)构建了两个视图。然后将它们单独输入到多视图共享图自编码器中,其中聚类标签指导潜在表示和系数矩阵的学习。此外,该方法将零膨胀负二项(ZINB)模型集成到自监督图注意自编码器中,以更有效地学习潜在表征。为了在潜在表示空间中保留scRNA-seq数据的局部和全局结构,我们引入了局部学习和自我表达策略来指导模型训练。各种scRNA-seq数据集的实验结果表明,所提出的scGANSL模型显著优于其他最先进的scRNA-seq数据聚类方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
scGANSL: Graph Attention Network with Subspace Learning for scRNA-seq Data Clustering.

Single-cell RNA sequencing (scRNA-seq) has become a crucial technology for analyzing cellular diversity at the single-cell level. Cell clustering is crucial in scRNA-seq data analysis as it accurately identifies distinct cell types and uncovers potential subpopulations. However, most existing scRNA-seq methods rely on a single view for analysis, leading to an incomplete interpretation of the scRNA-seq data. Furthermore, the high dimensionality of the scRNA-seq data and the inevitable noise pose significant challenges for clustering tasks. To address these challenges, in this study, we introduce a novel clustering method, called graph attention network with subspace learning (scGANSL), for scRNA-seq data clustering. Specifically, the proposed scGANSL method first constructs two views using highly variable genes (HVGs) screening and principal component analysis (PCA). They are then individually fed into a multiview shared graph autoencoder, where clustering labels guide the learning of latent representations and the coefficient matrix. Furthermore, the proposed method integrates a zero-inflated negative binomial (ZINB) model into a self-supervised graph attention autoencoder to learn latent representations more effectively. To preserve both local and global structures of scRNA-seq data in the latent representation space, we introduce a local learning and self-expression strategy to guide model training. Experimental results across various scRNA-seq data sets demonstrate that the proposed scGANSL model significantly outperforms other state-of-the-art scRNA-seq data clustering methods.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
9.80
自引率
10.70%
发文量
529
审稿时长
1.4 months
期刊介绍: The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery. Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field. As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信