{"title":"scGANSL:基于子空间学习的scRNA-seq数据聚类图注意网络。","authors":"Zhenqiu Shu, Yixuan Ren, Qinghan Long, Hongbin Wang, Zhengtao Yu","doi":"10.1021/acs.jcim.5c00731","DOIUrl":null,"url":null,"abstract":"<p><p>Single-cell RNA sequencing (scRNA-seq) has become a crucial technology for analyzing cellular diversity at the single-cell level. Cell clustering is crucial in scRNA-seq data analysis as it accurately identifies distinct cell types and uncovers potential subpopulations. However, most existing scRNA-seq methods rely on a single view for analysis, leading to an incomplete interpretation of the scRNA-seq data. Furthermore, the high dimensionality of the scRNA-seq data and the inevitable noise pose significant challenges for clustering tasks. To address these challenges, in this study, we introduce a novel clustering method, called graph attention network with subspace learning (scGANSL), for scRNA-seq data clustering. Specifically, the proposed scGANSL method first constructs two views using highly variable genes (HVGs) screening and principal component analysis (PCA). They are then individually fed into a multiview shared graph autoencoder, where clustering labels guide the learning of latent representations and the coefficient matrix. Furthermore, the proposed method integrates a zero-inflated negative binomial (ZINB) model into a self-supervised graph attention autoencoder to learn latent representations more effectively. To preserve both local and global structures of scRNA-seq data in the latent representation space, we introduce a local learning and self-expression strategy to guide model training. Experimental results across various scRNA-seq data sets demonstrate that the proposed scGANSL model significantly outperforms other state-of-the-art scRNA-seq data clustering methods.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"6367-6381"},"PeriodicalIF":5.3000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"scGANSL: Graph Attention Network with Subspace Learning for scRNA-seq Data Clustering.\",\"authors\":\"Zhenqiu Shu, Yixuan Ren, Qinghan Long, Hongbin Wang, Zhengtao Yu\",\"doi\":\"10.1021/acs.jcim.5c00731\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Single-cell RNA sequencing (scRNA-seq) has become a crucial technology for analyzing cellular diversity at the single-cell level. Cell clustering is crucial in scRNA-seq data analysis as it accurately identifies distinct cell types and uncovers potential subpopulations. However, most existing scRNA-seq methods rely on a single view for analysis, leading to an incomplete interpretation of the scRNA-seq data. Furthermore, the high dimensionality of the scRNA-seq data and the inevitable noise pose significant challenges for clustering tasks. To address these challenges, in this study, we introduce a novel clustering method, called graph attention network with subspace learning (scGANSL), for scRNA-seq data clustering. Specifically, the proposed scGANSL method first constructs two views using highly variable genes (HVGs) screening and principal component analysis (PCA). They are then individually fed into a multiview shared graph autoencoder, where clustering labels guide the learning of latent representations and the coefficient matrix. Furthermore, the proposed method integrates a zero-inflated negative binomial (ZINB) model into a self-supervised graph attention autoencoder to learn latent representations more effectively. To preserve both local and global structures of scRNA-seq data in the latent representation space, we introduce a local learning and self-expression strategy to guide model training. Experimental results across various scRNA-seq data sets demonstrate that the proposed scGANSL model significantly outperforms other state-of-the-art scRNA-seq data clustering methods.</p>\",\"PeriodicalId\":44,\"journal\":{\"name\":\"Journal of Chemical Information and Modeling \",\"volume\":\" \",\"pages\":\"6367-6381\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Information and Modeling \",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jcim.5c00731\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/5 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.5c00731","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/5 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
scGANSL: Graph Attention Network with Subspace Learning for scRNA-seq Data Clustering.
Single-cell RNA sequencing (scRNA-seq) has become a crucial technology for analyzing cellular diversity at the single-cell level. Cell clustering is crucial in scRNA-seq data analysis as it accurately identifies distinct cell types and uncovers potential subpopulations. However, most existing scRNA-seq methods rely on a single view for analysis, leading to an incomplete interpretation of the scRNA-seq data. Furthermore, the high dimensionality of the scRNA-seq data and the inevitable noise pose significant challenges for clustering tasks. To address these challenges, in this study, we introduce a novel clustering method, called graph attention network with subspace learning (scGANSL), for scRNA-seq data clustering. Specifically, the proposed scGANSL method first constructs two views using highly variable genes (HVGs) screening and principal component analysis (PCA). They are then individually fed into a multiview shared graph autoencoder, where clustering labels guide the learning of latent representations and the coefficient matrix. Furthermore, the proposed method integrates a zero-inflated negative binomial (ZINB) model into a self-supervised graph attention autoencoder to learn latent representations more effectively. To preserve both local and global structures of scRNA-seq data in the latent representation space, we introduce a local learning and self-expression strategy to guide model training. Experimental results across various scRNA-seq data sets demonstrate that the proposed scGANSL model significantly outperforms other state-of-the-art scRNA-seq data clustering methods.
期刊介绍:
The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery.
Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field.
As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.