{"title":"Recursive Clustering of Cellular Diversity in scRNA-Seq Data.","authors":"Michael Squires, Peng Qiu","doi":"10.1089/cmb.2024.0625","DOIUrl":null,"url":null,"abstract":"<p><p>In scRNA-seq analysis, cell clusters are typically defined by a single round of feature extraction and clustering. This approach may miss phenotypic differences in cell types that are characterized by genes not sufficiently represented in the feature set derived using all cells, such as rare cell types. This work explores an alternative approach, where cell clusters are identified by recursively performing feature extraction and clustering on previously identified clusters, such that each subclustering step uses features that are more specific to distinguishing the higher-resolution subclusters. We benchmark this recursive approach against the conventional, nonrecursive clustering approach and demonstrate that the recursive method results in robust improvement in cell type detection on four scRNA-seq datasets across a wide range of clustering resolution parameters. We apply the recursive approach to cluster scRNA-seq data obtained from patients with Crohn's disease belonging to three clinical phenotypes and observe that recursive clustering captures phenotypic differences only visible at specific levels of granularity within an interpretable hierarchical framework while defining cell clusters within a gene expression feature space more specific to each cluster.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1089/cmb.2024.0625","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
In scRNA-seq analysis, cell clusters are typically defined by a single round of feature extraction and clustering. This approach may miss phenotypic differences in cell types that are characterized by genes not sufficiently represented in the feature set derived using all cells, such as rare cell types. This work explores an alternative approach, where cell clusters are identified by recursively performing feature extraction and clustering on previously identified clusters, such that each subclustering step uses features that are more specific to distinguishing the higher-resolution subclusters. We benchmark this recursive approach against the conventional, nonrecursive clustering approach and demonstrate that the recursive method results in robust improvement in cell type detection on four scRNA-seq datasets across a wide range of clustering resolution parameters. We apply the recursive approach to cluster scRNA-seq data obtained from patients with Crohn's disease belonging to three clinical phenotypes and observe that recursive clustering captures phenotypic differences only visible at specific levels of granularity within an interpretable hierarchical framework while defining cell clusters within a gene expression feature space more specific to each cluster.
期刊介绍:
Journal of Computational Biology is the leading peer-reviewed journal in computational biology and bioinformatics, publishing in-depth statistical, mathematical, and computational analysis of methods, as well as their practical impact. Available only online, this is an essential journal for scientists and students who want to keep abreast of developments in bioinformatics.
Journal of Computational Biology coverage includes:
-Genomics
-Mathematical modeling and simulation
-Distributed and parallel biological computing
-Designing biological databases
-Pattern matching and pattern detection
-Linking disparate databases and data
-New tools for computational biology
-Relational and object-oriented database technology for bioinformatics
-Biological expert system design and use
-Reasoning by analogy, hypothesis formation, and testing by machine
-Management of biological databases