{"title":"scPANDA: PAN-Blood Data Annotator with a 10-Million Single-Cell Atlas.","authors":"Chang-Xiao Li, Can Huang, Dong-Sheng Chen","doi":"10.24920/004472","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Recent advancements in single-cell RNA sequencing (scRNA-seq) have revolutionized the study of cellular heterogeneity, particularly within the hematological system. However, accurately annotating cell types remains challenging due to the complexity of immune cells. To address this challenge, we develop a PAN-blood single-cell Data Annotator (scPANDA), which leverages a comprehensive 10-million-cell atlas to provide precise cell type annotation.</p><p><strong>Methods: </strong>The atlas, constructed from data collected in 16 studies, incorporated rigorous quality control, preprocessing, and integration steps to ensure a high-quality reference for annotation. scPANDA utilizes a three-layer inference approach, progressively refining cell types from broad compartments to specific clusters. Iterative clustering and harmonization processes were employed to maintain cell type purity throughout the analysis. Furthermore, the performance of scPANDA was evaluated in three external datasets.</p><p><strong>Results: </strong>The atlas was structured hierarchically, consisting of 16 compartments, 54 classes, 4,460 low-level clusters (<i>pd_cc_cl_tfs</i>), and 611 high-level clusters (<i>pmid_cts</i>). Robust performance of the tool was demonstrated in annotating diverse immune scRNA-seq datasets, analyzing immune-tumor coexisting clusters in renal cell carcinoma, and identifying conserved cell clusters across species.</p><p><strong>Conclusions: </strong>scPANDA exemplifies effective reference mapping with a large-scale atlas, enhancing the accuracy and reliability of blood cell type identification.</p>","PeriodicalId":35615,"journal":{"name":"Chinese Medical Sciences Journal","volume":" ","pages":"1-21"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Medical Sciences Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.24920/004472","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: Recent advancements in single-cell RNA sequencing (scRNA-seq) have revolutionized the study of cellular heterogeneity, particularly within the hematological system. However, accurately annotating cell types remains challenging due to the complexity of immune cells. To address this challenge, we develop a PAN-blood single-cell Data Annotator (scPANDA), which leverages a comprehensive 10-million-cell atlas to provide precise cell type annotation.
Methods: The atlas, constructed from data collected in 16 studies, incorporated rigorous quality control, preprocessing, and integration steps to ensure a high-quality reference for annotation. scPANDA utilizes a three-layer inference approach, progressively refining cell types from broad compartments to specific clusters. Iterative clustering and harmonization processes were employed to maintain cell type purity throughout the analysis. Furthermore, the performance of scPANDA was evaluated in three external datasets.
Results: The atlas was structured hierarchically, consisting of 16 compartments, 54 classes, 4,460 low-level clusters (pd_cc_cl_tfs), and 611 high-level clusters (pmid_cts). Robust performance of the tool was demonstrated in annotating diverse immune scRNA-seq datasets, analyzing immune-tumor coexisting clusters in renal cell carcinoma, and identifying conserved cell clusters across species.
Conclusions: scPANDA exemplifies effective reference mapping with a large-scale atlas, enhancing the accuracy and reliability of blood cell type identification.