Xiran Chen , Sha Lin , Xiaofeng Chen , Weikai Li , Yifei Li
{"title":"Timestamp calibration for time-series single cell RNA-seq expression data","authors":"Xiran Chen , Sha Lin , Xiaofeng Chen , Weikai Li , Yifei Li","doi":"10.1016/j.jmb.2025.169021","DOIUrl":null,"url":null,"abstract":"<div><div>Timestamp automatic annotation (TAA) is a crucial procedure for analyzing time-series scRNA-seq data, as they unveil dynamic biological developments and cell regeneration processes. However, current TAA methods heavily rely on manual timestamps, often overlooking their reliability. This oversight can significantly degrade the performance of timestamp automatic annotation due to noisy timestamps. Nevertheless, the current approach for addressing this issue tends to select less critical cleaned samples for timestamp calibration. To tackle this challenge, we have developed a novel timestamp calibration model called ScPace for handling noisy labeled time-series scRNA-seq data. This approach incorporates a latent variable indicator within a base classifier instead of probability sampling to detect noisy samples effectively. To validate our proposed method, we conducted experiments on both simulated and real time-series scRNA-seq datasets. Cross validation experiments with different artificial mislabeling rates demonstrate that ScPace outperforms previous approaches. Furthermore, after calibrating the timestamps of the original time-series scRNA-seq data using our method, we performed supervised pseudotime analysis, revealing that ScPace enhances its performance significantly. These findings suggest that ScPace is an effective tool for timestamp calibration by enabling reclassification and deletion of detected noisy labeled samples while maintaining robustness across diverse ranges of time-series scRNA-seq datasets. The source code is available at https://github.com/OPUS-Lightphenexx/ScPace.</div></div>","PeriodicalId":369,"journal":{"name":"Journal of Molecular Biology","volume":"437 9","pages":"Article 169021"},"PeriodicalIF":4.7000,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Biology","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022283625000877","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Timestamp automatic annotation (TAA) is a crucial procedure for analyzing time-series scRNA-seq data, as they unveil dynamic biological developments and cell regeneration processes. However, current TAA methods heavily rely on manual timestamps, often overlooking their reliability. This oversight can significantly degrade the performance of timestamp automatic annotation due to noisy timestamps. Nevertheless, the current approach for addressing this issue tends to select less critical cleaned samples for timestamp calibration. To tackle this challenge, we have developed a novel timestamp calibration model called ScPace for handling noisy labeled time-series scRNA-seq data. This approach incorporates a latent variable indicator within a base classifier instead of probability sampling to detect noisy samples effectively. To validate our proposed method, we conducted experiments on both simulated and real time-series scRNA-seq datasets. Cross validation experiments with different artificial mislabeling rates demonstrate that ScPace outperforms previous approaches. Furthermore, after calibrating the timestamps of the original time-series scRNA-seq data using our method, we performed supervised pseudotime analysis, revealing that ScPace enhances its performance significantly. These findings suggest that ScPace is an effective tool for timestamp calibration by enabling reclassification and deletion of detected noisy labeled samples while maintaining robustness across diverse ranges of time-series scRNA-seq datasets. The source code is available at https://github.com/OPUS-Lightphenexx/ScPace.
期刊介绍:
Journal of Molecular Biology (JMB) provides high quality, comprehensive and broad coverage in all areas of molecular biology. The journal publishes original scientific research papers that provide mechanistic and functional insights and report a significant advance to the field. The journal encourages the submission of multidisciplinary studies that use complementary experimental and computational approaches to address challenging biological questions.
Research areas include but are not limited to: Biomolecular interactions, signaling networks, systems biology; Cell cycle, cell growth, cell differentiation; Cell death, autophagy; Cell signaling and regulation; Chemical biology; Computational biology, in combination with experimental studies; DNA replication, repair, and recombination; Development, regenerative biology, mechanistic and functional studies of stem cells; Epigenetics, chromatin structure and function; Gene expression; Membrane processes, cell surface proteins and cell-cell interactions; Methodological advances, both experimental and theoretical, including databases; Microbiology, virology, and interactions with the host or environment; Microbiota mechanistic and functional studies; Nuclear organization; Post-translational modifications, proteomics; Processing and function of biologically important macromolecules and complexes; Molecular basis of disease; RNA processing, structure and functions of non-coding RNAs, transcription; Sorting, spatiotemporal organization, trafficking; Structural biology; Synthetic biology; Translation, protein folding, chaperones, protein degradation and quality control.