Sophia A Campione, Christina M Kelliher, Cullen Roth, Chun Yi Cho, Anastasia Deckard, Francis Motta, Steven B Haase
{"title":"Identification and correction of time-series transcriptomic anomalies","authors":"Sophia A Campione, Christina M Kelliher, Cullen Roth, Chun Yi Cho, Anastasia Deckard, Francis Motta, Steven B Haase","doi":"10.1093/nar/gkaf524","DOIUrl":null,"url":null,"abstract":"Transcriptomic analyses performed in time series have uncovered many important insights into dynamic biological processes such as circadian rhythms, cellular developmental cycles, and the cell cycle. Some of these studies have revealed transcriptomic artifacts (STRIPEs), characterized by substantial changes in transcript levels across the transcriptome between a time point and its temporal neighbors. These changes are unlikely to reflect underlying biology as the magnitude of the change is too large to occur within the time interval and because every gene in the time point exhibits a substantial change. Furthermore, STRIPEs occur across species exhibiting different biology, do not occur in the same phase across replicate time-series experiments, and can vary between technical replicates of a single time point. Here, we demonstrate STRIPEs in five time-series transcriptomic datasets across different species, biological processes, and timescales. We describe a computational method to detect STRIPEs in time series using the Kolmogorov–Smirnov statistical test, allowing for unbiased, user-friendly detection of STRIPEs. Finally, we present three methods for STRIPE correction and demonstrate their efficacy. Using periodicity analysis to identify periodic genes, we find nearly 600 genes changed in periodicity labeling following successful STRIPE correction, indicating the large impact of STRIPE removal on downstream analysis.","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":"64 1","pages":""},"PeriodicalIF":16.6000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nucleic Acids Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/nar/gkaf524","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Transcriptomic analyses performed in time series have uncovered many important insights into dynamic biological processes such as circadian rhythms, cellular developmental cycles, and the cell cycle. Some of these studies have revealed transcriptomic artifacts (STRIPEs), characterized by substantial changes in transcript levels across the transcriptome between a time point and its temporal neighbors. These changes are unlikely to reflect underlying biology as the magnitude of the change is too large to occur within the time interval and because every gene in the time point exhibits a substantial change. Furthermore, STRIPEs occur across species exhibiting different biology, do not occur in the same phase across replicate time-series experiments, and can vary between technical replicates of a single time point. Here, we demonstrate STRIPEs in five time-series transcriptomic datasets across different species, biological processes, and timescales. We describe a computational method to detect STRIPEs in time series using the Kolmogorov–Smirnov statistical test, allowing for unbiased, user-friendly detection of STRIPEs. Finally, we present three methods for STRIPE correction and demonstrate their efficacy. Using periodicity analysis to identify periodic genes, we find nearly 600 genes changed in periodicity labeling following successful STRIPE correction, indicating the large impact of STRIPE removal on downstream analysis.
期刊介绍:
Nucleic Acids Research (NAR) is a scientific journal that publishes research on various aspects of nucleic acids and proteins involved in nucleic acid metabolism and interactions. It covers areas such as chemistry and synthetic biology, computational biology, gene regulation, chromatin and epigenetics, genome integrity, repair and replication, genomics, molecular biology, nucleic acid enzymes, RNA, and structural biology. The journal also includes a Survey and Summary section for brief reviews. Additionally, each year, the first issue is dedicated to biological databases, and an issue in July focuses on web-based software resources for the biological community. Nucleic Acids Research is indexed by several services including Abstracts on Hygiene and Communicable Diseases, Animal Breeding Abstracts, Agricultural Engineering Abstracts, Agbiotech News and Information, BIOSIS Previews, CAB Abstracts, and EMBASE.