Yan Li, Xiang Zhou, Rui Chen, Xianyang Zhang, Hongyuan Cao
{"title":"STAREG: Statistical replicability analysis of high throughput experiments with applications to spatial transcriptomic studies.","authors":"Yan Li, Xiang Zhou, Rui Chen, Xianyang Zhang, Hongyuan Cao","doi":"10.1371/journal.pgen.1011423","DOIUrl":null,"url":null,"abstract":"<p><p>Replicable signals from different yet conceptually related studies provide stronger scientific evidence and more powerful inference. We introduce STAREG, a statistical method for replicability analysis of high throughput experiments, and apply it to analyze spatial transcriptomic studies. STAREG uses summary statistics from multiple studies of high throughput experiments and models the the joint distribution of p-values accounting for the heterogeneity of different studies. It effectively controls the false discovery rate (FDR) and has higher power by information borrowing. Moreover, it provides different rankings of important genes. With the EM algorithm in combination with pool-adjacent-violator-algorithm (PAVA), STAREG is scalable to datasets with millions of genes without any tuning parameters. Analyzing two pairs of spatially resolved transcriptomic datasets, we are able to make biological discoveries that otherwise cannot be obtained by using existing methods.</p>","PeriodicalId":49007,"journal":{"name":"PLoS Genetics","volume":"20 10","pages":"e1011423"},"PeriodicalIF":4.0000,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11478871/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1371/journal.pgen.1011423","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Replicable signals from different yet conceptually related studies provide stronger scientific evidence and more powerful inference. We introduce STAREG, a statistical method for replicability analysis of high throughput experiments, and apply it to analyze spatial transcriptomic studies. STAREG uses summary statistics from multiple studies of high throughput experiments and models the the joint distribution of p-values accounting for the heterogeneity of different studies. It effectively controls the false discovery rate (FDR) and has higher power by information borrowing. Moreover, it provides different rankings of important genes. With the EM algorithm in combination with pool-adjacent-violator-algorithm (PAVA), STAREG is scalable to datasets with millions of genes without any tuning parameters. Analyzing two pairs of spatially resolved transcriptomic datasets, we are able to make biological discoveries that otherwise cannot be obtained by using existing methods.
来自不同但概念相关的研究的可复制信号可提供更有力的科学证据和推论。我们介绍了 STAREG--一种用于高通量实验可重复性分析的统计方法,并将其应用于分析空间转录组研究。STAREG 使用来自多个高通量实验研究的汇总统计量,并根据不同研究的异质性对 p 值的联合分布进行建模。它能有效控制错误发现率(FDR),并通过信息借用获得更高的功率。此外,它还提供了不同的重要基因排名。STAREG 将 EM 算法与池邻接-违反者算法(PAVA)相结合,无需任何调整参数即可扩展到数百万个基因的数据集。通过分析两对空间分辨率转录组数据集,我们能够发现现有方法无法发现的生物学现象。
期刊介绍:
PLOS Genetics is run by an international Editorial Board, headed by the Editors-in-Chief, Greg Barsh (HudsonAlpha Institute of Biotechnology, and Stanford University School of Medicine) and Greg Copenhaver (The University of North Carolina at Chapel Hill).
Articles published in PLOS Genetics are archived in PubMed Central and cited in PubMed.