J. Pura, Xuechan Li, Cliburn Chan, Jichun Xie
{"title":"TEAM: A multiple testing algorithm on the aggregation tree for flow cytometry analysis","authors":"J. Pura, Xuechan Li, Cliburn Chan, Jichun Xie","doi":"10.1214/22-aoas1645","DOIUrl":null,"url":null,"abstract":"In immunology studies, flow cytometry is a commonly used multivariate single-cell assay. One key goal in flow cytometry analysis is to detect the immune cells responsive to certain stimuli. Statistically, this problem can be translated into comparing two protein expression probability density functions (pdfs) before and after the stimulus; the goal is to pinpoint the regions where these two pdfs di er. Further screening of these di erential regions can be performed to identify enriched sets of responsive cells. In this paper, we model identifying di erential density regions as a multiple testing problem. First, we partition the sample space into small bins. In each bin, we form a hypothesis to test the existence of di erential pdfs. Second, we develop a novel multiple testing method, called TEAM (Testing on the Aggregation tree Method), to identify those bins that harbor di erential pdfs while controlling the false discovery rate (FDR) under the desired level. TEAM embeds the testing procedure into an aggregation tree to test from fineto coarse-resolution. The procedure achieves the statistical goal of pinpointing density di erences to the smallest possible regions. TEAM is computationally e cient, capable of analyzing large flow cytometry data sets in much shorter time compared with competing methods. We applied TEAM and competing methods on a flow cytometry data set to identify T cells responsive to the cytomegalovirus (CMV)-pp65 antigen stimulation. With additional downstream screening, TEAM successfully identified enriched sets containing monofunctional, bifunctional, and polyfunctional T cells. Competing methods either did not finish in a reasonable time frame or provided less interpretable results. Numerical simulations and theoretical justifications demonstrate that TEAM has asymptotically valid, powerful, and robust performance. Overall, TEAM is a computationally e cient and statistically powerful algorithm that can yield meaningful biological insights in flow cytometry studies.","PeriodicalId":188068,"journal":{"name":"The Annals of Applied Statistics","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Annals of Applied Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/22-aoas1645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在免疫学研究中,流式细胞术是一种常用的多变量单细胞测定方法。流式细胞术分析的一个关键目标是检测免疫细胞对某些刺激的反应。在统计学上,这个问题可以转化为比较刺激前后的两个蛋白表达概率密度函数(pdfs);目标是确定这两个PDFS所在的区域。进一步筛选这些差异区域可以进行鉴定,以丰富的反应细胞组。在本文中,我们将识别差分密度区域建模为一个多重测试问题。首先,我们将样本空间划分为小的箱子。在每个箱子中,我们形成一个假设来检验差分pdf的存在性。其次,我们开发了一种新的多重测试方法,称为TEAM(聚集树测试方法),以识别包含差分pdf的箱子,同时将错误发现率(FDR)控制在期望的水平之下。TEAM将测试过程嵌入到一个聚合树中,从精细到粗分辨率进行测试。该程序实现了精确定位密度差异到尽可能小的区域的统计目标。TEAM计算效率高,与竞争对手的方法相比,能够在更短的时间内分析大量流式细胞术数据集。我们在流式细胞术数据集上应用TEAM和竞争方法来鉴定对巨细胞病毒(CMV)-pp65抗原刺激有反应的T细胞。通过额外的下游筛选,TEAM成功鉴定出含有单功能、双功能和多功能T细胞的富集组。相互竞争的方法要么没有在合理的时间框架内完成,要么提供的可解释性较差的结果。数值模拟和理论验证表明,该算法具有渐近有效、强大和稳健的性能。总的来说,TEAM是一种计算效率高、统计功能强大的算法,可以在流式细胞术研究中产生有意义的生物学见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
TEAM: A multiple testing algorithm on the aggregation tree for flow cytometry analysis
In immunology studies, flow cytometry is a commonly used multivariate single-cell assay. One key goal in flow cytometry analysis is to detect the immune cells responsive to certain stimuli. Statistically, this problem can be translated into comparing two protein expression probability density functions (pdfs) before and after the stimulus; the goal is to pinpoint the regions where these two pdfs di er. Further screening of these di erential regions can be performed to identify enriched sets of responsive cells. In this paper, we model identifying di erential density regions as a multiple testing problem. First, we partition the sample space into small bins. In each bin, we form a hypothesis to test the existence of di erential pdfs. Second, we develop a novel multiple testing method, called TEAM (Testing on the Aggregation tree Method), to identify those bins that harbor di erential pdfs while controlling the false discovery rate (FDR) under the desired level. TEAM embeds the testing procedure into an aggregation tree to test from fineto coarse-resolution. The procedure achieves the statistical goal of pinpointing density di erences to the smallest possible regions. TEAM is computationally e cient, capable of analyzing large flow cytometry data sets in much shorter time compared with competing methods. We applied TEAM and competing methods on a flow cytometry data set to identify T cells responsive to the cytomegalovirus (CMV)-pp65 antigen stimulation. With additional downstream screening, TEAM successfully identified enriched sets containing monofunctional, bifunctional, and polyfunctional T cells. Competing methods either did not finish in a reasonable time frame or provided less interpretable results. Numerical simulations and theoretical justifications demonstrate that TEAM has asymptotically valid, powerful, and robust performance. Overall, TEAM is a computationally e cient and statistically powerful algorithm that can yield meaningful biological insights in flow cytometry studies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信