Analysis of composition of microbiomes: a novel method for studying microbial composition.

Microbial Ecology in Health and Disease Pub Date : 2015-05-29 eCollection Date: 2015-01-01 DOI:10.3402/mehd.v26.27663
Siddhartha Mandal, Will Van Treuren, Richard A White, Merete Eggesbø, Rob Knight, Shyamal D Peddada
{"title":"Analysis of composition of microbiomes: a novel method for studying microbial composition.","authors":"Siddhartha Mandal,&nbsp;Will Van Treuren,&nbsp;Richard A White,&nbsp;Merete Eggesbø,&nbsp;Rob Knight,&nbsp;Shyamal D Peddada","doi":"10.3402/mehd.v26.27663","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Understanding the factors regulating our microbiota is important but requires appropriate statistical methodology. When comparing two or more populations most existing approaches either discount the underlying compositional structure in the microbiome data or use probability models such as the multinomial and Dirichlet-multinomial distributions, which may impose a correlation structure not suitable for microbiome data.</p><p><strong>Objective: </strong>To develop a methodology that accounts for compositional constraints to reduce false discoveries in detecting differentially abundant taxa at an ecosystem level, while maintaining high statistical power.</p><p><strong>Methods: </strong>We introduced a novel statistical framework called analysis of composition of microbiomes (ANCOM). ANCOM accounts for the underlying structure in the data and can be used for comparing the composition of microbiomes in two or more populations. ANCOM makes no distributional assumptions and can be implemented in a linear model framework to adjust for covariates as well as model longitudinal data. ANCOM also scales well to compare samples involving thousands of taxa.</p><p><strong>Results: </strong>We compared the performance of ANCOM to the standard t-test and a recently published methodology called Zero Inflated Gaussian (ZIG) methodology (1) for drawing inferences on the mean taxa abundance in two or more populations. ANCOM controlled the false discovery rate (FDR) at the desired nominal level while also improving power, whereas the t-test and ZIG had inflated FDRs, in some instances as high as 68% for the t-test and 60% for ZIG. We illustrate the performance of ANCOM using two publicly available microbial datasets in the human gut, demonstrating its general applicability to testing hypotheses about compositional differences in microbial communities.</p><p><strong>Conclusion: </strong>Accounting for compositionality using log-ratio analysis results in significantly improved inference in microbiota survey data.</p>","PeriodicalId":18568,"journal":{"name":"Microbial Ecology in Health and Disease","volume":"26 ","pages":"27663"},"PeriodicalIF":0.0000,"publicationDate":"2015-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3402/mehd.v26.27663","citationCount":"1376","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbial Ecology in Health and Disease","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3402/mehd.v26.27663","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2015/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1376

Abstract

Background: Understanding the factors regulating our microbiota is important but requires appropriate statistical methodology. When comparing two or more populations most existing approaches either discount the underlying compositional structure in the microbiome data or use probability models such as the multinomial and Dirichlet-multinomial distributions, which may impose a correlation structure not suitable for microbiome data.

Objective: To develop a methodology that accounts for compositional constraints to reduce false discoveries in detecting differentially abundant taxa at an ecosystem level, while maintaining high statistical power.

Methods: We introduced a novel statistical framework called analysis of composition of microbiomes (ANCOM). ANCOM accounts for the underlying structure in the data and can be used for comparing the composition of microbiomes in two or more populations. ANCOM makes no distributional assumptions and can be implemented in a linear model framework to adjust for covariates as well as model longitudinal data. ANCOM also scales well to compare samples involving thousands of taxa.

Results: We compared the performance of ANCOM to the standard t-test and a recently published methodology called Zero Inflated Gaussian (ZIG) methodology (1) for drawing inferences on the mean taxa abundance in two or more populations. ANCOM controlled the false discovery rate (FDR) at the desired nominal level while also improving power, whereas the t-test and ZIG had inflated FDRs, in some instances as high as 68% for the t-test and 60% for ZIG. We illustrate the performance of ANCOM using two publicly available microbial datasets in the human gut, demonstrating its general applicability to testing hypotheses about compositional differences in microbial communities.

Conclusion: Accounting for compositionality using log-ratio analysis results in significantly improved inference in microbiota survey data.

Abstract Image

Abstract Image

Abstract Image

微生物组组成分析:一种研究微生物组成的新方法。
背景:了解调节我们的微生物群的因素很重要,但需要适当的统计方法。当比较两个或多个种群时,大多数现有方法要么忽略微生物组数据中的潜在组成结构,要么使用概率模型,如多项分布和dirichlet -多项分布,这可能会导致不适合微生物组数据的相关结构。目的:建立一种考虑成分限制的方法,以减少在生态系统水平上检测差异丰富分类群时的错误发现,同时保持较高的统计能力。方法:我们引入了一种新的统计框架,称为微生物组组成分析(ANCOM)。ANCOM解释了数据中的底层结构,可用于比较两个或多个种群中微生物组的组成。ANCOM不做任何分布假设,可以在线性模型框架中实现,以调整协变量以及模型纵向数据。ANCOM还可以很好地比较涉及数千个分类群的样本。结果:我们将ANCOM的性能与标准t检验和最近发表的零膨胀高斯(ZIG)方法(1)进行了比较,该方法用于推断两个或多个种群的平均分类群丰度。ANCOM将错误发现率(FDR)控制在理想的名义水平,同时也提高了功率,而t检验和ZIG则夸大了FDR,在某些情况下,t检验高达68%,ZIG高达60%。我们使用两个公开的人类肠道微生物数据集来说明ANCOM的性能,证明其在测试微生物群落组成差异假设方面的普遍适用性。结论:利用对数比分析对微生物群调查数据的组合性进行核算,显著提高了推断能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
审稿时长
21 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信