Identifying stationary microbial interaction networks based on irregularly spaced longitudinal 16S rRNA gene sequencing data

Jie Zhou, Jiang Gui, W. Viles, Haobin Chen, Siting Li, J. Madan, Modupe O. Coker, A. Hoen
{"title":"Identifying stationary microbial interaction networks based on irregularly spaced longitudinal 16S rRNA gene sequencing data","authors":"Jie Zhou, Jiang Gui, W. Viles, Haobin Chen, Siting Li, J. Madan, Modupe O. Coker, A. Hoen","doi":"10.3389/frmbi.2024.1366948","DOIUrl":null,"url":null,"abstract":"The microbial interactions within the human microbiome are complex, and few methods are available to identify these interactions within a longitudinal microbial abundance framework. Existing methods typically impose restrictive constraints, such as requiring long sequences and equal spacing, on the data format which in many cases are violated.To identify microbial interaction networks (MINs) with general longitudinal data settings, we propose a stationary Gaussian graphical model (SGGM) based on 16S rRNA gene sequencing data. In the SGGM, data can be arbitrarily spaced, and there are no restrictions on the length of data sequences from a single subject. Based on the SGGM, EM -type algorithms are devised to compute the L1-penalized maximum likelihood estimate of MINs. The algorithms employ the classical graphical LASSO algorithm as the building block and can be implemented efficiently. Extensive simulation studies show that the proposed algorithms can significantly outperform the conventional algorithms if the correlations among the longitudinal data are reasonably high. When the assumptions in the SGGM areviolated, e.g., zero inflation or data from heterogeneous microbial communities, the proposed algorithms still demonstrate robustness and perform better than the other existing algorithms. The algorithms are applied to a 16S rRNA gene sequencing data set from patients with cystic fibrosis. The results demonstrate strong evidence of an association between the MINs and the phylogenetic tree, indicating that the genetically related taxa tend to have more/stronger interactions. These results strengthen the existing findings in literature. The proposed algorithms can potentially be used to explore the network structure in genome, metabolome etc. as well.","PeriodicalId":73089,"journal":{"name":"Frontiers in microbiomes","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in microbiomes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frmbi.2024.1366948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The microbial interactions within the human microbiome are complex, and few methods are available to identify these interactions within a longitudinal microbial abundance framework. Existing methods typically impose restrictive constraints, such as requiring long sequences and equal spacing, on the data format which in many cases are violated.To identify microbial interaction networks (MINs) with general longitudinal data settings, we propose a stationary Gaussian graphical model (SGGM) based on 16S rRNA gene sequencing data. In the SGGM, data can be arbitrarily spaced, and there are no restrictions on the length of data sequences from a single subject. Based on the SGGM, EM -type algorithms are devised to compute the L1-penalized maximum likelihood estimate of MINs. The algorithms employ the classical graphical LASSO algorithm as the building block and can be implemented efficiently. Extensive simulation studies show that the proposed algorithms can significantly outperform the conventional algorithms if the correlations among the longitudinal data are reasonably high. When the assumptions in the SGGM areviolated, e.g., zero inflation or data from heterogeneous microbial communities, the proposed algorithms still demonstrate robustness and perform better than the other existing algorithms. The algorithms are applied to a 16S rRNA gene sequencing data set from patients with cystic fibrosis. The results demonstrate strong evidence of an association between the MINs and the phylogenetic tree, indicating that the genetically related taxa tend to have more/stronger interactions. These results strengthen the existing findings in literature. The proposed algorithms can potentially be used to explore the network structure in genome, metabolome etc. as well.
基于不规则间距纵向 16S rRNA 基因测序数据识别固定微生物相互作用网络
人类微生物组中的微生物相互作用非常复杂,目前很少有方法能在纵向微生物丰度框架内识别这些相互作用。现有的方法通常会对数据格式施加限制性约束,如要求长序列和等间距,而这些约束在很多情况下都会被违反。为了利用一般的纵向数据设置来识别微生物相互作用网络(MINs),我们提出了一种基于 16S rRNA 基因测序数据的静态高斯图形模型(SGGM)。在 SGGM 中,数据可以任意间隔,单个受试者的数据序列长度也不受限制。在 SGGM 的基础上,设计了 EM 型算法来计算 MINs 的 L1 惩罚最大似然估计值。这些算法采用经典的图形 LASSO 算法作为构建模块,可以高效地实现。广泛的仿真研究表明,如果纵向数据之间的相关性相当高,所提出的算法可以明显优于传统算法。当 SGGM 中的假设条件被破坏时,例如零膨胀或来自异质微生物群落的数据,所提出的算法仍然表现出鲁棒性,其性能优于其他现有算法。这些算法被应用于囊性纤维化患者的 16S rRNA 基因测序数据集。结果表明,MINs 与系统发生树之间的关联性很强,表明基因相关的类群往往有更多/更强的相互作用。这些结果加强了现有文献的研究结果。所提出的算法也可用于探索基因组、代谢组等的网络结构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信