Unfolding and de-confounding: biologically meaningful causal inference from longitudinal multi-omic networks using METALICA.

IF 5 2区 生物学 Q1 MICROBIOLOGY
mSystems Pub Date : 2024-10-22 Epub Date: 2024-09-06 DOI:10.1128/msystems.01303-23
Daniel Ruiz-Perez, Isabella Gimon, Musfiqur Sazal, Kalai Mathee, Giri Narasimhan
{"title":"Unfolding and de-confounding: biologically meaningful causal inference from longitudinal multi-omic networks using METALICA.","authors":"Daniel Ruiz-Perez, Isabella Gimon, Musfiqur Sazal, Kalai Mathee, Giri Narasimhan","doi":"10.1128/msystems.01303-23","DOIUrl":null,"url":null,"abstract":"<p><p>A key challenge in the analysis of microbiome data is the integration of multi-omic datasets and the discovery of interactions between microbial taxa, their expressed genes, and the metabolites they consume and/or produce. In an effort to improve the state of the art in inferring biologically meaningful multi-omic interactions, we sought to address some of the most fundamental issues in causal inference from longitudinal multi-omics microbiome data sets. We developed METALICA, a suite of tools and techniques that can infer interactions between microbiome entities. METALICA introduces novel unrolling and de-confounding techniques used to uncover multi-omic entities that are believed to act as confounders for some of the relationships that may be inferred using standard causal inferencing tools. The results lend support to predictions about biological models and processes by which microbial taxa interact with each other in a microbiome. The unrolling process helps identify putative intermediaries (genes and/or metabolites) to explain the interactions between microbes; the de-confounding process identifies putative common causes that may lead to spurious relationships to be inferred. METALICA was applied to the networks inferred by existing causal discovery, and network inference algorithms were applied to a multi-omics data set resulting from a longitudinal study of IBD microbiomes. The most significant unrollings and de-confoundings were manually validated using the existing literature and databases.</p><p><strong>Importance: </strong>We have developed a suite of tools and techniques capable of inferring interactions between microbiome entities. METALICA introduces novel techniques called unrolling and de-confounding that are employed to uncover multi-omic entities considered to be confounders for some of the relationships that may be inferred using standard causal inferencing tools. To evaluate our method, we conducted tests on the inflammatory bowel disease (IBD) dataset from the iHMP longitudinal study, which we pre-processed in accordance with our previous work. From this dataset, we generated various subsets, encompassing different combinations of metagenomics, metabolomics, and metatranscriptomics datasets. Using these multi-omics datasets, we demonstrate how the unrolling process aids in the identification of putative intermediaries (genes and/or metabolites) to explain the interactions between microbes. Additionally, the de-confounding process identifies potential common causes that may give rise to spurious relationships to be inferred. The most significant unrollings and de-confoundings were manually validated using the existing literature and databases.</p>","PeriodicalId":18819,"journal":{"name":"mSystems","volume":null,"pages":null},"PeriodicalIF":5.0000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11494969/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"mSystems","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/msystems.01303-23","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/9/6 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

A key challenge in the analysis of microbiome data is the integration of multi-omic datasets and the discovery of interactions between microbial taxa, their expressed genes, and the metabolites they consume and/or produce. In an effort to improve the state of the art in inferring biologically meaningful multi-omic interactions, we sought to address some of the most fundamental issues in causal inference from longitudinal multi-omics microbiome data sets. We developed METALICA, a suite of tools and techniques that can infer interactions between microbiome entities. METALICA introduces novel unrolling and de-confounding techniques used to uncover multi-omic entities that are believed to act as confounders for some of the relationships that may be inferred using standard causal inferencing tools. The results lend support to predictions about biological models and processes by which microbial taxa interact with each other in a microbiome. The unrolling process helps identify putative intermediaries (genes and/or metabolites) to explain the interactions between microbes; the de-confounding process identifies putative common causes that may lead to spurious relationships to be inferred. METALICA was applied to the networks inferred by existing causal discovery, and network inference algorithms were applied to a multi-omics data set resulting from a longitudinal study of IBD microbiomes. The most significant unrollings and de-confoundings were manually validated using the existing literature and databases.

Importance: We have developed a suite of tools and techniques capable of inferring interactions between microbiome entities. METALICA introduces novel techniques called unrolling and de-confounding that are employed to uncover multi-omic entities considered to be confounders for some of the relationships that may be inferred using standard causal inferencing tools. To evaluate our method, we conducted tests on the inflammatory bowel disease (IBD) dataset from the iHMP longitudinal study, which we pre-processed in accordance with our previous work. From this dataset, we generated various subsets, encompassing different combinations of metagenomics, metabolomics, and metatranscriptomics datasets. Using these multi-omics datasets, we demonstrate how the unrolling process aids in the identification of putative intermediaries (genes and/or metabolites) to explain the interactions between microbes. Additionally, the de-confounding process identifies potential common causes that may give rise to spurious relationships to be inferred. The most significant unrollings and de-confoundings were manually validated using the existing literature and databases.

展开和去混淆:利用 METALICA 从纵向多原子网络中进行具有生物学意义的因果推断。
微生物组数据分析中的一个关键挑战是整合多组学数据集以及发现微生物类群、其表达基因及其消耗和/或产生的代谢物之间的相互作用。为了提高推断具有生物学意义的多组学相互作用的技术水平,我们试图解决从纵向多组学微生物组数据集中进行因果推断的一些最基本问题。我们开发了 METALICA,这是一套能够推断微生物组实体间相互作用的工具和技术。METALICA 引入了新颖的开卷和去混淆技术,用于发现被认为是标准因果推断工具可能推断出的某些关系的混淆因素的多组学实体。研究结果支持对微生物群中微生物类群相互作用的生物模型和过程的预测。解卷过程有助于识别假定的中间体(基因和/或代谢物),以解释微生物之间的相互作用;去混淆过程则可识别可能导致推断出虚假关系的假定共同原因。METALICA 被应用于现有因果关系发现所推断的网络,而网络推断算法则被应用于 IBD 微生物组纵向研究产生的多组学数据集。我们利用现有的文献和数据库对最重要的开卷和去卷进行了人工验证:我们开发了一套工具和技术,能够推断微生物组实体之间的相互作用。METALICA 引入了称为 "解卷 "和 "去混淆 "的新技术,用于发现被认为是某些关系混淆因素的多微生物组实体,而这些关系可能是使用标准因果推断工具推断出来的。为了评估我们的方法,我们在 iHMP 纵向研究的炎症性肠病(IBD)数据集上进行了测试。我们从该数据集中生成了各种子集,包括元基因组学、代谢组学和元转录组学数据集的不同组合。利用这些多组学数据集,我们展示了开卷过程如何帮助识别推定的中间体(基因和/或代谢物),以解释微生物之间的相互作用。此外,去混淆过程还能识别可能导致推断出虚假关系的潜在共同原因。利用现有文献和数据库对最重要的解卷和去重进行了人工验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
mSystems
mSystems Biochemistry, Genetics and Molecular Biology-Biochemistry
CiteScore
10.50
自引率
3.10%
发文量
308
审稿时长
13 weeks
期刊介绍: mSystems™ will publish preeminent work that stems from applying technologies for high-throughput analyses to achieve insights into the metabolic and regulatory systems at the scale of both the single cell and microbial communities. The scope of mSystems™ encompasses all important biological and biochemical findings drawn from analyses of large data sets, as well as new computational approaches for deriving these insights. mSystems™ will welcome submissions from researchers who focus on the microbiome, genomics, metagenomics, transcriptomics, metabolomics, proteomics, glycomics, bioinformatics, and computational microbiology. mSystems™ will provide streamlined decisions, while carrying on ASM''s tradition of rigorous peer review.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信