Bayesian topology inference of regulatory networks under partial observability

IF 3.2 Q3 Mathematics
Mohammad Alali, Mahdi Imani
{"title":"Bayesian topology inference of regulatory networks under partial observability","authors":"Mohammad Alali,&nbsp;Mahdi Imani","doi":"10.1016/j.rico.2025.100570","DOIUrl":null,"url":null,"abstract":"<div><div>Biological systems, such as microbial communities in metagenomics and gene regulatory networks (GRNs) in genomics, are composed of a vast number of interacting components observed through inherently noisy data. These systems play a critical role in understanding fundamental biological processes, including gene regulation, microbial interactions, and cellular dynamics. For example, microbial communities involve complex interactions between microbes, bacteria, genes, and small molecules observed through omics data, while GRNs consist of numerous interacting genes observed via various gene-expression technologies. However, reconstructing the topology of such networks poses significant challenges due to their large scale, high dimensionality, and the presence of noise. Existing inference techniques often struggle with scalability, interpretability, and overfitting, making them unsuitable for analyzing large and complex biological systems. To overcome these challenges, this paper proposes a Bayesian topology optimization framework for efficient and scalable inference of regulatory networks modeled as partially-observed Boolean dynamical systems (POBDS). The method combines the Boolean Kalman Filter (BKF) as an optimal estimator for POBDS, with Bayesian optimization, which employs Gaussian Process regression and a topology-inspired kernel function to model the log-likelihood function. Numerical experiments demonstrate the superior performance of our framework. In the p53-MDM2 network, our method accurately infers topology with 8 and 16 unknown regulations, achieving higher log-likelihood with 100 and 200 evaluations, respectively. For the mammalian cell cycle network with 10 unknown regulations, proposed method identifies the correct topology among 59,049 possibilities with lower error and faster convergence.</div></div>","PeriodicalId":34733,"journal":{"name":"Results in Control and Optimization","volume":"19 ","pages":"Article 100570"},"PeriodicalIF":3.2000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Results in Control and Optimization","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666720725000566","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0

Abstract

Biological systems, such as microbial communities in metagenomics and gene regulatory networks (GRNs) in genomics, are composed of a vast number of interacting components observed through inherently noisy data. These systems play a critical role in understanding fundamental biological processes, including gene regulation, microbial interactions, and cellular dynamics. For example, microbial communities involve complex interactions between microbes, bacteria, genes, and small molecules observed through omics data, while GRNs consist of numerous interacting genes observed via various gene-expression technologies. However, reconstructing the topology of such networks poses significant challenges due to their large scale, high dimensionality, and the presence of noise. Existing inference techniques often struggle with scalability, interpretability, and overfitting, making them unsuitable for analyzing large and complex biological systems. To overcome these challenges, this paper proposes a Bayesian topology optimization framework for efficient and scalable inference of regulatory networks modeled as partially-observed Boolean dynamical systems (POBDS). The method combines the Boolean Kalman Filter (BKF) as an optimal estimator for POBDS, with Bayesian optimization, which employs Gaussian Process regression and a topology-inspired kernel function to model the log-likelihood function. Numerical experiments demonstrate the superior performance of our framework. In the p53-MDM2 network, our method accurately infers topology with 8 and 16 unknown regulations, achieving higher log-likelihood with 100 and 200 evaluations, respectively. For the mammalian cell cycle network with 10 unknown regulations, proposed method identifies the correct topology among 59,049 possibilities with lower error and faster convergence.
部分可观测条件下调控网络的贝叶斯拓扑推断
生物系统,如宏基因组学中的微生物群落和基因组学中的基因调控网络(grn),由大量通过固有噪声数据观察到的相互作用成分组成。这些系统在理解基本的生物过程中起着关键作用,包括基因调控、微生物相互作用和细胞动力学。例如,微生物群落涉及微生物、细菌、基因和通过组学数据观察到的小分子之间复杂的相互作用,而grn由通过各种基因表达技术观察到的众多相互作用基因组成。然而,由于这些网络的大规模、高维和存在噪声,重建这些网络的拓扑结构面临着重大挑战。现有的推理技术经常与可扩展性、可解释性和过拟合作斗争,使它们不适合分析大型和复杂的生物系统。为了克服这些挑战,本文提出了一个贝叶斯拓扑优化框架,用于部分观测布尔动力系统(POBDS)模型的有效和可扩展的调节网络推理。该方法将布尔卡尔曼滤波(BKF)作为POBDS的最优估计器,与贝叶斯优化相结合,采用高斯过程回归和拓扑启发核函数对对数似然函数进行建模。数值实验证明了该框架的优越性能。在p53-MDM2网络中,我们的方法准确地推断出具有8个和16个未知规则的拓扑,分别获得了100次和200次评估的更高的对数似然。对于含有10个未知规则的哺乳动物细胞周期网络,该方法在59049种可能性中识别出正确的拓扑结构,误差更小,收敛速度更快。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Results in Control and Optimization
Results in Control and Optimization Mathematics-Control and Optimization
CiteScore
3.00
自引率
0.00%
发文量
51
审稿时长
91 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信