Advanced Bayesian kernel machine regression for large-scale exposome studies: Making the impossible possible.

IF 25.7 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
The Innovation Pub Date : 2026-01-03 eCollection Date: 2026-04-06 DOI:10.1016/j.xinn.2025.101248
Yi Guo, Huixun Jia, Ziwei Peng, Xinming Xu, Zhicheng Zhang, Keyu Pan, Yuqin Zhou, Haidong Kan, Zhenyu Wu, Cong Liu
{"title":"Advanced Bayesian kernel machine regression for large-scale exposome studies: Making the impossible possible.","authors":"Yi Guo, Huixun Jia, Ziwei Peng, Xinming Xu, Zhicheng Zhang, Keyu Pan, Yuqin Zhou, Haidong Kan, Zhenyu Wu, Cong Liu","doi":"10.1016/j.xinn.2025.101248","DOIUrl":null,"url":null,"abstract":"<p><p>Exposome studies involve analyzing numerous exposures with complex interactions and potential collinearity, presenting challenges for conventional statistical methods. While Bayesian kernel machine regression (BKMR) has emerged as a promising solution, its widespread adoption has been hindered by high computational costs and restricted interpretability. To address these critical limitations in large-scale exposome studies, we developed an advanced BKMR (A-BKMR) model. The Gaussian predictive process and matrix decomposition were used to reduce both processing time and memory requirements. Additionally, we employed the parametric g-formula to generate interpretable statistics, including joint and univariate effects as well as bivariate and multivariate interactions. Across various scenarios with different sample sizes and numbers of exposures, A-BKMR demonstrated both high computational efficiency and model performance. Previously, analyzing datasets with sample sizes of 100,000 was unfeasible for traditional BKMR. The current A-BKMR can complete such analyses in 1 h on a personal computer, making it over 700,000 times faster than conventional BKMR implementations. Additionally, A-BKMR can accurately identify important exposure while preserving an area under the curve (AUC) > 0.99 and an <i>R</i> <sup><i>2</i></sup> > 0.97 across scenarios with varying sample sizes and numbers of exposures. Furthermore, A-BKMR introduces novel quantitative metrics for effect estimates and interaction analyses, substantially enhancing interpretability. These advancements establish A-BKMR as an excellent statistical framework for future large-scale exposome studies.</p>","PeriodicalId":36121,"journal":{"name":"The Innovation","volume":"7 4","pages":"101248"},"PeriodicalIF":25.7000,"publicationDate":"2026-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13069415/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Innovation","FirstCategoryId":"95","ListUrlMain":"https://doi.org/10.1016/j.xinn.2025.101248","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/4/6 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Exposome studies involve analyzing numerous exposures with complex interactions and potential collinearity, presenting challenges for conventional statistical methods. While Bayesian kernel machine regression (BKMR) has emerged as a promising solution, its widespread adoption has been hindered by high computational costs and restricted interpretability. To address these critical limitations in large-scale exposome studies, we developed an advanced BKMR (A-BKMR) model. The Gaussian predictive process and matrix decomposition were used to reduce both processing time and memory requirements. Additionally, we employed the parametric g-formula to generate interpretable statistics, including joint and univariate effects as well as bivariate and multivariate interactions. Across various scenarios with different sample sizes and numbers of exposures, A-BKMR demonstrated both high computational efficiency and model performance. Previously, analyzing datasets with sample sizes of 100,000 was unfeasible for traditional BKMR. The current A-BKMR can complete such analyses in 1 h on a personal computer, making it over 700,000 times faster than conventional BKMR implementations. Additionally, A-BKMR can accurately identify important exposure while preserving an area under the curve (AUC) > 0.99 and an R 2 > 0.97 across scenarios with varying sample sizes and numbers of exposures. Furthermore, A-BKMR introduces novel quantitative metrics for effect estimates and interaction analyses, substantially enhancing interpretability. These advancements establish A-BKMR as an excellent statistical framework for future large-scale exposome studies.

大规模暴露研究的高级贝叶斯核机回归:使不可能成为可能。
暴露研究涉及分析大量具有复杂相互作用和潜在共线性的暴露,这对传统的统计方法提出了挑战。虽然贝叶斯核机回归(BKMR)已经成为一种很有前途的解决方案,但其广泛采用受到高计算成本和有限的可解释性的阻碍。为了解决大规模暴露研究中的这些关键限制,我们开发了一种先进的BKMR (A-BKMR)模型。采用高斯预测过程和矩阵分解来减少处理时间和内存需求。此外,我们采用参数g公式来生成可解释的统计数据,包括联合和单变量效应,以及双变量和多变量相互作用。在不同样本量和暴露次数的各种场景中,A-BKMR显示出较高的计算效率和模型性能。以前,对于传统的BKMR来说,分析100,000个样本量的数据集是不可行的。目前的a -BKMR可以在个人计算机上1小时内完成这样的分析,使其比传统的BKMR实现快70万倍以上。此外,A-BKMR可以准确识别重要的暴露,同时在不同样本量和暴露次数的情况下保持曲线下面积(AUC) >.99和R 2 >.97。此外,A-BKMR为效应估计和相互作用分析引入了新的定量指标,大大提高了可解释性。这些进展使A-BKMR成为未来大规模暴露研究的优秀统计框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
The Innovation
The Innovation MULTIDISCIPLINARY SCIENCES-
CiteScore
38.30
自引率
1.20%
发文量
134
审稿时长
6 weeks
期刊介绍: The Innovation is an interdisciplinary journal that aims to promote scientific application. It publishes cutting-edge research and high-quality reviews in various scientific disciplines, including physics, chemistry, materials, nanotechnology, biology, translational medicine, geoscience, and engineering. The journal adheres to the peer review and publishing standards of Cell Press journals. The Innovation is committed to serving scientists and the public. It aims to publish significant advances promptly and provides a transparent exchange platform. The journal also strives to efficiently promote the translation from scientific discovery to technological achievements and rapidly disseminate scientific findings worldwide. Indexed in the following databases, The Innovation has visibility in Scopus, Directory of Open Access Journals (DOAJ), Web of Science, Emerging Sources Citation Index (ESCI), PubMed Central, Compendex (previously Ei index), INSPEC, and CABI A&I.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书