Development of multiple microbiome biomarkers using penalized regression methods.

IF 3.2 3区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY
Thi Huyen Nguyen, Ibrahim Hamad, Markus Kleinewietfeld, Dhammika Amaratunga, Javier Cabrera, Davit Sargsyan, Rudradev Sengupta, Olajumoke Evangelina Owokotomo, Michael N Katehakis, Ziv Shkedy
{"title":"Development of multiple microbiome biomarkers using penalized regression methods.","authors":"Thi Huyen Nguyen, Ibrahim Hamad, Markus Kleinewietfeld, Dhammika Amaratunga, Javier Cabrera, Davit Sargsyan, Rudradev Sengupta, Olajumoke Evangelina Owokotomo, Michael N Katehakis, Ziv Shkedy","doi":"10.1093/jambio/lxaf242","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>Identifying biomarkers that reflect the complex relationship between the microbiome and health outcomes in microbiome studies is essential for advancing the understanding and improving disease management. While past research was focused on a single biomarker modeling approach, this study extends that work by combining multiple taxa to identify a subset of multiple biomarkers relevant to clinical outcomes.</p><p><strong>Methods and results: </strong>We extend the information theory framework for surrogate endpoint evaluation by applying LASSO and Elastic Net models to identify combinations of taxa as biomarkers for clinical outcomes. Feature selection for the biomarker's construction is done in order to maximize the goodness of fit of the predictive biomarker model. Monte Carlo cross validation is used to enhance the reliability of feature selection. The high salt diet study on mice is used to illustrate the methodology for continuous outcome (tumor size). The top 5 selected genera yielded a correlation of 0.9274 between predicted and observed tumor size, with a 67.92% reduction in uncertainty when the multiple microbiome biomarkers score is known. To illustrate the methodology for binary outcome, the CERTIFI study on Crohn's disease patients treated with ustekinumab is used. A multiple microbiome biomarkers score, constructed using the top 5 selected families, significantly improved prediction of remission 6 weeks after induction treatment (the clinical outcome of interest).</p><p><strong>Conclusions: </strong>This study presents a unified approach for identifying multiple microbiome biomarkers using penalized regression for clinical outcome prediction. The proposed methods are applied to both continuous and binary outcomes. The method enhances the detection of meaningful biomarkers with potential for personalized treatment and disease management.</p>","PeriodicalId":15036,"journal":{"name":"Journal of Applied Microbiology","volume":" ","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Microbiology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/jambio/lxaf242","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Aims: Identifying biomarkers that reflect the complex relationship between the microbiome and health outcomes in microbiome studies is essential for advancing the understanding and improving disease management. While past research was focused on a single biomarker modeling approach, this study extends that work by combining multiple taxa to identify a subset of multiple biomarkers relevant to clinical outcomes.

Methods and results: We extend the information theory framework for surrogate endpoint evaluation by applying LASSO and Elastic Net models to identify combinations of taxa as biomarkers for clinical outcomes. Feature selection for the biomarker's construction is done in order to maximize the goodness of fit of the predictive biomarker model. Monte Carlo cross validation is used to enhance the reliability of feature selection. The high salt diet study on mice is used to illustrate the methodology for continuous outcome (tumor size). The top 5 selected genera yielded a correlation of 0.9274 between predicted and observed tumor size, with a 67.92% reduction in uncertainty when the multiple microbiome biomarkers score is known. To illustrate the methodology for binary outcome, the CERTIFI study on Crohn's disease patients treated with ustekinumab is used. A multiple microbiome biomarkers score, constructed using the top 5 selected families, significantly improved prediction of remission 6 weeks after induction treatment (the clinical outcome of interest).

Conclusions: This study presents a unified approach for identifying multiple microbiome biomarkers using penalized regression for clinical outcome prediction. The proposed methods are applied to both continuous and binary outcomes. The method enhances the detection of meaningful biomarkers with potential for personalized treatment and disease management.

使用惩罚回归方法开发多种微生物组生物标志物。
目的:在微生物组研究中,识别反映微生物组与健康结果之间复杂关系的生物标志物对于促进对微生物组的理解和改善疾病管理至关重要。虽然过去的研究主要集中在单一生物标志物建模方法上,但本研究通过结合多个分类群来确定与临床结果相关的多个生物标志物的子集,从而扩展了这一工作。方法和结果:通过应用LASSO和Elastic Net模型(Zou和Hastie, 2005; Hastie等,2015),我们扩展了替代终点评估的信息论框架(Alonso和Molenberghs, 2007),以确定分类群组合作为临床结果的生物标志物。为了使生物标志物预测模型的拟合优度最大化,对生物标志物的构建进行特征选择。采用蒙特卡罗交叉验证来提高特征选择的可靠性。高盐饮食(HSD)对小鼠的研究,被用来说明连续结果(肿瘤大小)的方法。前5个选择的属在预测和观察肿瘤大小之间的相关性为0.9274,当多个微生物组生物标志物得分已知时,不确定性降低了67.92%。为了说明二元结果的方法,使用了使用ustekinumab治疗的克罗恩病患者的CERTIFI研究。使用前5个选定的家族构建的多种微生物组生物标志物评分显着提高了诱导治疗后6周缓解的预测(感兴趣的临床结果)。结论:本研究提出了一种统一的方法来识别多种微生物组生物标志物,使用惩罚回归进行临床结果预测。所提出的方法既适用于连续结果,也适用于二进制结果。该方法增强了对有意义的生物标志物的检测,具有个性化治疗和疾病管理的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Applied Microbiology
Journal of Applied Microbiology 生物-生物工程与应用微生物
CiteScore
7.30
自引率
2.50%
发文量
427
审稿时长
2.7 months
期刊介绍: Journal of & Letters in Applied Microbiology are two of the flagship research journals of the Society for Applied Microbiology (SfAM). For more than 75 years they have been publishing top quality research and reviews in the broad field of applied microbiology. The journals are provided to all SfAM members as well as having a global online readership totalling more than 500,000 downloads per year in more than 200 countries. Submitting authors can expect fast decision and publication times, averaging 33 days to first decision and 34 days from acceptance to online publication. There are no page charges.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信