Thi Huyen Nguyen, Ibrahim Hamad, Markus Kleinewietfeld, Dhammika Amaratunga, Javier Cabrera, Davit Sargsyan, Rudradev Sengupta, Olajumoke Evangelina Owokotomo, Michael N Katehakis, Ziv Shkedy
{"title":"使用惩罚回归方法开发多种微生物组生物标志物。","authors":"Thi Huyen Nguyen, Ibrahim Hamad, Markus Kleinewietfeld, Dhammika Amaratunga, Javier Cabrera, Davit Sargsyan, Rudradev Sengupta, Olajumoke Evangelina Owokotomo, Michael N Katehakis, Ziv Shkedy","doi":"10.1093/jambio/lxaf242","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>Identifying biomarkers that reflect the complex relationship between the microbiome and health outcomes in microbiome studies is essential for advancing the understanding and improving disease management. While past research was focused on a single biomarker modeling approach, this study extends that work by combining multiple taxa to identify a subset of multiple biomarkers relevant to clinical outcomes.</p><p><strong>Methods and results: </strong>We extend the information theory framework for surrogate endpoint evaluation by applying LASSO and Elastic Net models to identify combinations of taxa as biomarkers for clinical outcomes. Feature selection for the biomarker's construction is done in order to maximize the goodness of fit of the predictive biomarker model. Monte Carlo cross validation is used to enhance the reliability of feature selection. The high salt diet study on mice is used to illustrate the methodology for continuous outcome (tumor size). The top 5 selected genera yielded a correlation of 0.9274 between predicted and observed tumor size, with a 67.92% reduction in uncertainty when the multiple microbiome biomarkers score is known. To illustrate the methodology for binary outcome, the CERTIFI study on Crohn's disease patients treated with ustekinumab is used. A multiple microbiome biomarkers score, constructed using the top 5 selected families, significantly improved prediction of remission 6 weeks after induction treatment (the clinical outcome of interest).</p><p><strong>Conclusions: </strong>This study presents a unified approach for identifying multiple microbiome biomarkers using penalized regression for clinical outcome prediction. The proposed methods are applied to both continuous and binary outcomes. The method enhances the detection of meaningful biomarkers with potential for personalized treatment and disease management.</p>","PeriodicalId":15036,"journal":{"name":"Journal of Applied Microbiology","volume":" ","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development of multiple microbiome biomarkers using penalized regression methods.\",\"authors\":\"Thi Huyen Nguyen, Ibrahim Hamad, Markus Kleinewietfeld, Dhammika Amaratunga, Javier Cabrera, Davit Sargsyan, Rudradev Sengupta, Olajumoke Evangelina Owokotomo, Michael N Katehakis, Ziv Shkedy\",\"doi\":\"10.1093/jambio/lxaf242\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Aims: </strong>Identifying biomarkers that reflect the complex relationship between the microbiome and health outcomes in microbiome studies is essential for advancing the understanding and improving disease management. While past research was focused on a single biomarker modeling approach, this study extends that work by combining multiple taxa to identify a subset of multiple biomarkers relevant to clinical outcomes.</p><p><strong>Methods and results: </strong>We extend the information theory framework for surrogate endpoint evaluation by applying LASSO and Elastic Net models to identify combinations of taxa as biomarkers for clinical outcomes. Feature selection for the biomarker's construction is done in order to maximize the goodness of fit of the predictive biomarker model. Monte Carlo cross validation is used to enhance the reliability of feature selection. The high salt diet study on mice is used to illustrate the methodology for continuous outcome (tumor size). The top 5 selected genera yielded a correlation of 0.9274 between predicted and observed tumor size, with a 67.92% reduction in uncertainty when the multiple microbiome biomarkers score is known. To illustrate the methodology for binary outcome, the CERTIFI study on Crohn's disease patients treated with ustekinumab is used. A multiple microbiome biomarkers score, constructed using the top 5 selected families, significantly improved prediction of remission 6 weeks after induction treatment (the clinical outcome of interest).</p><p><strong>Conclusions: </strong>This study presents a unified approach for identifying multiple microbiome biomarkers using penalized regression for clinical outcome prediction. The proposed methods are applied to both continuous and binary outcomes. The method enhances the detection of meaningful biomarkers with potential for personalized treatment and disease management.</p>\",\"PeriodicalId\":15036,\"journal\":{\"name\":\"Journal of Applied Microbiology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Applied Microbiology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/jambio/lxaf242\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Microbiology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/jambio/lxaf242","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
Development of multiple microbiome biomarkers using penalized regression methods.
Aims: Identifying biomarkers that reflect the complex relationship between the microbiome and health outcomes in microbiome studies is essential for advancing the understanding and improving disease management. While past research was focused on a single biomarker modeling approach, this study extends that work by combining multiple taxa to identify a subset of multiple biomarkers relevant to clinical outcomes.
Methods and results: We extend the information theory framework for surrogate endpoint evaluation by applying LASSO and Elastic Net models to identify combinations of taxa as biomarkers for clinical outcomes. Feature selection for the biomarker's construction is done in order to maximize the goodness of fit of the predictive biomarker model. Monte Carlo cross validation is used to enhance the reliability of feature selection. The high salt diet study on mice is used to illustrate the methodology for continuous outcome (tumor size). The top 5 selected genera yielded a correlation of 0.9274 between predicted and observed tumor size, with a 67.92% reduction in uncertainty when the multiple microbiome biomarkers score is known. To illustrate the methodology for binary outcome, the CERTIFI study on Crohn's disease patients treated with ustekinumab is used. A multiple microbiome biomarkers score, constructed using the top 5 selected families, significantly improved prediction of remission 6 weeks after induction treatment (the clinical outcome of interest).
Conclusions: This study presents a unified approach for identifying multiple microbiome biomarkers using penalized regression for clinical outcome prediction. The proposed methods are applied to both continuous and binary outcomes. The method enhances the detection of meaningful biomarkers with potential for personalized treatment and disease management.
期刊介绍:
Journal of & Letters in Applied Microbiology are two of the flagship research journals of the Society for Applied Microbiology (SfAM). For more than 75 years they have been publishing top quality research and reviews in the broad field of applied microbiology. The journals are provided to all SfAM members as well as having a global online readership totalling more than 500,000 downloads per year in more than 200 countries. Submitting authors can expect fast decision and publication times, averaging 33 days to first decision and 34 days from acceptance to online publication. There are no page charges.