将多室微生物组数据与临床参数相结合,利用自编码器提高死亡率预测。

IF 1.9 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS
Binaya Dhakal, Lakshmi Sai Kishore Savarapu, Khaled Sayed
{"title":"将多室微生物组数据与临床参数相结合,利用自编码器提高死亡率预测。","authors":"Binaya Dhakal,&nbsp;Lakshmi Sai Kishore Savarapu,&nbsp;Khaled Sayed","doi":"10.1016/j.mimet.2025.107267","DOIUrl":null,"url":null,"abstract":"<div><div>The human microbiome, a complex ecosystem of microorganisms residing in different body compartments, significantly influences health outcomes and disease progression, however, leveraging this data for developing clinical prediction models remains challenging due to its high dimensionality, sparsity, and compositional nature. Traditional machine learning approaches often struggle to capture the intricate microbial interactions that contribute to mortality risk, particularly when analyzing data across multiple compartments with distinct microbial compositions. To address these limitations, we introduce a novel framework utilizing an autoencoder-based model trained on high-dimensional microbiome data collected from oral, lung, and gut compartments. Our approach encodes microbiome data into a low-dimensional latent space while preserving essential microbial community characteristics, enabling more effective feature extraction and pattern recognition than conventional dimensionality reduction techniques. Through systematic evaluation of three data configurations—microbiome taxa only, clinical data only, and an integrated model combining both—we demonstrated that the integrated approach consistently achieved superior prediction accuracy (98 % in lung microbiome) compared to using either data source independently. Clinical data alone provided reasonable but inconsistent performance (70–90 %), while microbiome taxa alone yielded the weakest results (53–65 %). Furthermore, our investigation of preprocessing techniques revealed that applying z-score normalization to the taxa data significantly enhanced performance and substantially improved recall metrics across all compartments. By analyzing compartment-specific microbial contributions, our study reveals distinct predictive roles of the oral and lung microbiomes compared to the gut microbiome, underscoring of body-site specificity in microbiome-based predictive modeling.</div></div>","PeriodicalId":16409,"journal":{"name":"Journal of microbiological methods","volume":"238 ","pages":"Article 107267"},"PeriodicalIF":1.9000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrating multi-compartment microbiome data with clinical parameters enhances mortality prediction using autoencoder\",\"authors\":\"Binaya Dhakal,&nbsp;Lakshmi Sai Kishore Savarapu,&nbsp;Khaled Sayed\",\"doi\":\"10.1016/j.mimet.2025.107267\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The human microbiome, a complex ecosystem of microorganisms residing in different body compartments, significantly influences health outcomes and disease progression, however, leveraging this data for developing clinical prediction models remains challenging due to its high dimensionality, sparsity, and compositional nature. Traditional machine learning approaches often struggle to capture the intricate microbial interactions that contribute to mortality risk, particularly when analyzing data across multiple compartments with distinct microbial compositions. To address these limitations, we introduce a novel framework utilizing an autoencoder-based model trained on high-dimensional microbiome data collected from oral, lung, and gut compartments. Our approach encodes microbiome data into a low-dimensional latent space while preserving essential microbial community characteristics, enabling more effective feature extraction and pattern recognition than conventional dimensionality reduction techniques. Through systematic evaluation of three data configurations—microbiome taxa only, clinical data only, and an integrated model combining both—we demonstrated that the integrated approach consistently achieved superior prediction accuracy (98 % in lung microbiome) compared to using either data source independently. Clinical data alone provided reasonable but inconsistent performance (70–90 %), while microbiome taxa alone yielded the weakest results (53–65 %). Furthermore, our investigation of preprocessing techniques revealed that applying z-score normalization to the taxa data significantly enhanced performance and substantially improved recall metrics across all compartments. By analyzing compartment-specific microbial contributions, our study reveals distinct predictive roles of the oral and lung microbiomes compared to the gut microbiome, underscoring of body-site specificity in microbiome-based predictive modeling.</div></div>\",\"PeriodicalId\":16409,\"journal\":{\"name\":\"Journal of microbiological methods\",\"volume\":\"238 \",\"pages\":\"Article 107267\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2025-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of microbiological methods\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167701225001836\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of microbiological methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167701225001836","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

人类微生物组是一个复杂的微生物生态系统,居住在不同的身体隔间中,显著影响健康结果和疾病进展,然而,利用这些数据开发临床预测模型仍然具有挑战性,因为它的高维数、稀疏性和组成性质。传统的机器学习方法往往难以捕捉导致死亡风险的复杂微生物相互作用,特别是在分析具有不同微生物组成的多个隔间的数据时。为了解决这些限制,我们引入了一种新的框架,利用基于自编码器的模型训练从口腔、肺和肠道隔间收集的高维微生物组数据。我们的方法将微生物组数据编码到低维潜在空间中,同时保留微生物群落的基本特征,比传统的降维技术实现更有效的特征提取和模式识别。通过对三种数据配置(仅微生物组分类群、仅临床数据和结合两者的集成模型)的系统评估,我们证明,与单独使用任何一种数据源相比,集成方法始终具有更高的预测精度(98 %)。单独的临床数据提供了合理但不一致的性能(70-90 %),而单独的微生物类群效果最差(53-65 %)。此外,我们对预处理技术的研究表明,将z-score归一化应用于分类群数据显著提高了性能,并大大改善了所有隔间的召回指标。通过分析区室特异性微生物的贡献,我们的研究揭示了与肠道微生物组相比,口腔和肺部微生物组具有不同的预测作用,强调了基于微生物组的预测模型中身体部位的特异性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Integrating multi-compartment microbiome data with clinical parameters enhances mortality prediction using autoencoder
The human microbiome, a complex ecosystem of microorganisms residing in different body compartments, significantly influences health outcomes and disease progression, however, leveraging this data for developing clinical prediction models remains challenging due to its high dimensionality, sparsity, and compositional nature. Traditional machine learning approaches often struggle to capture the intricate microbial interactions that contribute to mortality risk, particularly when analyzing data across multiple compartments with distinct microbial compositions. To address these limitations, we introduce a novel framework utilizing an autoencoder-based model trained on high-dimensional microbiome data collected from oral, lung, and gut compartments. Our approach encodes microbiome data into a low-dimensional latent space while preserving essential microbial community characteristics, enabling more effective feature extraction and pattern recognition than conventional dimensionality reduction techniques. Through systematic evaluation of three data configurations—microbiome taxa only, clinical data only, and an integrated model combining both—we demonstrated that the integrated approach consistently achieved superior prediction accuracy (98 % in lung microbiome) compared to using either data source independently. Clinical data alone provided reasonable but inconsistent performance (70–90 %), while microbiome taxa alone yielded the weakest results (53–65 %). Furthermore, our investigation of preprocessing techniques revealed that applying z-score normalization to the taxa data significantly enhanced performance and substantially improved recall metrics across all compartments. By analyzing compartment-specific microbial contributions, our study reveals distinct predictive roles of the oral and lung microbiomes compared to the gut microbiome, underscoring of body-site specificity in microbiome-based predictive modeling.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of microbiological methods
Journal of microbiological methods 生物-生化研究方法
CiteScore
4.30
自引率
4.50%
发文量
151
审稿时长
29 days
期刊介绍: The Journal of Microbiological Methods publishes scholarly and original articles, notes and review articles. These articles must include novel and/or state-of-the-art methods, or significant improvements to existing methods. Novel and innovative applications of current methods that are validated and useful will also be published. JMM strives for scholarship, innovation and excellence. This demands scientific rigour, the best available methods and technologies, correctly replicated experiments/tests, the inclusion of proper controls, calibrations, and the correct statistical analysis. The presentation of the data must support the interpretation of the method/approach. All aspects of microbiology are covered, except virology. These include agricultural microbiology, applied and environmental microbiology, bioassays, bioinformatics, biotechnology, biochemical microbiology, clinical microbiology, diagnostics, food monitoring and quality control microbiology, microbial genetics and genomics, geomicrobiology, microbiome methods regardless of habitat, high through-put sequencing methods and analysis, microbial pathogenesis and host responses, metabolomics, metagenomics, metaproteomics, microbial ecology and diversity, microbial physiology, microbial ultra-structure, microscopic and imaging methods, molecular microbiology, mycology, novel mathematical microbiology and modelling, parasitology, plant-microbe interactions, protein markers/profiles, proteomics, pyrosequencing, public health microbiology, radioisotopes applied to microbiology, robotics applied to microbiological methods,rumen microbiology, microbiological methods for space missions and extreme environments, sampling methods and samplers, soil and sediment microbiology, transcriptomics, veterinary microbiology, sero-diagnostics and typing/identification.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信