一个可解释的机器学习模型的开发和验证,用于预测急性慢性肝衰竭28天内侵袭性真菌感染。

IF 3.1 2区 医学 Q1 DERMATOLOGY
Mycoses Pub Date : 2025-07-01 DOI:10.1111/myc.70090
Fei-Xiang Xiong, Jian-Guo Yan, Xue-Jie Zhang, Yang Zhou, Xiao-Min Ji, Rong-Hua Jin, Yi-Xin Hou
{"title":"一个可解释的机器学习模型的开发和验证,用于预测急性慢性肝衰竭28天内侵袭性真菌感染。","authors":"Fei-Xiang Xiong, Jian-Guo Yan, Xue-Jie Zhang, Yang Zhou, Xiao-Min Ji, Rong-Hua Jin, Yi-Xin Hou","doi":"10.1111/myc.70090","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and objective: </strong>Acute-on-chronic liver failure (ACLF) is associated with significantly higher short-term mortality, and the presence of invasive fungal infection (IFI) further increases this risk. This study aims to develop a ML model that predicts the risk of IFI in ACLF patients.</p><p><strong>Methods: </strong>This study included 1112 patients divided into a training set and a validation set, with another 188 patients serving as an external validation cohort. The Recursive Feature Elimination (RFE) method was used to select the most significant variables for model development. Four machine learning algorithms were compared to identify the optimal model. The models were evaluated and compared using C-index, time-dependent ROC curves, decision curve analysis (DCA), and calibration curves. The LIME (Local Interpretable Model-Agnostic Explanations) method was used to identify the high-risk populations utilised by the model.</p><p><strong>Results: </strong>778 patients were included in the training set, 334 in the internal validation set, and 188 in the external validation set. The study found that Random Forest (RF) was the best-performing ML algorithm. In the training set, the RF model achieved an AUROC of 0.922 (0.911-0.933), significantly higher than MELD (0.854, 0.835-0.873, p < 0.001), CLIF-C OF (0.753, 0.724-0.783, p < 0.001), and CLIF-C ACLF (0.879, 0.863-0.896, p = 0.020). The same trend was observed in both the internal and external validation sets. The time-dependent ROC curve showed that the RF model outperformed the other scores for predicting the risk of IFI in 28 days. DCA and calibration curves also demonstrated superior clinical benefits for the RF model across all datasets. LIME revealed bacterial infection (BI), Na < 136 mmol/L, CRP (C-reactive protein) > 20.1 g/L, and TBIL(Total Bilirubin) > 196.7 μmol/L as the high-risk groups.</p><p><strong>Conclusion: </strong>The RF model effectively predicts the risk of IFI in ACLF patients. The application of LIME enables the identification of high-risk populations, providing clinical value for patient management.</p>","PeriodicalId":18797,"journal":{"name":"Mycoses","volume":"68 7","pages":"e70090"},"PeriodicalIF":3.1000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development and Validation of an Explainable Machine Learning Model for Predicting Invasive Fungal Infection in Acute-On-Chronic Liver Failure Within 28 Days.\",\"authors\":\"Fei-Xiang Xiong, Jian-Guo Yan, Xue-Jie Zhang, Yang Zhou, Xiao-Min Ji, Rong-Hua Jin, Yi-Xin Hou\",\"doi\":\"10.1111/myc.70090\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background and objective: </strong>Acute-on-chronic liver failure (ACLF) is associated with significantly higher short-term mortality, and the presence of invasive fungal infection (IFI) further increases this risk. This study aims to develop a ML model that predicts the risk of IFI in ACLF patients.</p><p><strong>Methods: </strong>This study included 1112 patients divided into a training set and a validation set, with another 188 patients serving as an external validation cohort. The Recursive Feature Elimination (RFE) method was used to select the most significant variables for model development. Four machine learning algorithms were compared to identify the optimal model. The models were evaluated and compared using C-index, time-dependent ROC curves, decision curve analysis (DCA), and calibration curves. The LIME (Local Interpretable Model-Agnostic Explanations) method was used to identify the high-risk populations utilised by the model.</p><p><strong>Results: </strong>778 patients were included in the training set, 334 in the internal validation set, and 188 in the external validation set. The study found that Random Forest (RF) was the best-performing ML algorithm. In the training set, the RF model achieved an AUROC of 0.922 (0.911-0.933), significantly higher than MELD (0.854, 0.835-0.873, p < 0.001), CLIF-C OF (0.753, 0.724-0.783, p < 0.001), and CLIF-C ACLF (0.879, 0.863-0.896, p = 0.020). The same trend was observed in both the internal and external validation sets. The time-dependent ROC curve showed that the RF model outperformed the other scores for predicting the risk of IFI in 28 days. DCA and calibration curves also demonstrated superior clinical benefits for the RF model across all datasets. LIME revealed bacterial infection (BI), Na < 136 mmol/L, CRP (C-reactive protein) > 20.1 g/L, and TBIL(Total Bilirubin) > 196.7 μmol/L as the high-risk groups.</p><p><strong>Conclusion: </strong>The RF model effectively predicts the risk of IFI in ACLF patients. The application of LIME enables the identification of high-risk populations, providing clinical value for patient management.</p>\",\"PeriodicalId\":18797,\"journal\":{\"name\":\"Mycoses\",\"volume\":\"68 7\",\"pages\":\"e70090\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mycoses\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/myc.70090\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"DERMATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mycoses","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/myc.70090","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DERMATOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景和目的:急性慢性肝衰竭(ACLF)与显著较高的短期死亡率相关,侵袭性真菌感染(IFI)的存在进一步增加了这种风险。本研究旨在建立预测ACLF患者IFI风险的ML模型。方法:本研究纳入1112例患者,分为训练组和验证组,另外188例患者作为外部验证队列。采用递归特征消除(RFE)方法选择最重要的变量进行模型开发。比较了四种机器学习算法,以确定最优模型。采用c指数、随时间变化的ROC曲线、决策曲线分析(DCA)和校准曲线对模型进行评价和比较。LIME(局部可解释模型不可知论解释)方法用于识别模型使用的高危人群。结果:778例患者被纳入训练集,334例患者被纳入内部验证集,188例患者被纳入外部验证集。研究发现随机森林(Random Forest, RF)是表现最好的ML算法。在训练集中,RF模型的AUROC为0.922(0.911-0.933),显著高于MELD (0.854, 0.835-0.873, p 20.1 g/L)和TBIL(Total Bilirubin) > 196.7 μmol/L。结论:RF模型可有效预测ACLF患者发生IFI的风险。LIME的应用可以识别高危人群,为患者管理提供临床价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Development and Validation of an Explainable Machine Learning Model for Predicting Invasive Fungal Infection in Acute-On-Chronic Liver Failure Within 28 Days.

Background and objective: Acute-on-chronic liver failure (ACLF) is associated with significantly higher short-term mortality, and the presence of invasive fungal infection (IFI) further increases this risk. This study aims to develop a ML model that predicts the risk of IFI in ACLF patients.

Methods: This study included 1112 patients divided into a training set and a validation set, with another 188 patients serving as an external validation cohort. The Recursive Feature Elimination (RFE) method was used to select the most significant variables for model development. Four machine learning algorithms were compared to identify the optimal model. The models were evaluated and compared using C-index, time-dependent ROC curves, decision curve analysis (DCA), and calibration curves. The LIME (Local Interpretable Model-Agnostic Explanations) method was used to identify the high-risk populations utilised by the model.

Results: 778 patients were included in the training set, 334 in the internal validation set, and 188 in the external validation set. The study found that Random Forest (RF) was the best-performing ML algorithm. In the training set, the RF model achieved an AUROC of 0.922 (0.911-0.933), significantly higher than MELD (0.854, 0.835-0.873, p < 0.001), CLIF-C OF (0.753, 0.724-0.783, p < 0.001), and CLIF-C ACLF (0.879, 0.863-0.896, p = 0.020). The same trend was observed in both the internal and external validation sets. The time-dependent ROC curve showed that the RF model outperformed the other scores for predicting the risk of IFI in 28 days. DCA and calibration curves also demonstrated superior clinical benefits for the RF model across all datasets. LIME revealed bacterial infection (BI), Na < 136 mmol/L, CRP (C-reactive protein) > 20.1 g/L, and TBIL(Total Bilirubin) > 196.7 μmol/L as the high-risk groups.

Conclusion: The RF model effectively predicts the risk of IFI in ACLF patients. The application of LIME enables the identification of high-risk populations, providing clinical value for patient management.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Mycoses
Mycoses 医学-皮肤病学
CiteScore
10.00
自引率
8.20%
发文量
143
审稿时长
6-12 weeks
期刊介绍: The journal Mycoses provides an international forum for original papers in English on the pathogenesis, diagnosis, therapy, prophylaxis, and epidemiology of fungal infectious diseases in humans as well as on the biology of pathogenic fungi. Medical mycology as part of medical microbiology is advancing rapidly. Effective therapeutic strategies are already available in chemotherapy and are being further developed. Their application requires reliable laboratory diagnostic techniques, which, in turn, result from mycological basic research. Opportunistic mycoses vary greatly in their clinical and pathological symptoms, because the underlying disease of a patient at risk decisively determines their symptomatology and progress. The journal Mycoses is therefore of interest to scientists in fundamental mycological research, mycological laboratory diagnosticians and clinicians interested in fungal infections.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信