心脏骤停患者ICU死亡率预后模型的构建和验证:可解释的机器学习建模方法。

IF 2.8 3区 医学 Q2 MEDICINE, RESEARCH & EXPERIMENTAL
Yong Li, Ying Liu, Qing Zhang, Hongwei Zhu, Chengli Wen, Xian Jiang
{"title":"心脏骤停患者ICU死亡率预后模型的构建和验证:可解释的机器学习建模方法。","authors":"Yong Li, Ying Liu, Qing Zhang, Hongwei Zhu, Chengli Wen, Xian Jiang","doi":"10.1186/s40001-025-02588-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The incidence and mortality of cardiac arrest (CA) is high. We developed interpretable machine learning models for early prediction of ICU mortality risk in patients diagnosed with CA.</p><p><strong>Methods: </strong>Data from the Medical Information Mart for Intensive Care (MIMIC-IV, version 2.2) was randomized to training set (0.7) and internal validation set (0.3), and data from eICU(version 2.0.1) was used as external validation set. Five models including Logistic Regression (LR), Random Forest (RF), K Nearest Neighbor (KNN), Decision Tree (DT), and Extreme Gradient Boost (XGBoost) were developed. The model with the largest area under the Receiver Operating Characteristic (ROC) curve (AUC) and good performance in other features was defined as the best model, and Shapley Additive Explanations (SHAP) was used to improve the interpretability of the optimal model.</p><p><strong>Results: </strong>A total of 1088 patients from MIMIC-IV, and 3542 patients from eICU were included. Seven variables were selected to construct models by Least Absolute Shrinkage and Selection Operator (LASSO) regression. The RF model was the best predictive model with AUC and 95% CI at 0.83 (0.78-0.88) in internal validation set, and 0.71(0.68-0.74) in external validation set. SHAP analysis found that the variables that had a high impact on the risk of ICU death were minimal Glasgow Coma Scale (GCS), base excess, anion gap, and urine output.</p><p><strong>Conclusion: </strong>RF is the optimal model for predicting the risk of ICU death in CA patients. The development of this model is important for early identification and intervention of CA patients who are at risk of dying in the ICU.</p>","PeriodicalId":11949,"journal":{"name":"European Journal of Medical Research","volume":"30 1","pages":"328"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12020013/pdf/","citationCount":"0","resultStr":"{\"title\":\"Construction and validation of prognostic model for ICU mortality in cardiac arrest patients: an interpretable machine learning modeling approach.\",\"authors\":\"Yong Li, Ying Liu, Qing Zhang, Hongwei Zhu, Chengli Wen, Xian Jiang\",\"doi\":\"10.1186/s40001-025-02588-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The incidence and mortality of cardiac arrest (CA) is high. We developed interpretable machine learning models for early prediction of ICU mortality risk in patients diagnosed with CA.</p><p><strong>Methods: </strong>Data from the Medical Information Mart for Intensive Care (MIMIC-IV, version 2.2) was randomized to training set (0.7) and internal validation set (0.3), and data from eICU(version 2.0.1) was used as external validation set. Five models including Logistic Regression (LR), Random Forest (RF), K Nearest Neighbor (KNN), Decision Tree (DT), and Extreme Gradient Boost (XGBoost) were developed. The model with the largest area under the Receiver Operating Characteristic (ROC) curve (AUC) and good performance in other features was defined as the best model, and Shapley Additive Explanations (SHAP) was used to improve the interpretability of the optimal model.</p><p><strong>Results: </strong>A total of 1088 patients from MIMIC-IV, and 3542 patients from eICU were included. Seven variables were selected to construct models by Least Absolute Shrinkage and Selection Operator (LASSO) regression. The RF model was the best predictive model with AUC and 95% CI at 0.83 (0.78-0.88) in internal validation set, and 0.71(0.68-0.74) in external validation set. SHAP analysis found that the variables that had a high impact on the risk of ICU death were minimal Glasgow Coma Scale (GCS), base excess, anion gap, and urine output.</p><p><strong>Conclusion: </strong>RF is the optimal model for predicting the risk of ICU death in CA patients. The development of this model is important for early identification and intervention of CA patients who are at risk of dying in the ICU.</p>\",\"PeriodicalId\":11949,\"journal\":{\"name\":\"European Journal of Medical Research\",\"volume\":\"30 1\",\"pages\":\"328\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12020013/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Medical Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s40001-025-02588-2\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40001-025-02588-2","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

背景:心脏骤停(CA)的发病率和死亡率很高。我们开发了可解释的机器学习模型,用于早期预测ca患者的ICU死亡风险。方法:重症监护医学信息市场(MIMIC-IV,版本2.2)的数据随机分配到训练集(0.7)和内部验证集(0.3)中,eICU(版本2.0.1)的数据作为外部验证集。建立了Logistic回归(LR)、随机森林(RF)、K近邻(KNN)、决策树(DT)和极限梯度提升(XGBoost) 5种模型。将受试者工作特征(ROC)曲线下面积最大且其他特征表现较好的模型定义为最佳模型,并采用Shapley加性解释(SHAP)来提高最优模型的可解释性。结果:共纳入MIMIC-IV患者1088例,eICU患者3542例。通过最小绝对收缩和选择算子(LASSO)回归选择7个变量构建模型。RF模型为最佳预测模型,内部验证集的AUC为0.83(0.78 ~ 0.88),外部验证集的95% CI为0.71(0.68 ~ 0.74)。SHAP分析发现,对ICU死亡风险有高影响的变量是最小格拉斯哥昏迷评分(GCS)、碱过量、阴离子间隙和尿量。结论:RF是预测CA患者ICU死亡风险的最佳模型。该模型的发展对于ICU中有死亡风险的CA患者的早期识别和干预具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Construction and validation of prognostic model for ICU mortality in cardiac arrest patients: an interpretable machine learning modeling approach.

Background: The incidence and mortality of cardiac arrest (CA) is high. We developed interpretable machine learning models for early prediction of ICU mortality risk in patients diagnosed with CA.

Methods: Data from the Medical Information Mart for Intensive Care (MIMIC-IV, version 2.2) was randomized to training set (0.7) and internal validation set (0.3), and data from eICU(version 2.0.1) was used as external validation set. Five models including Logistic Regression (LR), Random Forest (RF), K Nearest Neighbor (KNN), Decision Tree (DT), and Extreme Gradient Boost (XGBoost) were developed. The model with the largest area under the Receiver Operating Characteristic (ROC) curve (AUC) and good performance in other features was defined as the best model, and Shapley Additive Explanations (SHAP) was used to improve the interpretability of the optimal model.

Results: A total of 1088 patients from MIMIC-IV, and 3542 patients from eICU were included. Seven variables were selected to construct models by Least Absolute Shrinkage and Selection Operator (LASSO) regression. The RF model was the best predictive model with AUC and 95% CI at 0.83 (0.78-0.88) in internal validation set, and 0.71(0.68-0.74) in external validation set. SHAP analysis found that the variables that had a high impact on the risk of ICU death were minimal Glasgow Coma Scale (GCS), base excess, anion gap, and urine output.

Conclusion: RF is the optimal model for predicting the risk of ICU death in CA patients. The development of this model is important for early identification and intervention of CA patients who are at risk of dying in the ICU.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
European Journal of Medical Research
European Journal of Medical Research 医学-医学:研究与实验
CiteScore
3.20
自引率
0.00%
发文量
247
审稿时长
>12 weeks
期刊介绍: European Journal of Medical Research publishes translational and clinical research of international interest across all medical disciplines, enabling clinicians and other researchers to learn about developments and innovations within these disciplines and across the boundaries between disciplines. The journal publishes high quality research and reviews and aims to ensure that the results of all well-conducted research are published, regardless of their outcome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信