{"title":"心脏骤停患者ICU死亡率预后模型的构建和验证:可解释的机器学习建模方法。","authors":"Yong Li, Ying Liu, Qing Zhang, Hongwei Zhu, Chengli Wen, Xian Jiang","doi":"10.1186/s40001-025-02588-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The incidence and mortality of cardiac arrest (CA) is high. We developed interpretable machine learning models for early prediction of ICU mortality risk in patients diagnosed with CA.</p><p><strong>Methods: </strong>Data from the Medical Information Mart for Intensive Care (MIMIC-IV, version 2.2) was randomized to training set (0.7) and internal validation set (0.3), and data from eICU(version 2.0.1) was used as external validation set. Five models including Logistic Regression (LR), Random Forest (RF), K Nearest Neighbor (KNN), Decision Tree (DT), and Extreme Gradient Boost (XGBoost) were developed. The model with the largest area under the Receiver Operating Characteristic (ROC) curve (AUC) and good performance in other features was defined as the best model, and Shapley Additive Explanations (SHAP) was used to improve the interpretability of the optimal model.</p><p><strong>Results: </strong>A total of 1088 patients from MIMIC-IV, and 3542 patients from eICU were included. Seven variables were selected to construct models by Least Absolute Shrinkage and Selection Operator (LASSO) regression. The RF model was the best predictive model with AUC and 95% CI at 0.83 (0.78-0.88) in internal validation set, and 0.71(0.68-0.74) in external validation set. SHAP analysis found that the variables that had a high impact on the risk of ICU death were minimal Glasgow Coma Scale (GCS), base excess, anion gap, and urine output.</p><p><strong>Conclusion: </strong>RF is the optimal model for predicting the risk of ICU death in CA patients. The development of this model is important for early identification and intervention of CA patients who are at risk of dying in the ICU.</p>","PeriodicalId":11949,"journal":{"name":"European Journal of Medical Research","volume":"30 1","pages":"328"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12020013/pdf/","citationCount":"0","resultStr":"{\"title\":\"Construction and validation of prognostic model for ICU mortality in cardiac arrest patients: an interpretable machine learning modeling approach.\",\"authors\":\"Yong Li, Ying Liu, Qing Zhang, Hongwei Zhu, Chengli Wen, Xian Jiang\",\"doi\":\"10.1186/s40001-025-02588-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The incidence and mortality of cardiac arrest (CA) is high. We developed interpretable machine learning models for early prediction of ICU mortality risk in patients diagnosed with CA.</p><p><strong>Methods: </strong>Data from the Medical Information Mart for Intensive Care (MIMIC-IV, version 2.2) was randomized to training set (0.7) and internal validation set (0.3), and data from eICU(version 2.0.1) was used as external validation set. Five models including Logistic Regression (LR), Random Forest (RF), K Nearest Neighbor (KNN), Decision Tree (DT), and Extreme Gradient Boost (XGBoost) were developed. The model with the largest area under the Receiver Operating Characteristic (ROC) curve (AUC) and good performance in other features was defined as the best model, and Shapley Additive Explanations (SHAP) was used to improve the interpretability of the optimal model.</p><p><strong>Results: </strong>A total of 1088 patients from MIMIC-IV, and 3542 patients from eICU were included. Seven variables were selected to construct models by Least Absolute Shrinkage and Selection Operator (LASSO) regression. The RF model was the best predictive model with AUC and 95% CI at 0.83 (0.78-0.88) in internal validation set, and 0.71(0.68-0.74) in external validation set. SHAP analysis found that the variables that had a high impact on the risk of ICU death were minimal Glasgow Coma Scale (GCS), base excess, anion gap, and urine output.</p><p><strong>Conclusion: </strong>RF is the optimal model for predicting the risk of ICU death in CA patients. The development of this model is important for early identification and intervention of CA patients who are at risk of dying in the ICU.</p>\",\"PeriodicalId\":11949,\"journal\":{\"name\":\"European Journal of Medical Research\",\"volume\":\"30 1\",\"pages\":\"328\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12020013/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Medical Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s40001-025-02588-2\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40001-025-02588-2","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
Construction and validation of prognostic model for ICU mortality in cardiac arrest patients: an interpretable machine learning modeling approach.
Background: The incidence and mortality of cardiac arrest (CA) is high. We developed interpretable machine learning models for early prediction of ICU mortality risk in patients diagnosed with CA.
Methods: Data from the Medical Information Mart for Intensive Care (MIMIC-IV, version 2.2) was randomized to training set (0.7) and internal validation set (0.3), and data from eICU(version 2.0.1) was used as external validation set. Five models including Logistic Regression (LR), Random Forest (RF), K Nearest Neighbor (KNN), Decision Tree (DT), and Extreme Gradient Boost (XGBoost) were developed. The model with the largest area under the Receiver Operating Characteristic (ROC) curve (AUC) and good performance in other features was defined as the best model, and Shapley Additive Explanations (SHAP) was used to improve the interpretability of the optimal model.
Results: A total of 1088 patients from MIMIC-IV, and 3542 patients from eICU were included. Seven variables were selected to construct models by Least Absolute Shrinkage and Selection Operator (LASSO) regression. The RF model was the best predictive model with AUC and 95% CI at 0.83 (0.78-0.88) in internal validation set, and 0.71(0.68-0.74) in external validation set. SHAP analysis found that the variables that had a high impact on the risk of ICU death were minimal Glasgow Coma Scale (GCS), base excess, anion gap, and urine output.
Conclusion: RF is the optimal model for predicting the risk of ICU death in CA patients. The development of this model is important for early identification and intervention of CA patients who are at risk of dying in the ICU.
期刊介绍:
European Journal of Medical Research publishes translational and clinical research of international interest across all medical disciplines, enabling clinicians and other researchers to learn about developments and innovations within these disciplines and across the boundaries between disciplines. The journal publishes high quality research and reviews and aims to ensure that the results of all well-conducted research are published, regardless of their outcome.