Machine Learning for the Prediction of Acute Kidney Injury in Critically Ill Patients With Coronary Heart Disease: Algorithm Development and Validation.
{"title":"Machine Learning for the Prediction of Acute Kidney Injury in Critically Ill Patients With Coronary Heart Disease: Algorithm Development and Validation.","authors":"Yike Li, Mingyang Xiao, Yaqian Li, Lulu Lv, Shanshan Zhang, Yuhui Liu, Juan Zhang","doi":"10.2196/72349","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Acute kidney injury (AKI) frequently occurs in critically ill patients with coronary heart disease (CHD), and its development markedly elevates mortality rates and prolongs hospitalization duration. Early AKI prediction is crucial for timely intervention and amelioration of patient outcomes.</p><p><strong>Objective: </strong>This study aimed to develop and verify a clinical prediction model for the occurrence of AKI upon admission in the critically ill population with CHD through machine learning (ML).</p><p><strong>Methods: </strong>Data from the MIMIC-IV (Medical Information Mart for Intensive Care IV) version 2.2 database were gathered and included information about critically ill individuals with CHD in the intensive care unit (ICU). The dataset was randomized into a training set (70%) and a testing set (30%). Least absolute shrinkage and selection operator (LASSO) regression was used for feature variable selection. ML models, including logistic regression (LR), decision tree (DT), naive Bayes (NB), random forest (RF), extreme gradient boosting (XGBoost), and support vector machine (SVM), were constructed using 13 variables in the training set. The 6 models were compared in the testing set to identify the best-performing model. Subsequently, the model was assessed using calibration curve analysis and decision curve analysis (DCA). External validation was conducted using data from the Second Affiliated Hospital of Zhengzhou University. Ultimately, the predictive model was interpreted via Shapley Additive Explanation (SHAP) values.</p><p><strong>Results: </strong>In total, 2711 patients with CHD admitted to the ICU were selected, with 1809 (66.7%) having AKI. XGBoost exhibited the best performance regarding discrimination (area under the receiver operating characteristic curve [AUROC]=0.765, 95% CI 0.731-0.800), accuracy (0.725), and sensitivity (0.759). External validation using a cohort of 226 patients confirmed the strong generalizability of the XGBoost model (AUROC=0.835, 95% CI 0.782-0.887). Feature importance analyses derived from SHAP values, DT, RF, and XGBoost consistently identified 5 key predictors associated with the development of AKI: mechanical ventilation, use of antiplatelet agents, age, N-terminal pro-B-type natriuretic peptide (NT-proBNP) levels, and acute physiology score III (APSIII).</p><p><strong>Conclusions: </strong>ML models can serve as reliable tools for forecasting AKI in the critically ill population with CHD. The XGBoost model is highly accurate and may aid doctors in identifying high-risk individuals for early intervention to lower mortality.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":" ","pages":"e72349"},"PeriodicalIF":3.1000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/72349","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Acute kidney injury (AKI) frequently occurs in critically ill patients with coronary heart disease (CHD), and its development markedly elevates mortality rates and prolongs hospitalization duration. Early AKI prediction is crucial for timely intervention and amelioration of patient outcomes.
Objective: This study aimed to develop and verify a clinical prediction model for the occurrence of AKI upon admission in the critically ill population with CHD through machine learning (ML).
Methods: Data from the MIMIC-IV (Medical Information Mart for Intensive Care IV) version 2.2 database were gathered and included information about critically ill individuals with CHD in the intensive care unit (ICU). The dataset was randomized into a training set (70%) and a testing set (30%). Least absolute shrinkage and selection operator (LASSO) regression was used for feature variable selection. ML models, including logistic regression (LR), decision tree (DT), naive Bayes (NB), random forest (RF), extreme gradient boosting (XGBoost), and support vector machine (SVM), were constructed using 13 variables in the training set. The 6 models were compared in the testing set to identify the best-performing model. Subsequently, the model was assessed using calibration curve analysis and decision curve analysis (DCA). External validation was conducted using data from the Second Affiliated Hospital of Zhengzhou University. Ultimately, the predictive model was interpreted via Shapley Additive Explanation (SHAP) values.
Results: In total, 2711 patients with CHD admitted to the ICU were selected, with 1809 (66.7%) having AKI. XGBoost exhibited the best performance regarding discrimination (area under the receiver operating characteristic curve [AUROC]=0.765, 95% CI 0.731-0.800), accuracy (0.725), and sensitivity (0.759). External validation using a cohort of 226 patients confirmed the strong generalizability of the XGBoost model (AUROC=0.835, 95% CI 0.782-0.887). Feature importance analyses derived from SHAP values, DT, RF, and XGBoost consistently identified 5 key predictors associated with the development of AKI: mechanical ventilation, use of antiplatelet agents, age, N-terminal pro-B-type natriuretic peptide (NT-proBNP) levels, and acute physiology score III (APSIII).
Conclusions: ML models can serve as reliable tools for forecasting AKI in the critically ill population with CHD. The XGBoost model is highly accurate and may aid doctors in identifying high-risk individuals for early intervention to lower mortality.
期刊介绍:
JMIR Medical Informatics (JMI, ISSN 2291-9694) is a top-rated, tier A journal which focuses on clinical informatics, big data in health and health care, decision support for health professionals, electronic health records, ehealth infrastructures and implementation. It has a focus on applied, translational research, with a broad readership including clinicians, CIOs, engineers, industry and health informatics professionals.
Published by JMIR Publications, publisher of the Journal of Medical Internet Research (JMIR), the leading eHealth/mHealth journal (Impact Factor 2016: 5.175), JMIR Med Inform has a slightly different scope (emphasizing more on applications for clinicians and health professionals rather than consumers/citizens, which is the focus of JMIR), publishes even faster, and also allows papers which are more technical or more formative than what would be published in the Journal of Medical Internet Research.