{"title":"基于多中心机器学习的ICU低血钙患者死亡率预测。","authors":"Liangpeng Xie, Linxuan Jiang, Mingxuan Xiao, Jiaowen Sheng, Xin Li, Chang Zhang","doi":"10.1097/SHK.0000000000002680","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Hypocalcemia occurs frequently in intensive-care units (ICUs) and is independently associated with excess mortality. Conventional severity scores-such as APACHE II and SOFA-assign fixed weights to a limited set of variables and therefore fail to capture the nonlinear, high-dimensional physiology characteristic of hypocalcemic patients. Although machine-learning (ML) approaches can enhance risk stratification, no interpretable model tailored to this cohort has been available.</p><p><strong>Methods: </strong>We harmonised de-identified data from MIMIC-III, MIMIC-IV and two Grade III Level A hospitals in China, generating a multicentre cohort of 13,979 adult ICU admissions with total serum calcium < 2.12 mmol L-1. MIMIC-IV records were randomly divided into a training set (n = 7,749) and an internal-validation set (n = 1,550). External validation employed MIMIC-III (n = 4,771) and the Chinese multicentre dataset (n = 209). Predictors were filtered with least-absolute-shrinkage-and-selection operator (LASSO) regression and applied to eight ML algorithms: logistic regression, k-nearest neighbors (KNN), support-vector machine, decision tree, random forest, artificial neural network, eXtreme Gradient Boosting (XGBoost) and LightGBM. Model discrimination, calibration and clinical utility were quantified using the area under the receiver-operating-characteristic curve (AUC), F1-score, sensitivity, specificity, calibration plots, decision-curve analysis (DCA) and clinical-impact curves (CIC). SHapley Additive exPlanations (SHAP) were used for interpretability, and the final model was deployed as a public web application.</p><p><strong>Results: </strong>LASSO retained 20 predictive variables; is_noninvasive_ventilator and hospital length of stay were the most influential in SHAP analysis. XGBoost provided the highest discrimination (AUC = 0.914; F1 = 0.844), surpassing logistic regression (AUC = 0.896; F1 = 0.829), LightGBM (AUC = 0.909; F1 = 0.816) and conventional ICU scores. Calibration curves, DCA and CIC confirmed consistent performance and superior net benefit across internal and external validation cohorts.</p><p><strong>Conclusions: </strong>We present and externally validate an interpretable, high-performance ML model that predicts in-hospital mortality in hypocalcemic ICU patients more accurately than established scoring systems. The SHAP-enabled web interface provides real-time, patient-specific risk estimates, facilitating data-driven clinical decisions within the early, critical window of ICU care.</p>","PeriodicalId":21667,"journal":{"name":"SHOCK","volume":" ","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Center Machine Learning-Based Prediction of Mortality in ICU Patients with Hypocalcemia.\",\"authors\":\"Liangpeng Xie, Linxuan Jiang, Mingxuan Xiao, Jiaowen Sheng, Xin Li, Chang Zhang\",\"doi\":\"10.1097/SHK.0000000000002680\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Hypocalcemia occurs frequently in intensive-care units (ICUs) and is independently associated with excess mortality. Conventional severity scores-such as APACHE II and SOFA-assign fixed weights to a limited set of variables and therefore fail to capture the nonlinear, high-dimensional physiology characteristic of hypocalcemic patients. Although machine-learning (ML) approaches can enhance risk stratification, no interpretable model tailored to this cohort has been available.</p><p><strong>Methods: </strong>We harmonised de-identified data from MIMIC-III, MIMIC-IV and two Grade III Level A hospitals in China, generating a multicentre cohort of 13,979 adult ICU admissions with total serum calcium < 2.12 mmol L-1. MIMIC-IV records were randomly divided into a training set (n = 7,749) and an internal-validation set (n = 1,550). External validation employed MIMIC-III (n = 4,771) and the Chinese multicentre dataset (n = 209). Predictors were filtered with least-absolute-shrinkage-and-selection operator (LASSO) regression and applied to eight ML algorithms: logistic regression, k-nearest neighbors (KNN), support-vector machine, decision tree, random forest, artificial neural network, eXtreme Gradient Boosting (XGBoost) and LightGBM. Model discrimination, calibration and clinical utility were quantified using the area under the receiver-operating-characteristic curve (AUC), F1-score, sensitivity, specificity, calibration plots, decision-curve analysis (DCA) and clinical-impact curves (CIC). SHapley Additive exPlanations (SHAP) were used for interpretability, and the final model was deployed as a public web application.</p><p><strong>Results: </strong>LASSO retained 20 predictive variables; is_noninvasive_ventilator and hospital length of stay were the most influential in SHAP analysis. XGBoost provided the highest discrimination (AUC = 0.914; F1 = 0.844), surpassing logistic regression (AUC = 0.896; F1 = 0.829), LightGBM (AUC = 0.909; F1 = 0.816) and conventional ICU scores. Calibration curves, DCA and CIC confirmed consistent performance and superior net benefit across internal and external validation cohorts.</p><p><strong>Conclusions: </strong>We present and externally validate an interpretable, high-performance ML model that predicts in-hospital mortality in hypocalcemic ICU patients more accurately than established scoring systems. The SHAP-enabled web interface provides real-time, patient-specific risk estimates, facilitating data-driven clinical decisions within the early, critical window of ICU care.</p>\",\"PeriodicalId\":21667,\"journal\":{\"name\":\"SHOCK\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SHOCK\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1097/SHK.0000000000002680\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CRITICAL CARE MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SHOCK","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/SHK.0000000000002680","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CRITICAL CARE MEDICINE","Score":null,"Total":0}
Multi-Center Machine Learning-Based Prediction of Mortality in ICU Patients with Hypocalcemia.
Background: Hypocalcemia occurs frequently in intensive-care units (ICUs) and is independently associated with excess mortality. Conventional severity scores-such as APACHE II and SOFA-assign fixed weights to a limited set of variables and therefore fail to capture the nonlinear, high-dimensional physiology characteristic of hypocalcemic patients. Although machine-learning (ML) approaches can enhance risk stratification, no interpretable model tailored to this cohort has been available.
Methods: We harmonised de-identified data from MIMIC-III, MIMIC-IV and two Grade III Level A hospitals in China, generating a multicentre cohort of 13,979 adult ICU admissions with total serum calcium < 2.12 mmol L-1. MIMIC-IV records were randomly divided into a training set (n = 7,749) and an internal-validation set (n = 1,550). External validation employed MIMIC-III (n = 4,771) and the Chinese multicentre dataset (n = 209). Predictors were filtered with least-absolute-shrinkage-and-selection operator (LASSO) regression and applied to eight ML algorithms: logistic regression, k-nearest neighbors (KNN), support-vector machine, decision tree, random forest, artificial neural network, eXtreme Gradient Boosting (XGBoost) and LightGBM. Model discrimination, calibration and clinical utility were quantified using the area under the receiver-operating-characteristic curve (AUC), F1-score, sensitivity, specificity, calibration plots, decision-curve analysis (DCA) and clinical-impact curves (CIC). SHapley Additive exPlanations (SHAP) were used for interpretability, and the final model was deployed as a public web application.
Results: LASSO retained 20 predictive variables; is_noninvasive_ventilator and hospital length of stay were the most influential in SHAP analysis. XGBoost provided the highest discrimination (AUC = 0.914; F1 = 0.844), surpassing logistic regression (AUC = 0.896; F1 = 0.829), LightGBM (AUC = 0.909; F1 = 0.816) and conventional ICU scores. Calibration curves, DCA and CIC confirmed consistent performance and superior net benefit across internal and external validation cohorts.
Conclusions: We present and externally validate an interpretable, high-performance ML model that predicts in-hospital mortality in hypocalcemic ICU patients more accurately than established scoring systems. The SHAP-enabled web interface provides real-time, patient-specific risk estimates, facilitating data-driven clinical decisions within the early, critical window of ICU care.
期刊介绍:
SHOCK®: Injury, Inflammation, and Sepsis: Laboratory and Clinical Approaches includes studies of novel therapeutic approaches, such as immunomodulation, gene therapy, nutrition, and others. The mission of the Journal is to foster and promote multidisciplinary studies, both experimental and clinical in nature, that critically examine the etiology, mechanisms and novel therapeutics of shock-related pathophysiological conditions. Its purpose is to excel as a vehicle for timely publication in the areas of basic and clinical studies of shock, trauma, sepsis, inflammation, ischemia, and related pathobiological states, with particular emphasis on the biologic mechanisms that determine the response to such injury. Making such information available will ultimately facilitate improved care of the traumatized or septic individual.