基于多中心机器学习的ICU低血钙患者死亡率预测。

IF 2.9 3区 医学 Q2 CRITICAL CARE MEDICINE
SHOCK Pub Date : 2025-07-25 DOI:10.1097/SHK.0000000000002680
Liangpeng Xie, Linxuan Jiang, Mingxuan Xiao, Jiaowen Sheng, Xin Li, Chang Zhang
{"title":"基于多中心机器学习的ICU低血钙患者死亡率预测。","authors":"Liangpeng Xie, Linxuan Jiang, Mingxuan Xiao, Jiaowen Sheng, Xin Li, Chang Zhang","doi":"10.1097/SHK.0000000000002680","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Hypocalcemia occurs frequently in intensive-care units (ICUs) and is independently associated with excess mortality. Conventional severity scores-such as APACHE II and SOFA-assign fixed weights to a limited set of variables and therefore fail to capture the nonlinear, high-dimensional physiology characteristic of hypocalcemic patients. Although machine-learning (ML) approaches can enhance risk stratification, no interpretable model tailored to this cohort has been available.</p><p><strong>Methods: </strong>We harmonised de-identified data from MIMIC-III, MIMIC-IV and two Grade III Level A hospitals in China, generating a multicentre cohort of 13,979 adult ICU admissions with total serum calcium < 2.12 mmol L-1. MIMIC-IV records were randomly divided into a training set (n = 7,749) and an internal-validation set (n = 1,550). External validation employed MIMIC-III (n = 4,771) and the Chinese multicentre dataset (n = 209). Predictors were filtered with least-absolute-shrinkage-and-selection operator (LASSO) regression and applied to eight ML algorithms: logistic regression, k-nearest neighbors (KNN), support-vector machine, decision tree, random forest, artificial neural network, eXtreme Gradient Boosting (XGBoost) and LightGBM. Model discrimination, calibration and clinical utility were quantified using the area under the receiver-operating-characteristic curve (AUC), F1-score, sensitivity, specificity, calibration plots, decision-curve analysis (DCA) and clinical-impact curves (CIC). SHapley Additive exPlanations (SHAP) were used for interpretability, and the final model was deployed as a public web application.</p><p><strong>Results: </strong>LASSO retained 20 predictive variables; is_noninvasive_ventilator and hospital length of stay were the most influential in SHAP analysis. XGBoost provided the highest discrimination (AUC = 0.914; F1 = 0.844), surpassing logistic regression (AUC = 0.896; F1 = 0.829), LightGBM (AUC = 0.909; F1 = 0.816) and conventional ICU scores. Calibration curves, DCA and CIC confirmed consistent performance and superior net benefit across internal and external validation cohorts.</p><p><strong>Conclusions: </strong>We present and externally validate an interpretable, high-performance ML model that predicts in-hospital mortality in hypocalcemic ICU patients more accurately than established scoring systems. The SHAP-enabled web interface provides real-time, patient-specific risk estimates, facilitating data-driven clinical decisions within the early, critical window of ICU care.</p>","PeriodicalId":21667,"journal":{"name":"SHOCK","volume":" ","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Center Machine Learning-Based Prediction of Mortality in ICU Patients with Hypocalcemia.\",\"authors\":\"Liangpeng Xie, Linxuan Jiang, Mingxuan Xiao, Jiaowen Sheng, Xin Li, Chang Zhang\",\"doi\":\"10.1097/SHK.0000000000002680\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Hypocalcemia occurs frequently in intensive-care units (ICUs) and is independently associated with excess mortality. Conventional severity scores-such as APACHE II and SOFA-assign fixed weights to a limited set of variables and therefore fail to capture the nonlinear, high-dimensional physiology characteristic of hypocalcemic patients. Although machine-learning (ML) approaches can enhance risk stratification, no interpretable model tailored to this cohort has been available.</p><p><strong>Methods: </strong>We harmonised de-identified data from MIMIC-III, MIMIC-IV and two Grade III Level A hospitals in China, generating a multicentre cohort of 13,979 adult ICU admissions with total serum calcium < 2.12 mmol L-1. MIMIC-IV records were randomly divided into a training set (n = 7,749) and an internal-validation set (n = 1,550). External validation employed MIMIC-III (n = 4,771) and the Chinese multicentre dataset (n = 209). Predictors were filtered with least-absolute-shrinkage-and-selection operator (LASSO) regression and applied to eight ML algorithms: logistic regression, k-nearest neighbors (KNN), support-vector machine, decision tree, random forest, artificial neural network, eXtreme Gradient Boosting (XGBoost) and LightGBM. Model discrimination, calibration and clinical utility were quantified using the area under the receiver-operating-characteristic curve (AUC), F1-score, sensitivity, specificity, calibration plots, decision-curve analysis (DCA) and clinical-impact curves (CIC). SHapley Additive exPlanations (SHAP) were used for interpretability, and the final model was deployed as a public web application.</p><p><strong>Results: </strong>LASSO retained 20 predictive variables; is_noninvasive_ventilator and hospital length of stay were the most influential in SHAP analysis. XGBoost provided the highest discrimination (AUC = 0.914; F1 = 0.844), surpassing logistic regression (AUC = 0.896; F1 = 0.829), LightGBM (AUC = 0.909; F1 = 0.816) and conventional ICU scores. Calibration curves, DCA and CIC confirmed consistent performance and superior net benefit across internal and external validation cohorts.</p><p><strong>Conclusions: </strong>We present and externally validate an interpretable, high-performance ML model that predicts in-hospital mortality in hypocalcemic ICU patients more accurately than established scoring systems. The SHAP-enabled web interface provides real-time, patient-specific risk estimates, facilitating data-driven clinical decisions within the early, critical window of ICU care.</p>\",\"PeriodicalId\":21667,\"journal\":{\"name\":\"SHOCK\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SHOCK\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1097/SHK.0000000000002680\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CRITICAL CARE MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SHOCK","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/SHK.0000000000002680","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CRITICAL CARE MEDICINE","Score":null,"Total":0}
引用次数: 0

摘要

背景:低钙血症经常发生在重症监护病房(icu),并与高死亡率独立相关。传统的严重程度评分——如APACHE II和sofa——将固定的权重分配给有限的一组变量,因此无法捕捉低钙患者的非线性、高维生理特征。尽管机器学习(ML)方法可以增强风险分层,但目前还没有针对这一人群量身定制的可解释模型。方法:我们统一了来自中国MIMIC-III、MIMIC-IV和两家三级甲等医院的去识别数据,生成了一个多中心队列,包括13979名总血清钙< 2.12 mmol L-1的ICU成人住院患者。MIMIC-IV记录被随机分为训练集(n = 7,749)和内部验证集(n = 1,550)。外部验证采用MIMIC-III (n = 4,771)和中国多中心数据集(n = 209)。预测因子用最小绝对收缩和选择算子(LASSO)回归进行过滤,并应用于8种ML算法:逻辑回归、k近邻(KNN)、支持向量机、决策树、随机森林、人工神经网络、极限梯度增强(XGBoost)和LightGBM。采用受试者工作特征曲线下面积(AUC)、f1评分、敏感性、特异性、校准图、决策曲线分析(DCA)和临床影响曲线(CIC)对模型判别、校准和临床效用进行量化。SHapley加性解释(SHAP)用于可解释性,最终模型被部署为公共web应用程序。结果:LASSO保留了20个预测变量;无创呼吸机和住院时间是影响SHAP分析的主要因素。XGBoost的鉴别率最高(AUC = 0.914;F1 = 0.844),优于logistic回归(AUC = 0.896;F1 = 0.829), LightGBM (AUC = 0.909;F1 = 0.816)与常规ICU评分比较。校准曲线,DCA和CIC在内部和外部验证队列中证实了一致的性能和优越的净效益。结论:我们提出并外部验证了一个可解释的高性能ML模型,该模型比现有评分系统更准确地预测低钙ICU患者的住院死亡率。支持shap的web界面提供实时的、特定于患者的风险评估,促进在ICU护理的早期关键窗口内进行数据驱动的临床决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-Center Machine Learning-Based Prediction of Mortality in ICU Patients with Hypocalcemia.

Background: Hypocalcemia occurs frequently in intensive-care units (ICUs) and is independently associated with excess mortality. Conventional severity scores-such as APACHE II and SOFA-assign fixed weights to a limited set of variables and therefore fail to capture the nonlinear, high-dimensional physiology characteristic of hypocalcemic patients. Although machine-learning (ML) approaches can enhance risk stratification, no interpretable model tailored to this cohort has been available.

Methods: We harmonised de-identified data from MIMIC-III, MIMIC-IV and two Grade III Level A hospitals in China, generating a multicentre cohort of 13,979 adult ICU admissions with total serum calcium < 2.12 mmol L-1. MIMIC-IV records were randomly divided into a training set (n = 7,749) and an internal-validation set (n = 1,550). External validation employed MIMIC-III (n = 4,771) and the Chinese multicentre dataset (n = 209). Predictors were filtered with least-absolute-shrinkage-and-selection operator (LASSO) regression and applied to eight ML algorithms: logistic regression, k-nearest neighbors (KNN), support-vector machine, decision tree, random forest, artificial neural network, eXtreme Gradient Boosting (XGBoost) and LightGBM. Model discrimination, calibration and clinical utility were quantified using the area under the receiver-operating-characteristic curve (AUC), F1-score, sensitivity, specificity, calibration plots, decision-curve analysis (DCA) and clinical-impact curves (CIC). SHapley Additive exPlanations (SHAP) were used for interpretability, and the final model was deployed as a public web application.

Results: LASSO retained 20 predictive variables; is_noninvasive_ventilator and hospital length of stay were the most influential in SHAP analysis. XGBoost provided the highest discrimination (AUC = 0.914; F1 = 0.844), surpassing logistic regression (AUC = 0.896; F1 = 0.829), LightGBM (AUC = 0.909; F1 = 0.816) and conventional ICU scores. Calibration curves, DCA and CIC confirmed consistent performance and superior net benefit across internal and external validation cohorts.

Conclusions: We present and externally validate an interpretable, high-performance ML model that predicts in-hospital mortality in hypocalcemic ICU patients more accurately than established scoring systems. The SHAP-enabled web interface provides real-time, patient-specific risk estimates, facilitating data-driven clinical decisions within the early, critical window of ICU care.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
SHOCK
SHOCK 医学-外科
CiteScore
6.20
自引率
3.20%
发文量
199
审稿时长
1 months
期刊介绍: SHOCK®: Injury, Inflammation, and Sepsis: Laboratory and Clinical Approaches includes studies of novel therapeutic approaches, such as immunomodulation, gene therapy, nutrition, and others. The mission of the Journal is to foster and promote multidisciplinary studies, both experimental and clinical in nature, that critically examine the etiology, mechanisms and novel therapeutics of shock-related pathophysiological conditions. Its purpose is to excel as a vehicle for timely publication in the areas of basic and clinical studies of shock, trauma, sepsis, inflammation, ischemia, and related pathobiological states, with particular emphasis on the biologic mechanisms that determine the response to such injury. Making such information available will ultimately facilitate improved care of the traumatized or septic individual.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信