Development and validation of an interpretable multi-task model to predict outcomes in patients with rhabdomyolysis: a multicenter retrospective cohort study.

IF 10 1区医学 Q1 MEDICINE, GENERAL & INTERNAL

EClinicalMedicine Pub Date : 2025-08-21 eCollection Date: 2025-09-01 DOI:10.1016/j.eclinm.2025.103438

Chunli Liu, Jie Shi, Fengjuan Wang, Duo Li, Yu Luo, Bofan Yang, Yunlong Zhao, Li Zhang, Dingwei Yang, Heng Jin, Jie Song, Xiaoqin Guo, Haojun Fan, Qi Lv

{"title":"Development and validation of an interpretable multi-task model to predict outcomes in patients with rhabdomyolysis: a multicenter retrospective cohort study.","authors":"Chunli Liu, Jie Shi, Fengjuan Wang, Duo Li, Yu Luo, Bofan Yang, Yunlong Zhao, Li Zhang, Dingwei Yang, Heng Jin, Jie Song, Xiaoqin Guo, Haojun Fan, Qi Lv","doi":"10.1016/j.eclinm.2025.103438","DOIUrl":null,"url":null,"abstract":"Background: Rhabdomyolysis (RM) is a complex clinical syndrome with heterogeneous progression patterns among patients of varying severity. Early and accurate prediction of acute kidney injury (AKI), disease severity, renal replacement therapy (RRT) requirements, and mortality risk is essential for timely identification of high-risk individuals, personalized treatment planning, and optimal allocation of healthcare resources. We aimed to develop and externally validate an interpretable multi-task machine learning (ML) model to predict four clinical outcomes in patients with rhabdomyolysis: AKI, disease severity, the need for RRT, and in-hospital mortality.Methods: We conducted a retrospective study using three data sources: the eICU Collaborative Research Database (eICU-CRD), the Medical Information Mart for Intensive Care IV (MIMIC-IV), and electronic medical records from four tertiary hospitals in China. Data from eICU-CRD and MIMIC-IV were combined to form the derivation cohort for model training and internal validation, while data from the Chinese hospitals served as the external validation cohort. We analyzed 1429 patients from 2008 to 2019 in the derivation cohort and 362 patients from 2016 to 2022 in the external validation cohort. AKI was defined according to the Kidney Disease: Improving Global Outcomes (KDIGO) criteria, based on serum creatinine levels and urine output. Twenty-two clinical features available within the first 24 h of admission were selected to develop the prediction models. Ten machine learning (ML) algorithms were applied to construct multi-task prediction models. Model performance was evaluated using the area under the receiver operating characteristic curve (AUC). To improve interpretability, feature importance was assessed using the SHapley Additive exPlanation (SHAP) method.Findings: 1429 patients were included in the derivation cohort (69.4% developed AKI, 36.7% were classified as having severe disease, 12.1% required RRT, and 9.8% had in-hospital mortality). 362 patients were included in the external validation cohort (27.9% developed AKI, 25.7% had severe disease, 27.3% required RRT, and 4.1% had in-hospital mortality). Among all evaluated models, the random forest (RF) algorithm exhibited the highest overall discriminative performance across the four prediction tasks. Based on feature importance rankings, interpretable final models were developed for each task using the top five contributing features. These models demonstrated robust predictive accuracy for AKI, disease severity, RRT requirements, and in-hospital mortality, with AUCs and corresponding 95% confidence intervals (CIs) of 0.914 (0.875-0.944), 0.909 (0.869-0.940), 0.888 (0.844-0.921), and 0.823 (0.773-0.865) in the internal validation cohort, and 0.906 (0.871-0.934), 0.856 (0.815-0.890), 0.852 (0.811-0.887), and 0.832 (0.789-0.869) in the external validation cohort, respectively. To support clinical implementation, a web- and Android-based decision support system was developed and is currently undergoing pilot testing in multiple hospitals.Interpretation: We developed and validated an interpretable multi-task ML model capable of accurately predicting key clinical outcomes in patients with RM. To improve clinical applicability, a user-friendly decision support system was implemented, incorporating interactive features to support frontline healthcare providers in real-time risk stratification and individualized management of RM.Funding: National Key Research and Development Program of China (Nos. 2021YFC3002202 and 2023YFF1204104).","PeriodicalId":11393,"journal":{"name":"EClinicalMedicine","volume":"87 ","pages":"103438"},"PeriodicalIF":10.0000,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12396465/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EClinicalMedicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.eclinm.2025.103438","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Rhabdomyolysis (RM) is a complex clinical syndrome with heterogeneous progression patterns among patients of varying severity. Early and accurate prediction of acute kidney injury (AKI), disease severity, renal replacement therapy (RRT) requirements, and mortality risk is essential for timely identification of high-risk individuals, personalized treatment planning, and optimal allocation of healthcare resources. We aimed to develop and externally validate an interpretable multi-task machine learning (ML) model to predict four clinical outcomes in patients with rhabdomyolysis: AKI, disease severity, the need for RRT, and in-hospital mortality.

Methods: We conducted a retrospective study using three data sources: the eICU Collaborative Research Database (eICU-CRD), the Medical Information Mart for Intensive Care IV (MIMIC-IV), and electronic medical records from four tertiary hospitals in China. Data from eICU-CRD and MIMIC-IV were combined to form the derivation cohort for model training and internal validation, while data from the Chinese hospitals served as the external validation cohort. We analyzed 1429 patients from 2008 to 2019 in the derivation cohort and 362 patients from 2016 to 2022 in the external validation cohort. AKI was defined according to the Kidney Disease: Improving Global Outcomes (KDIGO) criteria, based on serum creatinine levels and urine output. Twenty-two clinical features available within the first 24 h of admission were selected to develop the prediction models. Ten machine learning (ML) algorithms were applied to construct multi-task prediction models. Model performance was evaluated using the area under the receiver operating characteristic curve (AUC). To improve interpretability, feature importance was assessed using the SHapley Additive exPlanation (SHAP) method.

Findings: 1429 patients were included in the derivation cohort (69.4% developed AKI, 36.7% were classified as having severe disease, 12.1% required RRT, and 9.8% had in-hospital mortality). 362 patients were included in the external validation cohort (27.9% developed AKI, 25.7% had severe disease, 27.3% required RRT, and 4.1% had in-hospital mortality). Among all evaluated models, the random forest (RF) algorithm exhibited the highest overall discriminative performance across the four prediction tasks. Based on feature importance rankings, interpretable final models were developed for each task using the top five contributing features. These models demonstrated robust predictive accuracy for AKI, disease severity, RRT requirements, and in-hospital mortality, with AUCs and corresponding 95% confidence intervals (CIs) of 0.914 (0.875-0.944), 0.909 (0.869-0.940), 0.888 (0.844-0.921), and 0.823 (0.773-0.865) in the internal validation cohort, and 0.906 (0.871-0.934), 0.856 (0.815-0.890), 0.852 (0.811-0.887), and 0.832 (0.789-0.869) in the external validation cohort, respectively. To support clinical implementation, a web- and Android-based decision support system was developed and is currently undergoing pilot testing in multiple hospitals.

Interpretation: We developed and validated an interpretable multi-task ML model capable of accurately predicting key clinical outcomes in patients with RM. To improve clinical applicability, a user-friendly decision support system was implemented, incorporating interactive features to support frontline healthcare providers in real-time risk stratification and individualized management of RM.

Funding: National Key Research and Development Program of China (Nos. 2021YFC3002202 and 2023YFF1204104).

查看原文本刊更多论文

开发和验证一个可解释的多任务模型来预测横纹肌溶解患者的预后：一项多中心回顾性队列研究。

背景：横纹肌溶解（RM）是一种复杂的临床综合征，在不同严重程度的患者中具有异质性的进展模式。早期准确预测急性肾损伤（AKI）、疾病严重程度、肾脏替代治疗（RRT）需求和死亡风险对于及时识别高危个体、制定个性化治疗计划和优化医疗资源分配至关重要。我们旨在开发并外部验证一个可解释的多任务机器学习（ML）模型，以预测横纹肌溶解患者的四种临床结果：AKI、疾病严重程度、RRT需求和住院死亡率。方法：采用eICU合作研究数据库（eICU- crd）、重症监护医疗信息市场（MIMIC-IV）和中国四家三级医院的电子病历三个数据来源进行回顾性研究。将来自eICU-CRD和MIMIC-IV的数据合并为衍生队列，用于模型训练和内部验证，而来自中国医院的数据作为外部验证队列。我们分析了2008年至2019年衍生队列中的1429例患者和2016年至2022年外部验证队列中的362例患者。AKI的定义是根据肾脏疾病：改善总体预后（KDIGO）标准，基于血清肌酐水平和尿量。选择入院前24小时内可获得的22个临床特征来建立预测模型。采用10种机器学习算法构建多任务预测模型。使用接收器工作特征曲线下面积（AUC）评估模型性能。为了提高可解释性，使用SHapley加性解释（SHAP）方法评估特征重要性。结果：1429例患者被纳入衍生队列（69.4%发生AKI， 36.7%归类为严重疾病，12.1%需要RRT， 9.8%住院死亡）。362名患者被纳入外部验证队列（27.9%发生AKI， 25.7%有严重疾病，27.3%需要RRT， 4.1%有住院死亡率）。在所有被评估的模型中，随机森林（RF）算法在四个预测任务中表现出最高的总体判别性能。基于特征重要性排名，使用前五个贡献特征为每个任务开发可解释的最终模型。这些模型对AKI、疾病严重程度、RRT需求和住院死亡率的预测具有较强的准确性，内部验证队列的auc和相应的95%置信区间（ci）分别为0.914（0.875-0.944）、0.909（0.869-0.940）、0.888（0.844-0.921）和0.823(0.773-0.865)，外部验证队列的auc和ci分别为0.906（0.871-0.934）、0.856（0.815-0.890）、0.852（0.811-0.887）和0.832（0.889 -0.869）。为了支持临床实施，开发了一个基于web和android的决策支持系统，目前正在多家医院进行试点测试。解释：我们开发并验证了一个可解释的多任务ML模型，该模型能够准确预测RM患者的关键临床结果。为了提高临床适用性，实施了一个用户友好的决策支持系统，该系统包含交互式功能，以支持一线医疗保健提供者实时进行风险分层和RM个性化管理。国家重点研发计划项目（2021YFC3002202和2023YFF1204104）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

EClinicalMedicine Medicine-Medicine (all)

CiteScore

18.90

自引率

1.30%

发文量

506

审稿时长

22 days

期刊介绍： eClinicalMedicine is a gold open-access clinical journal designed to support frontline health professionals in addressing the complex and rapid health transitions affecting societies globally. The journal aims to assist practitioners in overcoming healthcare challenges across diverse communities, spanning diagnosis, treatment, prevention, and health promotion. Integrating disciplines from various specialties and life stages, it seeks to enhance health systems as fundamental institutions within societies. With a forward-thinking approach, eClinicalMedicine aims to redefine the future of healthcare.