ICU readmission and mortality risk prediction: Generalizability of a multi-hospital model
Tariq A. Dam, Daan de Bruin, Giovanni Cinà, Patrick J. Thoral, Paul W.G. Elbers, Corstiaan A. den Uil, Reinier F. Crane
Journal of intensive medicine, Volume 5, Issue 4, Pages 377–384, 1 October 2025
DOI: 10.1016/j.jointm.2025.03.007
https://www.sciencedirect.com/science/article/pii/S2667100X25000301
Abstract
Background
Inadvertent intensive care unit (ICU) readmission is associated with longer length of stay and increased mortality. Conversely, delayed ICU discharge may represent inefficient use of resources. To better inform discharge timing, several hospitals have implemented machine learning models to predict readmission risk following discharge. However, these models are typically created locally and may not generalize well to other hospitals or patient populations. A single multi-hospital-based model might provide more accurate predictions and insight into features that are applicable across diverse clinical settings.
Methods
This study involved a retrospective multi-center cohort from one academic hospital (Amsterdam University Medical Center [AUMC]) and two large teaching hospitals (Maasstad Ziekenhuis [MSZ] and OLVG). Data from the latter two hospitals were combined to create a pooled model, which was tested on the academic hospital dataset. For each hospital, data on all adult ICU patients were included, spanning from the implementation of its electronic health record system to the start of model development. An XGBoost model was trained to predict a composite outcome of readmission or mortality within 7 days of discharge, and an autoencoder was used as an out-of-distribution (OOD) detector to capture dataset heterogeneity.
Results
In total, 44,837 patients were available for analysis across the three hospitals. The average readmission rates were 7.1%, 6.9%, and 5.9% for MSZ, OLVG, and AUMC, respectively. Evaluated on AUMC data, the local MSZ, OLVG, and AUMC models achieved weighted areas under the receiver operating characteristic curve (AUROC) of 69.7% ± 0.8%, 70.5% ± 0.5%, and 76.5% ± 1.9%, respectively, whereas the pooled model achieved a weighted AUROC of 71.1% ± 0.7%. The difference between internal and external performance was reduced when cardiac surgery patients were excluded. The key features across models were albumin levels and the use of oxygen therapy.
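The AUROC figures above are reported as mean ± spread. One common way to obtain such intervals (the abstract does not say which the authors used, so this is an assumption) is bootstrap resampling of the evaluation set, sketched here on synthetic scores:

```python
import numpy as np
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)

# Synthetic evaluation set: binary outcomes and model scores that are
# mildly informative (positives tend to score higher).
y_true = rng.integers(0, 2, size=1000)
y_score = y_true * 0.3 + rng.normal(0.35, 0.25, size=1000)

# Bootstrap the AUROC: resample patients with replacement, recompute,
# and summarize as mean ± standard deviation (e.g. "71.1% ± 0.7%").
aucs = []
for _ in range(200):
    idx = rng.integers(0, len(y_true), size=len(y_true))
    if y_true[idx].min() == y_true[idx].max():
        continue  # AUROC is undefined on a single-class resample
    aucs.append(roc_auc_score(y_true[idx], y_score[idx]))

mean_auc, sd_auc = np.mean(aucs), np.std(aucs)
print(f"AUROC = {100 * mean_auc:.1f}% ± {100 * sd_auc:.1f}%")
```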
Discussion
A single, multi-hospital-based model performed comparably on external datasets, especially when cardiac surgery patients were excluded. However, when applied externally, model predictions risk being miscalibrated for specific patient subgroups and require careful recalibration before implementation. While external models were more stable than local ones across OOD scores, their performance was comparable after excluding cardiac surgery patients. Although pooling data marginally improved performance on external datasets, incorporating data from a more diverse set of hospitals is likely to provide greater benefits.
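The recalibration step the discussion calls for can be illustrated with isotonic regression, one standard choice alongside Platt scaling (the abstract does not specify a method, so this is an assumption). The idea: take the raw risks an externally developed model produces, fit a monotone mapping against locally observed outcomes, and deploy the remapped risks.

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression

rng = np.random.default_rng(0)

# Hypothetical scenario: an external model systematically over-predicts
# risk at the new hospital, so its raw outputs need local recalibration.
raw_risk = rng.uniform(0, 1, size=800)                      # external model outputs
y_local = rng.binomial(1, np.clip(raw_risk * 0.6, 0, 1))    # local outcomes

# Fit a monotone (isotonic) mapping from raw risk to observed event rate.
iso = IsotonicRegression(out_of_bounds="clip")
calibrated = iso.fit_transform(raw_risk, y_local)
```

After fitting on a local validation cohort, `iso.predict` would be applied to every new prediction; the mapping preserves the model's patient ranking (and hence its AUROC) while aligning predicted risks with locally observed event rates.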