Developing interpretable machine learning models to predict length of stay and disposition decision for adult patients in emergency departments.

IF 4.1 Q1 HEALTH CARE SCIENCES & SERVICES

BMJ Health & Care Informatics Pub Date : 2025-06-26 DOI:10.1136/bmjhci-2024-101152

Long Song, Uwe Aickelin, Timothy N Fazio, Abhishek Sharma, Mojgan Kouhounestani, Samantha Plumb, Mark John Putland

{"title":"Developing interpretable machine learning models to predict length of stay and disposition decision for adult patients in emergency departments.","authors":"Long Song, Uwe Aickelin, Timothy N Fazio, Abhishek Sharma, Mojgan Kouhounestani, Samantha Plumb, Mark John Putland","doi":"10.1136/bmjhci-2024-101152","DOIUrl":null,"url":null,"abstract":"Objective: Machine learning (ML) models have emerged as tools to predict length of stay (LOS) and disposition decision (DD) in emergency departments (EDs) to combat overcrowding. However, site-specific ML models are not transferable to different sites. Our objective was to develop interpretable ML models to predict LOS and DD at specific time points, all while establishing a transparent data analysis framework. This framework was designed to be easily adapted by other institutions for the development of their own ML models.Methods: We analysed data from 297 392 ED visits of patients aged 18 and above at a quaternary hospital between 30 June 2019 and 31 December 2022. Eight ML algorithms were evaluated, and ultimately, twelve lasso models built from 21 features were trained to predict four outcomes of LOS and DD at three time points post-triage. Hold-out testing and cross-validation were conducted for these models.Results: The area under the curve values were 0.862/0.868/0.878 for binary LOS predictions at 10, 60 and 120-minute time points and 0.839/0.851/0.863 for binary DD predictions. The accuracies were 60.2%/60.7%/61.9% for ternary LOS predictions and 61.5%/62.3%/63.4% for ternary DD predictions.Conclusions: Interpretable ML models demonstrated outstanding performances in predicting both LOS and DD. The transparent data analysis framework can be easily adapted by other institutions.","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.1000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ Health & Care Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmjhci-2024-101152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: Machine learning (ML) models have emerged as tools to predict length of stay (LOS) and disposition decision (DD) in emergency departments (EDs) to combat overcrowding. However, site-specific ML models are not transferable to different sites. Our objective was to develop interpretable ML models to predict LOS and DD at specific time points, all while establishing a transparent data analysis framework. This framework was designed to be easily adapted by other institutions for the development of their own ML models.

Methods: We analysed data from 297 392 ED visits of patients aged 18 and above at a quaternary hospital between 30 June 2019 and 31 December 2022. Eight ML algorithms were evaluated, and ultimately, twelve lasso models built from 21 features were trained to predict four outcomes of LOS and DD at three time points post-triage. Hold-out testing and cross-validation were conducted for these models.

Results: The area under the curve values were 0.862/0.868/0.878 for binary LOS predictions at 10, 60 and 120-minute time points and 0.839/0.851/0.863 for binary DD predictions. The accuracies were 60.2%/60.7%/61.9% for ternary LOS predictions and 61.5%/62.3%/63.4% for ternary DD predictions.

Conclusions: Interpretable ML models demonstrated outstanding performances in predicting both LOS and DD. The transparent data analysis framework can be easily adapted by other institutions.

查看原文本刊更多论文

开发可解释的机器学习模型来预测急诊科成年患者的住院时间和处置决定。

目的：机器学习（ML）模型已经成为预测急诊部门（ed）住院时间（LOS）和处置决定（DD）的工具，以应对过度拥挤。但是，特定于站点的ML模型不能转移到不同的站点。我们的目标是开发可解释的ML模型来预测特定时间点的LOS和DD，同时建立一个透明的数据分析框架。这个框架被设计成很容易被其他机构用于开发他们自己的机器学习模型。方法：我们分析了2019年6月30日至2022年12月31日期间一家第四医院18岁及以上患者的297,392例急诊科就诊数据。评估了8种ML算法，最终训练了12个基于21个特征构建的lasso模型，以预测分诊后三个时间点的LOS和DD的四种结果。对这些模型进行了Hold-out检验和交叉验证。结果：10、60、120分钟二元LOS预测曲线下面积为0.862/0.868/0.878，二元DD预测曲线下面积为0.839/0.851/0.863。三元LOS预测准确率为60.2%/60.7%/61.9%，三元DD预测准确率为61.5%/62.3%/63.4%。结论：可解释的ML模型在预测LOS和DD方面表现出色。透明的数据分析框架可以很容易地被其他机构采用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊