Long Song, Uwe Aickelin, Timothy N Fazio, Abhishek Sharma, Mojgan Kouhounestani, Samantha Plumb, Mark John Putland
{"title":"Developing interpretable machine learning models to predict length of stay and disposition decision for adult patients in emergency departments.","authors":"Long Song, Uwe Aickelin, Timothy N Fazio, Abhishek Sharma, Mojgan Kouhounestani, Samantha Plumb, Mark John Putland","doi":"10.1136/bmjhci-2024-101152","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Machine learning (ML) models have emerged as tools to predict length of stay (LOS) and disposition decision (DD) in emergency departments (EDs) to combat overcrowding. However, site-specific ML models are not transferable to different sites. Our objective was to develop interpretable ML models to predict LOS and DD at specific time points, all while establishing a transparent data analysis framework. This framework was designed to be easily adapted by other institutions for the development of their own ML models.</p><p><strong>Methods: </strong>We analysed data from 297 392 ED visits of patients aged 18 and above at a quaternary hospital between 30 June 2019 and 31 December 2022. Eight ML algorithms were evaluated, and ultimately, twelve lasso models built from 21 features were trained to predict four outcomes of LOS and DD at three time points post-triage. Hold-out testing and cross-validation were conducted for these models.</p><p><strong>Results: </strong>The area under the curve values were 0.862/0.868/0.878 for binary LOS predictions at 10, 60 and 120-minute time points and 0.839/0.851/0.863 for binary DD predictions. The accuracies were 60.2%/60.7%/61.9% for ternary LOS predictions and 61.5%/62.3%/63.4% for ternary DD predictions.</p><p><strong>Conclusions: </strong>Interpretable ML models demonstrated outstanding performances in predicting both LOS and DD. The transparent data analysis framework can be easily adapted by other institutions.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.1000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ Health & Care Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmjhci-2024-101152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: Machine learning (ML) models have emerged as tools to predict length of stay (LOS) and disposition decision (DD) in emergency departments (EDs) to combat overcrowding. However, site-specific ML models are not transferable to different sites. Our objective was to develop interpretable ML models to predict LOS and DD at specific time points, all while establishing a transparent data analysis framework. This framework was designed to be easily adapted by other institutions for the development of their own ML models.
Methods: We analysed data from 297 392 ED visits of patients aged 18 and above at a quaternary hospital between 30 June 2019 and 31 December 2022. Eight ML algorithms were evaluated, and ultimately, twelve lasso models built from 21 features were trained to predict four outcomes of LOS and DD at three time points post-triage. Hold-out testing and cross-validation were conducted for these models.
Results: The area under the curve values were 0.862/0.868/0.878 for binary LOS predictions at 10, 60 and 120-minute time points and 0.839/0.851/0.863 for binary DD predictions. The accuracies were 60.2%/60.7%/61.9% for ternary LOS predictions and 61.5%/62.3%/63.4% for ternary DD predictions.
Conclusions: Interpretable ML models demonstrated outstanding performances in predicting both LOS and DD. The transparent data analysis framework can be easily adapted by other institutions.