Jonathan Nover MBA, RN , Matthew Bai MD , Prem Tismina MS , Ganesh Raut MS , Dhavalkumar Patel MS , Girish N. Nadkarni MD, MPH , Benjamin S. Abella MD, MPhil , Eyal Klang MD , Robert Freeman DNP, RN, NE-BC
{"title":"Comparing Machine Learning and Nurse Predictions for Hospital Admissions in a Multisite Emergency Care System","authors":"Jonathan Nover MBA, RN , Matthew Bai MD , Prem Tismina MS , Ganesh Raut MS , Dhavalkumar Patel MS , Girish N. Nadkarni MD, MPH , Benjamin S. Abella MD, MPhil , Eyal Klang MD , Robert Freeman DNP, RN, NE-BC","doi":"10.1016/j.mcpdig.2025.100249","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>To prospectively compare nurse predictions with a machine learning (ML) model for hospital admissions and evaluate whether adding the nurse prediction to ML outputs enhances predictive performance.</div></div><div><h3>Patients and Methods</h3><div>In this prospective, observational study at 6 hospitals in a large mixed quaternary/community emergency department (ED) system (annual ED census ∼500,000), triage nurses recorded a binary admission prediction for adult patients. These predictions were compared with an ensemble ML model (XGBoost + Bio-Clinical BERT) trained on structured data (demographics, vital signs, and medical history) and triage text. Nurse predictions were similarly analyzed and then integrated with the ML output to assess for improvement in predictive accuracy.</div></div><div><h3>Results</h3><div>The ensemble ML model (XGBoost + Bio-Clinical BERT) was trained on 1.8 million historical ED visits (January 2019 to December 2023). It was then tested on 46,912 prospective ED visits with recorded nurse predictions (September 1, 2024 to October 31, 2024). In the prospective arm, nurse predictions yielded an accuracy of 81.6% (95% CI, 81.3-81.9), a sensitivity of 64.8% (63.7-65.8), and a specificity of 85.7% (85.3-86.0). At a 0.30 probability threshold, the ML model attained an accuracy of 85.4% (85.0-85.7) and a sensitivity of 70.8% (69.8-71.7). Combining nurse predictions with the ML output did not improve accuracy beyond the model alone.</div></div><div><h3>Conclusion</h3><div>Machine learning-based predictions outperformed triage nurse estimates for hospital admissions. These findings suggest that an admission prediction system anchored by ML can perform reliably using data available at triage.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 3","pages":"Article 100249"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mayo Clinic Proceedings. Digital health","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949761225000562","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
To prospectively compare nurse predictions with a machine learning (ML) model for hospital admissions and evaluate whether adding the nurse prediction to ML outputs enhances predictive performance.
Patients and Methods
In this prospective, observational study at 6 hospitals in a large mixed quaternary/community emergency department (ED) system (annual ED census ∼500,000), triage nurses recorded a binary admission prediction for adult patients. These predictions were compared with an ensemble ML model (XGBoost + Bio-Clinical BERT) trained on structured data (demographics, vital signs, and medical history) and triage text. Nurse predictions were similarly analyzed and then integrated with the ML output to assess for improvement in predictive accuracy.
Results
The ensemble ML model (XGBoost + Bio-Clinical BERT) was trained on 1.8 million historical ED visits (January 2019 to December 2023). It was then tested on 46,912 prospective ED visits with recorded nurse predictions (September 1, 2024 to October 31, 2024). In the prospective arm, nurse predictions yielded an accuracy of 81.6% (95% CI, 81.3-81.9), a sensitivity of 64.8% (63.7-65.8), and a specificity of 85.7% (85.3-86.0). At a 0.30 probability threshold, the ML model attained an accuracy of 85.4% (85.0-85.7) and a sensitivity of 70.8% (69.8-71.7). Combining nurse predictions with the ML output did not improve accuracy beyond the model alone.
Conclusion
Machine learning-based predictions outperformed triage nurse estimates for hospital admissions. These findings suggest that an admission prediction system anchored by ML can perform reliably using data available at triage.