Abubakar Sadiq Bouda Abdulai , Jean Storm , Michael Ehrlich
{"title":"“我不知道”:一个不确定性感知机器学习模型,用于预测急诊科分诊时患者的处置情况","authors":"Abubakar Sadiq Bouda Abdulai , Jean Storm , Michael Ehrlich","doi":"10.1016/j.ijmedinf.2025.105957","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Machine learning (ML) models are widely used for predicting patient disposition at emergency department (ED) triage. However, these models generate predictions regardless of the level of uncertainty, potentially leading to overconfident outputs that can compromise clinical decision-making.</div></div><div><h3>Objective</h3><div>To develop a conformal prediction model for ED triage that provides uncertainty-aware patient disposition predictions.</div></div><div><h3>Methods</h3><div>This retrospective study analyzed 560,486 adult ED visits (March 2014 – July 2017) from one academic and two community hospitals. An extreme gradient boosting (XGBoost) model was trained, validated, and conformalized to introduce a “Don’t know” prediction for high-uncertainty cases. The model was tested on a random sample of 56,000 ED cases.</div></div><div><h3>Results</h3><div>The standard XGBoost model achieved an AUC of 0.9307 (95% CI: 0.9285 – 0.9329), with sensitivity of 0.72 and specificity of 0.94. With conformal prediction at a lower confidence threshold of 60%, the model indicated “Don’t know” in 4.9% of cases while returning sensitivity and specificity values of 0.74 and 0.95, respectively. As confidence thresholds increased, the model returned more “Don’t know” predictions and fewer misclassifications. At 90% confidence, the model returned “Don’t know” in 34.5% of cases while returning sensitivity and specificity values of 0.88 and 0.99, respectively. This trade-off highlights a balance between model confidence and prediction accuracy<strong>.</strong></div></div><div><h3>Conclusion</h3><div>Incorporating uncertainty-awareness in ML models improves reliability in ED triage. By acknowledging uncertainty, clinicians receive more interpretable insights, reducing the risk of overconfident predictions and enhancing patient safety.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"201 ","pages":"Article 105957"},"PeriodicalIF":3.7000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"“I don’t know”: An uncertainty-aware machine learning model for predicting patient disposition at emergency department triage\",\"authors\":\"Abubakar Sadiq Bouda Abdulai , Jean Storm , Michael Ehrlich\",\"doi\":\"10.1016/j.ijmedinf.2025.105957\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Machine learning (ML) models are widely used for predicting patient disposition at emergency department (ED) triage. However, these models generate predictions regardless of the level of uncertainty, potentially leading to overconfident outputs that can compromise clinical decision-making.</div></div><div><h3>Objective</h3><div>To develop a conformal prediction model for ED triage that provides uncertainty-aware patient disposition predictions.</div></div><div><h3>Methods</h3><div>This retrospective study analyzed 560,486 adult ED visits (March 2014 – July 2017) from one academic and two community hospitals. An extreme gradient boosting (XGBoost) model was trained, validated, and conformalized to introduce a “Don’t know” prediction for high-uncertainty cases. The model was tested on a random sample of 56,000 ED cases.</div></div><div><h3>Results</h3><div>The standard XGBoost model achieved an AUC of 0.9307 (95% CI: 0.9285 – 0.9329), with sensitivity of 0.72 and specificity of 0.94. With conformal prediction at a lower confidence threshold of 60%, the model indicated “Don’t know” in 4.9% of cases while returning sensitivity and specificity values of 0.74 and 0.95, respectively. As confidence thresholds increased, the model returned more “Don’t know” predictions and fewer misclassifications. At 90% confidence, the model returned “Don’t know” in 34.5% of cases while returning sensitivity and specificity values of 0.88 and 0.99, respectively. This trade-off highlights a balance between model confidence and prediction accuracy<strong>.</strong></div></div><div><h3>Conclusion</h3><div>Incorporating uncertainty-awareness in ML models improves reliability in ED triage. By acknowledging uncertainty, clinicians receive more interpretable insights, reducing the risk of overconfident predictions and enhancing patient safety.</div></div>\",\"PeriodicalId\":54950,\"journal\":{\"name\":\"International Journal of Medical Informatics\",\"volume\":\"201 \",\"pages\":\"Article 105957\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Medical Informatics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1386505625001741\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505625001741","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
“I don’t know”: An uncertainty-aware machine learning model for predicting patient disposition at emergency department triage
Background
Machine learning (ML) models are widely used for predicting patient disposition at emergency department (ED) triage. However, these models generate predictions regardless of the level of uncertainty, potentially leading to overconfident outputs that can compromise clinical decision-making.
Objective
To develop a conformal prediction model for ED triage that provides uncertainty-aware patient disposition predictions.
Methods
This retrospective study analyzed 560,486 adult ED visits (March 2014 – July 2017) from one academic and two community hospitals. An extreme gradient boosting (XGBoost) model was trained, validated, and conformalized to introduce a “Don’t know” prediction for high-uncertainty cases. The model was tested on a random sample of 56,000 ED cases.
Results
The standard XGBoost model achieved an AUC of 0.9307 (95% CI: 0.9285 – 0.9329), with sensitivity of 0.72 and specificity of 0.94. With conformal prediction at a lower confidence threshold of 60%, the model indicated “Don’t know” in 4.9% of cases while returning sensitivity and specificity values of 0.74 and 0.95, respectively. As confidence thresholds increased, the model returned more “Don’t know” predictions and fewer misclassifications. At 90% confidence, the model returned “Don’t know” in 34.5% of cases while returning sensitivity and specificity values of 0.88 and 0.99, respectively. This trade-off highlights a balance between model confidence and prediction accuracy.
Conclusion
Incorporating uncertainty-awareness in ML models improves reliability in ED triage. By acknowledging uncertainty, clinicians receive more interpretable insights, reducing the risk of overconfident predictions and enhancing patient safety.
期刊介绍:
International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings.
The scope of journal covers:
Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.;
Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc.
Educational computer based programs pertaining to medical informatics or medicine in general;
Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.