一个可解释的机器学习模型来预测住院情况

Clinical eHealth Pub Date : 2025-04-04 DOI:10.1016/j.ceh.2025.03.004

Hagar Elbatanouny , Hissam Tawfik , Tarek Khater , Anatoliy Gorbenko

{"title":"一个可解释的机器学习模型来预测住院情况","authors":"Hagar Elbatanouny , Hissam Tawfik , Tarek Khater , Anatoliy Gorbenko","doi":"10.1016/j.ceh.2025.03.004","DOIUrl":null,"url":null,"abstract":"<div><div>Hospital management plays a pivotal role in ensuring the efficient delivery of medical services, especially in the face of challenges posed by pandemics such as COVID-19. This paper explores the application of machine learning techniques in addressing the challenge of hospitalization during pandemics. Leveraging a comprehensive dataset sourced from the Mexican government, various supervised learning algorithms including Random Forest, Gradient Boosting, Support Vector Machine, K-Nearest Neighbors, and Multilayer Perceptron are trained and evaluated to discern factors contributing to hospitalizations. Feature importance analysis and dimensionality reduction techniques are employed to enhance models predictive performance. The best model was Gradient Boosting algorithm with an accuracy of 85.63% and AUC score of 0.8696. The interpretability plots showed that pneumonia had a positive impact on the hospitalization prediction of the model. Our analysis indicates that women aged over 45 with pneumonia and concurrent COVID-19 exhibit the highest likelihood of hospitalization. This study underscores the potential of interpretable machine learning in aiding hospital managers to optimize resource allocation, hospitalization cases, and make data-driven decisions during pandemics.</div></div>","PeriodicalId":100268,"journal":{"name":"Clinical eHealth","volume":"8 ","pages":"Pages 53-65"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An interpretable machine learning model to predict hospitalizations\",\"authors\":\"Hagar Elbatanouny , Hissam Tawfik , Tarek Khater , Anatoliy Gorbenko\",\"doi\":\"10.1016/j.ceh.2025.03.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Hospital management plays a pivotal role in ensuring the efficient delivery of medical services, especially in the face of challenges posed by pandemics such as COVID-19. This paper explores the application of machine learning techniques in addressing the challenge of hospitalization during pandemics. Leveraging a comprehensive dataset sourced from the Mexican government, various supervised learning algorithms including Random Forest, Gradient Boosting, Support Vector Machine, K-Nearest Neighbors, and Multilayer Perceptron are trained and evaluated to discern factors contributing to hospitalizations. Feature importance analysis and dimensionality reduction techniques are employed to enhance models predictive performance. The best model was Gradient Boosting algorithm with an accuracy of 85.63% and AUC score of 0.8696. The interpretability plots showed that pneumonia had a positive impact on the hospitalization prediction of the model. Our analysis indicates that women aged over 45 with pneumonia and concurrent COVID-19 exhibit the highest likelihood of hospitalization. This study underscores the potential of interpretable machine learning in aiding hospital managers to optimize resource allocation, hospitalization cases, and make data-driven decisions during pandemics.</div></div>\",\"PeriodicalId\":100268,\"journal\":{\"name\":\"Clinical eHealth\",\"volume\":\"8 \",\"pages\":\"Pages 53-65\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-04-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical eHealth\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2588914125000140\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical eHealth","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2588914125000140","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

医院管理在确保高效提供医疗服务方面发挥着关键作用，特别是在面临COVID-19等大流行病带来的挑战时。本文探讨了机器学习技术在应对流行病期间住院治疗挑战中的应用。利用来自墨西哥政府的综合数据集，对各种监督学习算法（包括随机森林、梯度增强、支持向量机、k近邻和多层感知器）进行了训练和评估，以识别导致住院的因素。采用特征重要性分析和降维技术来提高模型的预测性能。最佳模型为Gradient Boosting算法，准确率为85.63%，AUC得分为0.8696。可解释性图显示肺炎对模型的住院预测有正向影响。我们的分析表明，45岁以上的女性肺炎和COVID-19合并住院的可能性最高。这项研究强调了可解释机器学习在帮助医院管理人员优化资源分配、住院病例和在大流行期间做出数据驱动决策方面的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An interpretable machine learning model to predict hospitalizations

Hospital management plays a pivotal role in ensuring the efficient delivery of medical services, especially in the face of challenges posed by pandemics such as COVID-19. This paper explores the application of machine learning techniques in addressing the challenge of hospitalization during pandemics. Leveraging a comprehensive dataset sourced from the Mexican government, various supervised learning algorithms including Random Forest, Gradient Boosting, Support Vector Machine, K-Nearest Neighbors, and Multilayer Perceptron are trained and evaluated to discern factors contributing to hospitalizations. Feature importance analysis and dimensionality reduction techniques are employed to enhance models predictive performance. The best model was Gradient Boosting algorithm with an accuracy of 85.63% and AUC score of 0.8696. The interpretability plots showed that pneumonia had a positive impact on the hospitalization prediction of the model. Our analysis indicates that women aged over 45 with pneumonia and concurrent COVID-19 exhibit the highest likelihood of hospitalization. This study underscores the potential of interpretable machine learning in aiding hospital managers to optimize resource allocation, hospitalization cases, and make data-driven decisions during pandemics.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Clinical eHealth

CiteScore

8.10

自引率

0.00%

发文量