Khaled M. Toffaha , Mecit Can Emre Simsekler , Mohammed Atif Omar , Imad ElKebbi
{"title":"利用机器学习预测患者未就诊情况:全面回顾与未来研究议程","authors":"Khaled M. Toffaha , Mecit Can Emre Simsekler , Mohammed Atif Omar , Imad ElKebbi","doi":"10.1016/j.ibmed.2025.100229","DOIUrl":null,"url":null,"abstract":"<div><div>Patient no-shows for scheduled medical appointments pose significant challenges to healthcare systems, resulting in wasted resources, increased costs, and disrupted continuity of care. This comprehensive review examines state-of-the-art machine learning (ML) approaches for predicting patient no-shows in outpatient settings, analyzing 52 publications from 2010 to 2025.</div><div>The study reveals significant advancements in the field, with Logistic Regression (LR) as the most commonly used model in 68% of the studies. Tree-based models, ensemble methods, and deep learning techniques have gained traction in recent years, reflecting the field’s evolution. The best-performing models achieved Area Under the Curve (AUC) scores between 0.75 and 0.95, with accuracy ranging from 52% to 99.44%. Methodologically, researchers addressed common challenges such as class imbalance using various sampling techniques and employed a wide range of feature selection methods to improve model efficiency. The review also highlighted the importance of considering temporal factors and the context-dependent nature of no-show behavior across different healthcare settings.</div><div>Using the ITPOSMO framework (Information, Technology, Processes, Objectives, Staffing, Management, and Other Resources), the study identified several gaps in current ML approaches. Key challenges include data quality and completeness, model interpretability, and integration with existing healthcare systems. Future research directions include improving data collection methods, incorporating organizational factors, ensuring ethical implementation, and developing standardized approaches for handling data imbalance. The review also suggests exploring new data sources, utilizing ML algorithms to analyze patient behavior patterns, and using transfer learning techniques to adapt models across different healthcare facilities.</div><div>By addressing these challenges, healthcare providers can leverage ML to improve resource allocation, enhance patient care quality, and advance predictive analytics in healthcare. This comprehensive review underscores the potential of ML in predicting no-shows while acknowledging the complexities and challenges in its practical implementation.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100229"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting patient no-shows using machine learning: A comprehensive review and future research agenda\",\"authors\":\"Khaled M. Toffaha , Mecit Can Emre Simsekler , Mohammed Atif Omar , Imad ElKebbi\",\"doi\":\"10.1016/j.ibmed.2025.100229\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Patient no-shows for scheduled medical appointments pose significant challenges to healthcare systems, resulting in wasted resources, increased costs, and disrupted continuity of care. This comprehensive review examines state-of-the-art machine learning (ML) approaches for predicting patient no-shows in outpatient settings, analyzing 52 publications from 2010 to 2025.</div><div>The study reveals significant advancements in the field, with Logistic Regression (LR) as the most commonly used model in 68% of the studies. Tree-based models, ensemble methods, and deep learning techniques have gained traction in recent years, reflecting the field’s evolution. The best-performing models achieved Area Under the Curve (AUC) scores between 0.75 and 0.95, with accuracy ranging from 52% to 99.44%. Methodologically, researchers addressed common challenges such as class imbalance using various sampling techniques and employed a wide range of feature selection methods to improve model efficiency. The review also highlighted the importance of considering temporal factors and the context-dependent nature of no-show behavior across different healthcare settings.</div><div>Using the ITPOSMO framework (Information, Technology, Processes, Objectives, Staffing, Management, and Other Resources), the study identified several gaps in current ML approaches. Key challenges include data quality and completeness, model interpretability, and integration with existing healthcare systems. Future research directions include improving data collection methods, incorporating organizational factors, ensuring ethical implementation, and developing standardized approaches for handling data imbalance. The review also suggests exploring new data sources, utilizing ML algorithms to analyze patient behavior patterns, and using transfer learning techniques to adapt models across different healthcare facilities.</div><div>By addressing these challenges, healthcare providers can leverage ML to improve resource allocation, enhance patient care quality, and advance predictive analytics in healthcare. This comprehensive review underscores the potential of ML in predicting no-shows while acknowledging the complexities and challenges in its practical implementation.</div></div>\",\"PeriodicalId\":73399,\"journal\":{\"name\":\"Intelligence-based medicine\",\"volume\":\"11 \",\"pages\":\"Article 100229\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligence-based medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666521225000328\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligence-based medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666521225000328","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
病人爽约给医疗保健系统带来了巨大挑战,导致资源浪费、成本增加和护理连续性中断。本综述分析了 2010 年至 2025 年间发表的 52 篇论文,探讨了用于预测门诊患者爽约情况的最先进的机器学习(ML)方法。研究显示,该领域取得了重大进展,在 68% 的研究中,逻辑回归(LR)是最常用的模型。基于树的模型、集合方法和深度学习技术在近几年得到了广泛应用,反映了该领域的发展。表现最好的模型的曲线下面积(AUC)得分在 0.75 到 0.95 之间,准确率在 52% 到 99.44% 之间。在方法上,研究人员利用各种采样技术解决了类不平衡等常见难题,并采用了多种特征选择方法来提高模型效率。该综述还强调了考虑时间因素和不同医疗环境中缺席行为的环境依赖性的重要性。利用 ITPOSMO 框架(信息、技术、流程、目标、人员配备、管理和其他资源),该研究确定了当前 ML 方法中的几个差距。主要挑战包括数据质量和完整性、模型可解释性以及与现有医疗保健系统的集成。未来的研究方向包括改进数据收集方法、纳入组织因素、确保符合道德规范的实施,以及开发处理数据不平衡的标准化方法。该综述还建议探索新的数据源,利用 ML 算法分析患者行为模式,并使用迁移学习技术在不同的医疗机构间调整模型。通过应对这些挑战,医疗机构可以利用 ML 改善资源分配,提高患者护理质量,并推进医疗领域的预测分析。这篇全面的综述强调了人工智能在预测病例缺席方面的潜力,同时也承认了其实际应用中的复杂性和挑战。
Predicting patient no-shows using machine learning: A comprehensive review and future research agenda
Patient no-shows for scheduled medical appointments pose significant challenges to healthcare systems, resulting in wasted resources, increased costs, and disrupted continuity of care. This comprehensive review examines state-of-the-art machine learning (ML) approaches for predicting patient no-shows in outpatient settings, analyzing 52 publications from 2010 to 2025.
The study reveals significant advancements in the field, with Logistic Regression (LR) as the most commonly used model in 68% of the studies. Tree-based models, ensemble methods, and deep learning techniques have gained traction in recent years, reflecting the field’s evolution. The best-performing models achieved Area Under the Curve (AUC) scores between 0.75 and 0.95, with accuracy ranging from 52% to 99.44%. Methodologically, researchers addressed common challenges such as class imbalance using various sampling techniques and employed a wide range of feature selection methods to improve model efficiency. The review also highlighted the importance of considering temporal factors and the context-dependent nature of no-show behavior across different healthcare settings.
Using the ITPOSMO framework (Information, Technology, Processes, Objectives, Staffing, Management, and Other Resources), the study identified several gaps in current ML approaches. Key challenges include data quality and completeness, model interpretability, and integration with existing healthcare systems. Future research directions include improving data collection methods, incorporating organizational factors, ensuring ethical implementation, and developing standardized approaches for handling data imbalance. The review also suggests exploring new data sources, utilizing ML algorithms to analyze patient behavior patterns, and using transfer learning techniques to adapt models across different healthcare facilities.
By addressing these challenges, healthcare providers can leverage ML to improve resource allocation, enhance patient care quality, and advance predictive analytics in healthcare. This comprehensive review underscores the potential of ML in predicting no-shows while acknowledging the complexities and challenges in its practical implementation.