Machine Learning-Enhanced Surveillance for Surgical Site Infections in Patients Undergoing Colon Surgery: Model Development and Evaluation Study.

IF 2 Q3 HEALTH CARE SCIENCES & SERVICES

JMIR Formative Research Pub Date : 2025-10-01 DOI:10.2196/75121

Ugur Celik, Feifan Liu, Kimiyoshi Kobayashi, Richard T Ellison Iii, Yurima Guilarte-Walker, Deborah Ann Mack, Qiming Shi, Adrian Zai

{"title":"Machine Learning-Enhanced Surveillance for Surgical Site Infections in Patients Undergoing Colon Surgery: Model Development and Evaluation Study.","authors":"Ugur Celik, Feifan Liu, Kimiyoshi Kobayashi, Richard T Ellison Iii, Yurima Guilarte-Walker, Deborah Ann Mack, Qiming Shi, Adrian Zai","doi":"10.2196/75121","DOIUrl":null,"url":null,"abstract":"Background: Surgical site infections (SSIs) are one of the most common health care-associated infections, accounting for nearly 20% of all health care-associated infections in hospitalized patients. SSIs are associated with longer hospital stays, increased readmission rates, higher health care costs, and a mortality rate twice that of patients without infections.Objective: This study aimed to develop and evaluate machine learning (ML) models for augmenting SSI surveillance after colon surgery with the goal of improving the efficiency of infection control practices by prioritizing patients at high risk.Methods: We conducted a retrospective study using data from 1508 patients undergoing colon surgery treated between 2018 and 2023 at a single academic medical center. Of these 1508 patients, 66 (4.4%) developed SSIs as adjudicated by infection control practitioners following Centers for Disease Control and Prevention National Healthcare Safety Network criteria. Data included 78 structured variables (eg, demographics, comorbidities, vital signs, laboratory tests, medications, and operative details) and 2 features derived from unstructured clinical notes using natural language processing. ML models-logistic regression, random forest, and Extreme Gradient Boosting (XGBoost)-were trained using stratified 80/20 train-test splits. Class imbalance was addressed using cost-sensitive learning and the synthetic minority oversampling technique. Model performance was evaluated using precision, recall, F1-score, area under the receiver operating characteristic curve, and Brier scores for calibration.Results: Of the 1508 patients, those who developed SSIs had longer hospital stays (mean 8.1, SD 6.8 days vs mean 6.3, SD 10.5 days; P<.001), higher rates of an American Society of Anesthesiologists score of 3 (52/66, 79% vs 653/1442, 45.3%; P<.001), and elevated white blood cell counts (51/66, 77% vs 734/1442, 50.9%; P<.001). XGBoost achieved the best overall performance with an area under the receiver operating characteristic curve of 0.788, precision of 50%, recall of 38%, and Brier score of 0.035. Random forest yielded perfect precision (100%) but lower recall (23%), with a Brier score of 0.034. Logistic regression showed the highest recall (46%) but the lowest precision (10%), with a Brier score of 0.139. Feature importance analysis using Shapley additive explanations (SHAP) values revealed that the top predictors included recovery duration (SHAP=1.18), SSI keyword frequency (SHAP=1.12), patient age (SHAP=1.12), and American Society of Anesthesiologists score (SHAP=0.94), with natural language processing-derived features ranking among the top 10.Conclusions: ML models can augment traditional SSI surveillance by improving early identification of patients at high risk. The XGBoost model offered the best trade-off between discrimination and calibration, suggesting its utility in clinical workflows. Incorporating structured and unstructured electronic health record data enhances model accuracy and clinical relevance, supporting scalable and efficient infection control practices.","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e75121"},"PeriodicalIF":2.0000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Formative Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/75121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Surgical site infections (SSIs) are one of the most common health care-associated infections, accounting for nearly 20% of all health care-associated infections in hospitalized patients. SSIs are associated with longer hospital stays, increased readmission rates, higher health care costs, and a mortality rate twice that of patients without infections.

Objective: This study aimed to develop and evaluate machine learning (ML) models for augmenting SSI surveillance after colon surgery with the goal of improving the efficiency of infection control practices by prioritizing patients at high risk.

Methods: We conducted a retrospective study using data from 1508 patients undergoing colon surgery treated between 2018 and 2023 at a single academic medical center. Of these 1508 patients, 66 (4.4%) developed SSIs as adjudicated by infection control practitioners following Centers for Disease Control and Prevention National Healthcare Safety Network criteria. Data included 78 structured variables (eg, demographics, comorbidities, vital signs, laboratory tests, medications, and operative details) and 2 features derived from unstructured clinical notes using natural language processing. ML models-logistic regression, random forest, and Extreme Gradient Boosting (XGBoost)-were trained using stratified 80/20 train-test splits. Class imbalance was addressed using cost-sensitive learning and the synthetic minority oversampling technique. Model performance was evaluated using precision, recall, F₁-score, area under the receiver operating characteristic curve, and Brier scores for calibration.

Results: Of the 1508 patients, those who developed SSIs had longer hospital stays (mean 8.1, SD 6.8 days vs mean 6.3, SD 10.5 days; P<.001), higher rates of an American Society of Anesthesiologists score of 3 (52/66, 79% vs 653/1442, 45.3%; P<.001), and elevated white blood cell counts (51/66, 77% vs 734/1442, 50.9%; P<.001). XGBoost achieved the best overall performance with an area under the receiver operating characteristic curve of 0.788, precision of 50%, recall of 38%, and Brier score of 0.035. Random forest yielded perfect precision (100%) but lower recall (23%), with a Brier score of 0.034. Logistic regression showed the highest recall (46%) but the lowest precision (10%), with a Brier score of 0.139. Feature importance analysis using Shapley additive explanations (SHAP) values revealed that the top predictors included recovery duration (SHAP=1.18), SSI keyword frequency (SHAP=1.12), patient age (SHAP=1.12), and American Society of Anesthesiologists score (SHAP=0.94), with natural language processing-derived features ranking among the top 10.

Conclusions: ML models can augment traditional SSI surveillance by improving early identification of patients at high risk. The XGBoost model offered the best trade-off between discrimination and calibration, suggesting its utility in clinical workflows. Incorporating structured and unstructured electronic health record data enhances model accuracy and clinical relevance, supporting scalable and efficient infection control practices.

查看原文本刊更多论文

机器学习增强对结肠手术患者手术部位感染的监测：模型开发和评估研究。

背景：手术部位感染（ssi）是最常见的卫生保健相关感染之一，占住院患者所有卫生保健相关感染的近20%。ssi患者住院时间更长，再入院率增加，医疗费用更高，死亡率是未感染患者的两倍。目的：本研究旨在开发和评估用于增强结肠手术后SSI监测的机器学习（ML）模型，目的是通过优先考虑高风险患者来提高感染控制实践的效率。方法：我们对2018年至2023年在单个学术医疗中心接受结肠手术治疗的1508例患者的数据进行了回顾性研究。在这1508名患者中，66名（4.4%）发展为ssi，由感染控制从业人员根据疾病控制和预防中心国家卫生保健安全网络标准判定。数据包括78个结构化变量（如人口统计学、合并症、生命体征、实验室检查、药物和手术细节）和2个使用自然语言处理的非结构化临床记录特征。ML模型——逻辑回归、随机森林和极端梯度增强（XGBoost）——使用分层80/20训练测试分割进行训练。使用代价敏感学习和合成少数派过采样技术解决了类不平衡问题。使用精确度、召回率、f1评分、受试者工作特征曲线下面积和Brier评分来评估模型的性能。结果：在1508例患者中，发生SSI的患者住院时间更长（平均8.1天，SD 6.8天vs平均6.3天，SD 10.5天）。结论：ML模型可以通过提高早期识别高风险患者来增强传统的SSI监测。XGBoost模型提供了区分和校准之间的最佳权衡，表明其在临床工作流程中的实用性。结合结构化和非结构化电子健康记录数据可提高模型准确性和临床相关性，支持可扩展和高效的感染控制实践。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊