Prediction of postoperative infections by strategic data imputation and explainable machine learning.

IF 4.6 2区医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of the American Medical Informatics Association Pub Date : 2025-08-31 DOI:10.1093/jamia/ocaf145

Hugo Guillen-Ramirez, Daniel Sanchez-Taltavull, Stéphanie Perrodin, Sarah Peisl, Karen Triep, Christophe Gaudet-Blavignac, Olga Endrich, Guido Beldi

{"title":"Prediction of postoperative infections by strategic data imputation and explainable machine learning.","authors":"Hugo Guillen-Ramirez, Daniel Sanchez-Taltavull, Stéphanie Perrodin, Sarah Peisl, Karen Triep, Christophe Gaudet-Blavignac, Olga Endrich, Guido Beldi","doi":"10.1093/jamia/ocaf145","DOIUrl":null,"url":null,"abstract":"Objectives: Infections following healthcare-associated interventions drive patient morbidity and mortality, making early detection essential. Traditional predictive models utilize preoperative surgical characteristics. This study evaluated whether integrating postoperative laboratory values and their kinetics could improve outcome prediction.Materials and methods: 91 794 surgical cases were extracted from electronic health records (EHR) and analyzed to predict bacterial infection as the endpoint. The endpoint was documented in the EHR as ICD-10 by a professional coding team. Variables were grouped as preoperative, intraoperative, or postoperative. Strategic imputation was used for postoperative missing laboratory values. Procedure-agnostic prediction models were built incorporating both static and kinetic properties of laboratory values.Results: The integration of kinetics of laboratory values into a machine learning predictor achieved a recall, precision and ROC AUC at postoperative day 2 of 0.71, 0.69, and 0.83, respectively. Moreover, infection detection outperformed clinician-based decision-making, as reflected by the postoperative timing of antibiotic administration. The analysis identified previously unknown, informative combinations of routine markers from hepatic, renal, and bone marrow functions that predict outcome.Discussion: Dynamic modelling of postoperative laboratory values enhanced the timeliness and accuracy of infection detection compared with static or preoperative-only models. The integration of explainable machine learning supports clinical interpretation and highlights the contribution of multiple organ systems to postoperative infection risk.Conclusion: A surgery-independent workflow integrating time-series values from laboratory parameters to enhance baseline predictors of infection. This interpretable approach is generalizable across procedures and has the potential to optimize patient outcomes and resource use in surgical care.","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Informatics Association","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1093/jamia/ocaf145","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Objectives: Infections following healthcare-associated interventions drive patient morbidity and mortality, making early detection essential. Traditional predictive models utilize preoperative surgical characteristics. This study evaluated whether integrating postoperative laboratory values and their kinetics could improve outcome prediction.

Materials and methods: 91 794 surgical cases were extracted from electronic health records (EHR) and analyzed to predict bacterial infection as the endpoint. The endpoint was documented in the EHR as ICD-10 by a professional coding team. Variables were grouped as preoperative, intraoperative, or postoperative. Strategic imputation was used for postoperative missing laboratory values. Procedure-agnostic prediction models were built incorporating both static and kinetic properties of laboratory values.

Results: The integration of kinetics of laboratory values into a machine learning predictor achieved a recall, precision and ROC AUC at postoperative day 2 of 0.71, 0.69, and 0.83, respectively. Moreover, infection detection outperformed clinician-based decision-making, as reflected by the postoperative timing of antibiotic administration. The analysis identified previously unknown, informative combinations of routine markers from hepatic, renal, and bone marrow functions that predict outcome.

Discussion: Dynamic modelling of postoperative laboratory values enhanced the timeliness and accuracy of infection detection compared with static or preoperative-only models. The integration of explainable machine learning supports clinical interpretation and highlights the contribution of multiple organ systems to postoperative infection risk.

Conclusion: A surgery-independent workflow integrating time-series values from laboratory parameters to enhance baseline predictors of infection. This interpretable approach is generalizable across procedures and has the potential to optimize patient outcomes and resource use in surgical care.

查看原文本刊更多论文

策略性数据输入和可解释的机器学习预测术后感染。

目的：卫生保健相关干预措施后的感染会导致患者发病率和死亡率，因此早期发现至关重要。传统的预测模型利用术前手术特征。本研究评估了整合术后实验室值及其动力学是否可以改善预后预测。材料与方法：从电子病历（EHR）中提取91794例手术病例进行分析，以预测细菌感染为终点。终点由专业编码团队以ICD-10记录在EHR中。变量分为术前、术中、术后。术后缺失的实验室值采用策略补全。建立了程序不可知的预测模型，同时考虑了实验室值的静态和动态特性。结果：将实验室值的动力学整合到机器学习预测器中，术后第2天的召回率、精确度和ROC AUC分别为0.71、0.69和0.83。此外，感染检测优于基于临床的决策，反映在术后抗生素给药的时机。该分析确定了以前未知的、信息丰富的肝、肾和骨髓功能常规标志物组合，可预测预后。讨论：与静态或术前模型相比，术后实验室值的动态建模提高了感染检测的及时性和准确性。可解释的机器学习的整合支持临床解释，并强调多器官系统对术后感染风险的贡献。结论：一个与手术无关的工作流程整合了实验室参数的时间序列值，以增强感染的基线预测。这种可解释的方法可以在手术过程中推广，并有可能优化患者的预后和外科护理中的资源利用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of the American Medical Informatics Association 医学-计算机：跨学科应用

CiteScore

14.50

自引率

7.80%

发文量

230

审稿时长

3-8 weeks

期刊介绍： JAMIA is AMIA''s premier peer-reviewed journal for biomedical and health informatics. Covering the full spectrum of activities in the field, JAMIA includes informatics articles in the areas of clinical care, clinical research, translational science, implementation science, imaging, education, consumer health, public health, and policy. JAMIA''s articles describe innovative informatics research and systems that help to advance biomedical science and to promote health. Case reports, perspectives and reviews also help readers stay connected with the most important informatics developments in implementation, policy and education.