Hugo Guillen-Ramirez, Daniel Sanchez-Taltavull, Stéphanie Perrodin, Sarah Peisl, Karen Triep, Christophe Gaudet-Blavignac, Olga Endrich, Guido Beldi
{"title":"Prediction of postoperative infections by strategic data imputation and explainable machine learning.","authors":"Hugo Guillen-Ramirez, Daniel Sanchez-Taltavull, Stéphanie Perrodin, Sarah Peisl, Karen Triep, Christophe Gaudet-Blavignac, Olga Endrich, Guido Beldi","doi":"10.1093/jamia/ocaf145","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Infections following healthcare-associated interventions drive patient morbidity and mortality, making early detection essential. Traditional predictive models utilize preoperative surgical characteristics. This study evaluated whether integrating postoperative laboratory values and their kinetics could improve outcome prediction.</p><p><strong>Materials and methods: </strong>91 794 surgical cases were extracted from electronic health records (EHR) and analyzed to predict bacterial infection as the endpoint. The endpoint was documented in the EHR as ICD-10 by a professional coding team. Variables were grouped as preoperative, intraoperative, or postoperative. Strategic imputation was used for postoperative missing laboratory values. Procedure-agnostic prediction models were built incorporating both static and kinetic properties of laboratory values.</p><p><strong>Results: </strong>The integration of kinetics of laboratory values into a machine learning predictor achieved a recall, precision and ROC AUC at postoperative day 2 of 0.71, 0.69, and 0.83, respectively. Moreover, infection detection outperformed clinician-based decision-making, as reflected by the postoperative timing of antibiotic administration. The analysis identified previously unknown, informative combinations of routine markers from hepatic, renal, and bone marrow functions that predict outcome.</p><p><strong>Discussion: </strong>Dynamic modelling of postoperative laboratory values enhanced the timeliness and accuracy of infection detection compared with static or preoperative-only models. The integration of explainable machine learning supports clinical interpretation and highlights the contribution of multiple organ systems to postoperative infection risk.</p><p><strong>Conclusion: </strong>A surgery-independent workflow integrating time-series values from laboratory parameters to enhance baseline predictors of infection. This interpretable approach is generalizable across procedures and has the potential to optimize patient outcomes and resource use in surgical care.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Informatics Association","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1093/jamia/ocaf145","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: Infections following healthcare-associated interventions drive patient morbidity and mortality, making early detection essential. Traditional predictive models utilize preoperative surgical characteristics. This study evaluated whether integrating postoperative laboratory values and their kinetics could improve outcome prediction.
Materials and methods: 91 794 surgical cases were extracted from electronic health records (EHR) and analyzed to predict bacterial infection as the endpoint. The endpoint was documented in the EHR as ICD-10 by a professional coding team. Variables were grouped as preoperative, intraoperative, or postoperative. Strategic imputation was used for postoperative missing laboratory values. Procedure-agnostic prediction models were built incorporating both static and kinetic properties of laboratory values.
Results: The integration of kinetics of laboratory values into a machine learning predictor achieved a recall, precision and ROC AUC at postoperative day 2 of 0.71, 0.69, and 0.83, respectively. Moreover, infection detection outperformed clinician-based decision-making, as reflected by the postoperative timing of antibiotic administration. The analysis identified previously unknown, informative combinations of routine markers from hepatic, renal, and bone marrow functions that predict outcome.
Discussion: Dynamic modelling of postoperative laboratory values enhanced the timeliness and accuracy of infection detection compared with static or preoperative-only models. The integration of explainable machine learning supports clinical interpretation and highlights the contribution of multiple organ systems to postoperative infection risk.
Conclusion: A surgery-independent workflow integrating time-series values from laboratory parameters to enhance baseline predictors of infection. This interpretable approach is generalizable across procedures and has the potential to optimize patient outcomes and resource use in surgical care.
期刊介绍:
JAMIA is AMIA''s premier peer-reviewed journal for biomedical and health informatics. Covering the full spectrum of activities in the field, JAMIA includes informatics articles in the areas of clinical care, clinical research, translational science, implementation science, imaging, education, consumer health, public health, and policy. JAMIA''s articles describe innovative informatics research and systems that help to advance biomedical science and to promote health. Case reports, perspectives and reviews also help readers stay connected with the most important informatics developments in implementation, policy and education.