Julien Favresse, Julien Cabo, Maxime Bosse, Benjamin Lardinois, Julie Cadrobbi, Kim Laffineur, Marc Elsen, Jonathan Douxfils, Liam Roelandts, Sander De Bruyne
{"title":"Machine Learning Algorithms for Predicting Urinary Tract Infections: Integration of Demographic Data and Dipstick Reflectance Results","authors":"Julien Favresse, Julien Cabo, Maxime Bosse, Benjamin Lardinois, Julie Cadrobbi, Kim Laffineur, Marc Elsen, Jonathan Douxfils, Liam Roelandts, Sander De Bruyne","doi":"10.1093/clinchem/hvaf088","DOIUrl":null,"url":null,"abstract":"Background Urinary tract infections (UTIs) are among the most common infections encountered in healthcare settings. Current diagnostic practices often require 24–48 h due to the time needed for culture results. Given that 70%–80% of cultures return negative, there is significant interest in rapidly identifying negative samples to reduce unnecessary antibiotic use. This study aimed to develop and evaluate 6 machine learning models to predict UTIs. Methods Urine samples from 22 961 patients, collected between September 28, 2023 and June 29, 2024, were analyzed. Six machine learning models were assessed for their ability to predict UTIs based on 5 definitions incorporating pyuria and culture outcomes. The dataset was randomly divided into a training set (70%, n = 16 072) and an independent test set (30%, n = 6889). Seventeen predictive parameters, including dipstick reflectance results and demographic variables, were evaluated. Results The CatBoost Classifier emerged as the best-performing model, achieving an area under the ROC curve of 92.0%–94.7% depending on the UTI definition, with a negative predictive value consistently exceeding 95%, and an average precision ranging from 68.2% to 81.6%. In comparison, the predictive performance of nitrite and/or leukocyte esterase was significantly lower. Conclusion Machine learning models, particularly the CatBoost Classifier, demonstrate high accuracy and offer a promising tool to aid clinicians in UTI diagnosis. Unlike traditional culture methods, these models deliver results within an hour. Further external validation with an independent dataset and prospective studies assessing the impact on antibiotic prescribing practices is recommended.","PeriodicalId":10690,"journal":{"name":"Clinical chemistry","volume":"15 1","pages":""},"PeriodicalIF":6.3000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical chemistry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/clinchem/hvaf088","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background Urinary tract infections (UTIs) are among the most common infections encountered in healthcare settings. Current diagnostic practices often require 24–48 h due to the time needed for culture results. Given that 70%–80% of cultures return negative, there is significant interest in rapidly identifying negative samples to reduce unnecessary antibiotic use. This study aimed to develop and evaluate 6 machine learning models to predict UTIs. Methods Urine samples from 22 961 patients, collected between September 28, 2023 and June 29, 2024, were analyzed. Six machine learning models were assessed for their ability to predict UTIs based on 5 definitions incorporating pyuria and culture outcomes. The dataset was randomly divided into a training set (70%, n = 16 072) and an independent test set (30%, n = 6889). Seventeen predictive parameters, including dipstick reflectance results and demographic variables, were evaluated. Results The CatBoost Classifier emerged as the best-performing model, achieving an area under the ROC curve of 92.0%–94.7% depending on the UTI definition, with a negative predictive value consistently exceeding 95%, and an average precision ranging from 68.2% to 81.6%. In comparison, the predictive performance of nitrite and/or leukocyte esterase was significantly lower. Conclusion Machine learning models, particularly the CatBoost Classifier, demonstrate high accuracy and offer a promising tool to aid clinicians in UTI diagnosis. Unlike traditional culture methods, these models deliver results within an hour. Further external validation with an independent dataset and prospective studies assessing the impact on antibiotic prescribing practices is recommended.
期刊介绍:
Clinical Chemistry is a peer-reviewed scientific journal that is the premier publication for the science and practice of clinical laboratory medicine. It was established in 1955 and is associated with the Association for Diagnostics & Laboratory Medicine (ADLM).
The journal focuses on laboratory diagnosis and management of patients, and has expanded to include other clinical laboratory disciplines such as genomics, hematology, microbiology, and toxicology. It also publishes articles relevant to clinical specialties including cardiology, endocrinology, gastroenterology, genetics, immunology, infectious diseases, maternal-fetal medicine, neurology, nutrition, oncology, and pediatrics.
In addition to original research, editorials, and reviews, Clinical Chemistry features recurring sections such as clinical case studies, perspectives, podcasts, and Q&A articles. It has the highest impact factor among journals of clinical chemistry, laboratory medicine, pathology, analytical chemistry, transfusion medicine, and clinical microbiology.
The journal is indexed in databases such as MEDLINE and Web of Science.