{"title":"Establishment of a Machine Learning-Based Predictive Model for <i>Klebsiella pneumoniae</i> Liver Abscess.","authors":"Haoran Li, Yan Yu, Xi Chen, Qingqing Sun, Xiumin Li, Qiujing Shang, Minghua Ying, Xiulin Liu, Jing Meng, Lele Bian, Shanshan Wu, Yuejuan Gao","doi":"10.2147/IDR.S545440","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To investigate the clinical and ultrasonographic characteristics of pyogenic liver abscess (PLA) caused by <i>Klebsiella pneumoniae</i> (K-PLA) and <i>non-Klebsiella pneumoniae</i> pathogens, and to develop machine learning models for the differential diagnosis of K-PLA.</p><p><strong>Materials and methods: </strong>In this retrospective study, patients clinically diagnosed with PLA and confirmed by ultrasound-guided puncture at the Fifth Medical Center of PLA General Hospital between April 2013 and December 2020 were enrolled. Based on the causative pathogens, patients were categorized into K-PLA and non-K-PLA groups. Baseline data, including ultrasonographic features, clinical characteristics, and laboratory findings, were collected. The Boruta algorithm was employed for feature selection, and four machine learning models-Deep Learning-Fully Connected Neural Network (deeplearning), Distributed Random Forest (drf), Gradient Boosting Machine (gbm), and Generalized Linear Model (glm)-were developed to diagnose K-PLA. The models were validated using 5-fold cross-validation.</p><p><strong>Results: </strong>A total of 201 patients with bacterial liver abscess were included (median age: 57 years; range: 49-66; 136 males), comprising 134 K-PLA cases and 67 non-K-PLA cases. The Boruta algorithm identified seven significant predictive variables: history of diabetes, history of hepatocellular carcinoma, history of biliary tract disease, history of infectious diseases, duration of fever, body temperature, and alanine aminotransferase (ALT) levels. Using these variables, the four machine learning models were constructed. In the training set, the area under the receiver operating characteristic curve (AUC) for predicting K-PLA was 0.716 (deeplearning), 0.999 (drf), 0.922 (gbm), and 0.718 (glm). In the validation set, the corresponding AUC values were 0.799, 0.763, 0.848, and 0.805, respectively.</p><p><strong>Conclusion: </strong>This study successfully established four machine learning models for predicting the risk of K-PLA, with the gbm-based model demonstrating the highest diagnostic performance. These models may facilitate early clinical diagnosis and treatment of K-PLA, thereby reducing antibiotic misuse.</p>","PeriodicalId":13577,"journal":{"name":"Infection and Drug Resistance","volume":"18 ","pages":"5097-5108"},"PeriodicalIF":2.9000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12477283/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infection and Drug Resistance","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/IDR.S545440","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: To investigate the clinical and ultrasonographic characteristics of pyogenic liver abscess (PLA) caused by Klebsiella pneumoniae (K-PLA) and non-Klebsiella pneumoniae pathogens, and to develop machine learning models for the differential diagnosis of K-PLA.
Materials and methods: In this retrospective study, patients clinically diagnosed with PLA and confirmed by ultrasound-guided puncture at the Fifth Medical Center of PLA General Hospital between April 2013 and December 2020 were enrolled. Based on the causative pathogens, patients were categorized into K-PLA and non-K-PLA groups. Baseline data, including ultrasonographic features, clinical characteristics, and laboratory findings, were collected. The Boruta algorithm was employed for feature selection, and four machine learning models-Deep Learning-Fully Connected Neural Network (deeplearning), Distributed Random Forest (drf), Gradient Boosting Machine (gbm), and Generalized Linear Model (glm)-were developed to diagnose K-PLA. The models were validated using 5-fold cross-validation.
Results: A total of 201 patients with bacterial liver abscess were included (median age: 57 years; range: 49-66; 136 males), comprising 134 K-PLA cases and 67 non-K-PLA cases. The Boruta algorithm identified seven significant predictive variables: history of diabetes, history of hepatocellular carcinoma, history of biliary tract disease, history of infectious diseases, duration of fever, body temperature, and alanine aminotransferase (ALT) levels. Using these variables, the four machine learning models were constructed. In the training set, the area under the receiver operating characteristic curve (AUC) for predicting K-PLA was 0.716 (deeplearning), 0.999 (drf), 0.922 (gbm), and 0.718 (glm). In the validation set, the corresponding AUC values were 0.799, 0.763, 0.848, and 0.805, respectively.
Conclusion: This study successfully established four machine learning models for predicting the risk of K-PLA, with the gbm-based model demonstrating the highest diagnostic performance. These models may facilitate early clinical diagnosis and treatment of K-PLA, thereby reducing antibiotic misuse.
期刊介绍:
About Journal
Editors
Peer Reviewers
Articles
Article Publishing Charges
Aims and Scope
Call For Papers
ISSN: 1178-6973
Editor-in-Chief: Professor Suresh Antony
An international, peer-reviewed, open access journal that focuses on the optimal treatment of infection (bacterial, fungal and viral) and the development and institution of preventative strategies to minimize the development and spread of resistance.