Qin Li, Nan Lin, Zuheng Wang, Yuexi Chen, Yuli Xie, Xuemei Wang, Jirui Tang, Yuling Xu, Min Xu, Na Lu, Yiqian Huang, Jiamin Luo, Zhenfang Liu, Li Jing
{"title":"Machine learning-based prognostic model for bloodstream infections in hematological malignancies using Th1/Th2 cytokines.","authors":"Qin Li, Nan Lin, Zuheng Wang, Yuexi Chen, Yuli Xie, Xuemei Wang, Jirui Tang, Yuling Xu, Min Xu, Na Lu, Yiqian Huang, Jiamin Luo, Zhenfang Liu, Li Jing","doi":"10.1186/s12879-025-10808-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Bloodstream infection (BSI) is a significant cause of mortality in patients with hematologic malignancies(HMs), particularly amid rising antibiotic resistance. This study aimed to analyze pathogen distribution, drug-resistance patterns and develop a novel predictive model for 30-day mortality in HM patients with BSIs.</p><p><strong>Methods: </strong>A retrospective analysis of 231 HM patients with positive blood cultures was conducted. Logistic regression identified risk factors for 30-day mortality. Th1/Th2 cytokines were collected at BSI onset, with LASSO regression and restricted cubic spline analysis used to refine predictors. Seven machine learning(ML) algorithm (XGBoost, Logistic Regression, LightGBM, RandomForest, AdaBoost, GBDT and GNB) were trained using 10-fold cross-validation and model performance was evaluated with the ROC, calibration plots, decision and learning curves and the Shapley Additive Explanations (SHAP) analysis. The predictive model was developed by integrating Th1/Th2 cytokines with clinical features, aiming to enhance the accuracy of 30-day mortality prediction.</p><p><strong>Results: </strong>Among the cohort, acute myeloid leukemia (38%) was the most common HM, while gram negative bacteria (64%) were the predominant pathogens causing BSI. Age, polymicrobial BSI, IL-4, IL-6 and AST levels were significant predictors of 30-day mortality. The Logistic Regression model achieved AUCs of 0.802, 0.792, and 0.822 in training, validation, and test cohorts, respectively, with strong calibration and clinical benefit shown in decision curves. SHAP analysis highlighted IL-4 and IL-6 as key predictors.</p><p><strong>Conclusions: </strong>This study introduces a novel ML-based model integrating Th1/Th2 cytokines and clinical features to predict 30-day mortality in HM patients with BSIs, demonstrating strong performance and clinical applicability.</p>","PeriodicalId":8981,"journal":{"name":"BMC Infectious Diseases","volume":"25 1","pages":"415"},"PeriodicalIF":3.4000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11948653/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Infectious Diseases","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12879-025-10808-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: Bloodstream infection (BSI) is a significant cause of mortality in patients with hematologic malignancies(HMs), particularly amid rising antibiotic resistance. This study aimed to analyze pathogen distribution, drug-resistance patterns and develop a novel predictive model for 30-day mortality in HM patients with BSIs.
Methods: A retrospective analysis of 231 HM patients with positive blood cultures was conducted. Logistic regression identified risk factors for 30-day mortality. Th1/Th2 cytokines were collected at BSI onset, with LASSO regression and restricted cubic spline analysis used to refine predictors. Seven machine learning(ML) algorithm (XGBoost, Logistic Regression, LightGBM, RandomForest, AdaBoost, GBDT and GNB) were trained using 10-fold cross-validation and model performance was evaluated with the ROC, calibration plots, decision and learning curves and the Shapley Additive Explanations (SHAP) analysis. The predictive model was developed by integrating Th1/Th2 cytokines with clinical features, aiming to enhance the accuracy of 30-day mortality prediction.
Results: Among the cohort, acute myeloid leukemia (38%) was the most common HM, while gram negative bacteria (64%) were the predominant pathogens causing BSI. Age, polymicrobial BSI, IL-4, IL-6 and AST levels were significant predictors of 30-day mortality. The Logistic Regression model achieved AUCs of 0.802, 0.792, and 0.822 in training, validation, and test cohorts, respectively, with strong calibration and clinical benefit shown in decision curves. SHAP analysis highlighted IL-4 and IL-6 as key predictors.
Conclusions: This study introduces a novel ML-based model integrating Th1/Th2 cytokines and clinical features to predict 30-day mortality in HM patients with BSIs, demonstrating strong performance and clinical applicability.
期刊介绍:
BMC Infectious Diseases is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of infectious and sexually transmitted diseases in humans, as well as related molecular genetics, pathophysiology, and epidemiology.