{"title":"Machine learning model to predict mortality in patients with skin and soft tissue infection in emergency department.","authors":"Yu-Wei Chen, Kai-Hsiang Wu, Po-Han Wu, Cheng-Ting Hsiao, Chiao-Hsuan Hsieh, Wen-Chih Fann, Leng-Chieh Lin, Chia-Peng Chang","doi":"10.1186/s13049-025-01463-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Accurately predicting mortality in patients with skin and soft-tissue infections (SSTIs) remains challenging. Machine learning models offer rapid processing, algorithmic impartiality, and strong predictive accuracy, which may improve early risk stratification in the emergency department (ED).</p><p><strong>Methods: </strong>We retrospectively analyzed clinical data from 1,294 ED patients diagnosed with SSTIs between March 2015 and December 2020. Five machine learning algorithms-logistic regression (LR), k-nearest neighbours (KNN), support vector machine (SVM), random forest (RF), and Extreme Gradient Boosting (XGBoost)-were developed using 20 candidate variables, with model performance evaluated in independent runs. A simplified XGBoost model using only the six most influential predictors was also derived for bedside application.</p><p><strong>Results: </strong>Among the five models, XGBoost achieved the highest performance (AUC = 0.892, sensitivity = 86.9%, specificity = 93.4%). The streamlined six-variable XGBoost model further improved predictive metrics (AUC = 0.922, sensitivity = 88.5%, specificity = 95.4%), matching or slightly surpassing the full model while reducing data requirements.</p><p><strong>Conclusions: </strong>XGBoost outperformed LR, KNN, SVM, and RF in predicting SSTI mortality, offering both higher accuracy and operational efficiency. Its sequential tree-building, regularization, and robust handling of missing data enable superior discrimination in tabular clinical datasets. The simplified model, requiring only standard admission variables, provides a fast, cost-effective, and highly accurate tool for early identification of high-risk patients in the ED.</p>","PeriodicalId":49292,"journal":{"name":"Scandinavian Journal of Trauma Resuscitation & Emergency Medicine","volume":"33 1","pages":"148"},"PeriodicalIF":3.1000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462322/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scandinavian Journal of Trauma Resuscitation & Emergency Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13049-025-01463-7","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EMERGENCY MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Accurately predicting mortality in patients with skin and soft-tissue infections (SSTIs) remains challenging. Machine learning models offer rapid processing, algorithmic impartiality, and strong predictive accuracy, which may improve early risk stratification in the emergency department (ED).
Methods: We retrospectively analyzed clinical data from 1,294 ED patients diagnosed with SSTIs between March 2015 and December 2020. Five machine learning algorithms-logistic regression (LR), k-nearest neighbours (KNN), support vector machine (SVM), random forest (RF), and Extreme Gradient Boosting (XGBoost)-were developed using 20 candidate variables, with model performance evaluated in independent runs. A simplified XGBoost model using only the six most influential predictors was also derived for bedside application.
Results: Among the five models, XGBoost achieved the highest performance (AUC = 0.892, sensitivity = 86.9%, specificity = 93.4%). The streamlined six-variable XGBoost model further improved predictive metrics (AUC = 0.922, sensitivity = 88.5%, specificity = 95.4%), matching or slightly surpassing the full model while reducing data requirements.
Conclusions: XGBoost outperformed LR, KNN, SVM, and RF in predicting SSTI mortality, offering both higher accuracy and operational efficiency. Its sequential tree-building, regularization, and robust handling of missing data enable superior discrimination in tabular clinical datasets. The simplified model, requiring only standard admission variables, provides a fast, cost-effective, and highly accurate tool for early identification of high-risk patients in the ED.
期刊介绍:
The primary topics of interest in Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine (SJTREM) are the pre-hospital and early in-hospital diagnostic and therapeutic aspects of emergency medicine, trauma, and resuscitation. Contributions focusing on dispatch, major incidents, etiology, pathophysiology, rehabilitation, epidemiology, prevention, education, training, implementation, work environment, as well as ethical and socio-economic aspects may also be assessed for publication.