Integrating Feature Selection, Machine Learning, and SHAP Explainability to Predict Severe Acute Pancreatitis.

IF 3.3 · Zone 3 (Medicine) · JCR Q1 (MEDICINE, GENERAL & INTERNAL)

Diagnostics Pub Date : 2025-09-27 DOI:10.3390/diagnostics15192473

İzzet Ustaalioğlu, Rohat Ak



Abstract

Background/Objectives: Severe acute pancreatitis (SAP) carries substantial morbidity and resource burden, and early risk stratification remains challenging because conventional scores require serial observations. The aim of this study was to develop and compare supervised machine-learning (ML) pipelines that integrate feature selection and SHAP-based explainability for early prediction of SAP at emergency department (ED) presentation.

Methods: This retrospective, single-center cohort study was conducted in a tertiary-care ED between 1 January 2022 and 1 January 2025. Adult patients with acute pancreatitis were identified from electronic records; SAP was classified per the Revised Atlanta criteria (persistent organ failure ≥ 48 h). Six feature-selection methods (univariate AUROC filter, RFE, mRMR, LASSO, elastic net, Boruta) were paired with six classifiers (kNN, elastic-net logistic regression, MARS, random forest, SVM-RBF, XGBoost) to yield 36 pipelines. Discrimination, calibration, and error metrics were estimated with bootstrapping; SHAP was used for model interpretability.

Results: Of 743 patients (non-SAP 676; SAP 67), SAP prevalence was 9.0%. Compared with non-SAP patients, SAP patients more often had hypertension (38.8% vs. 27.1%) and malignancy (19.4% vs. 7.2%); they presented with lower GCS, higher heart and respiratory rates, lower systolic blood pressure, and more frequent peripancreatic fluid (31.3% vs. 16.9%) and pleural effusion (43.3% vs. 17.5%). Albumin was lower by 4.18 g/L, with broader renal-electrolyte and inflammatory derangements. Across the best-performing models, AUROC spanned 0.750-0.826; the top pipeline (RFE-RF features + kNN) reached 0.826, while random-forest-based pipelines showed favorable calibration. SHAP confirmed clinically plausible contributions from routinely available variables.

Conclusions: In this study, integrating feature selection with ML produced accurate and interpretable early prediction of SAP using data available at ED arrival. The approach highlights actionable predictors and may support earlier triage and resource allocation; external validation is warranted.
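
The study design described in the Methods can be made concrete with a small sketch. The Python below is not the authors' code: the abstract gives no implementation details, so the percentile bootstrap, the function names (`auroc`, `bootstrap_auroc_ci`, `synthetic_cohort`), and the simulated risk scores are all assumptions chosen for illustration. Only the method names, the 6 x 6 pairing, and the cohort counts (743 patients, 67 with SAP) come from the abstract.

```python
import itertools
import random

# Method names exactly as listed in the abstract.
SELECTORS = ["univariate AUROC filter", "RFE", "mRMR", "LASSO", "elastic net", "Boruta"]
CLASSIFIERS = ["kNN", "elastic-net logistic regression", "MARS",
               "random forest", "SVM-RBF", "XGBoost"]

# Every selector paired with every classifier: 6 x 6 = 36 pipelines.
PIPELINES = list(itertools.product(SELECTORS, CLASSIFIERS))

# Cohort counts reported in the Results.
N_TOTAL, N_SAP = 743, 67
PREVALENCE_PCT = round(100 * N_SAP / N_TOTAL, 1)


def auroc(labels, scores):
    """AUROC as the probability that a random positive outranks a random
    negative, with ties counted as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auroc_ci(labels, scores, n_boot=200, alpha=0.05, seed=0):
    """Percentile bootstrap interval for AUROC (an assumed variant; the
    abstract only says 'bootstrapping'). Patients are resampled with
    replacement; resamples containing a single class are redrawn."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:
            stats.append(auroc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


def synthetic_cohort(seed=0):
    """Made-up risk scores for a cohort with the abstract's class counts.
    These are NOT study data; they only exercise the functions above."""
    rng = random.Random(seed)
    labels = [1] * N_SAP + [0] * (N_TOTAL - N_SAP)
    scores = [rng.gauss(1.0 if y else 0.0, 1.0) for y in labels]
    return labels, scores
```

Calling `bootstrap_auroc_ci(*synthetic_cohort())` returns a lower and upper bound for the simulated scores. With only 67 positive cases, a resampled interval of this kind is a more honest summary of discrimination than a single point estimate, which is presumably why the study reports bootstrapped metrics.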




Source journal: Diagnostics (Biochemistry, Genetics and Molecular Biology-Clinical Biochemistry)

CiteScore: 4.70

Self-citation rate: 8.30%

Articles published: 2699

Review time: 19.64 days

Journal description: Diagnostics (ISSN 2075-4418) is an international scholarly open access journal on medical diagnostics. It publishes original research articles, reviews, communications and short notes on the research and development of medical diagnostics. There is no restriction on the length of the papers. Our aim is to encourage scientists to publish their experimental and theoretical research in as much detail as possible. Full experimental and/or methodological details must be provided for research articles.