Machine learning-enabled prediction of prolonged length of stay in hospital after surgery for tuberculosis spondylitis patients with unbalanced data: a novel approach using explainable artificial intelligence (XAI).

IF 2.8 3区医学 Q2 MEDICINE, RESEARCH & EXPERIMENTAL

European Journal of Medical Research Pub Date : 2024-07-25 DOI:10.1186/s40001-024-01988-0

Parhat Yasin, Yasen Yimit, Xiaoyu Cai, Abasi Aimaiti, Weibin Sheng, Mardan Mamat, Mayidili Nijiati

{"title":"Machine learning-enabled prediction of prolonged length of stay in hospital after surgery for tuberculosis spondylitis patients with unbalanced data: a novel approach using explainable artificial intelligence (XAI).","authors":"Parhat Yasin, Yasen Yimit, Xiaoyu Cai, Abasi Aimaiti, Weibin Sheng, Mardan Mamat, Mayidili Nijiati","doi":"10.1186/s40001-024-01988-0","DOIUrl":null,"url":null,"abstract":"Background: Tuberculosis spondylitis (TS), commonly known as Pott's disease, is a severe type of skeletal tuberculosis that typically requires surgical treatment. However, this treatment option has led to an increase in healthcare costs due to prolonged hospital stays (PLOS). Therefore, identifying risk factors associated with extended PLOS is necessary. In this research, we intended to develop an interpretable machine learning model that could predict extended PLOS, which can provide valuable insights for treatments and a web-based application was implemented.Methods: We obtained patient data from the spine surgery department at our hospital. Extended postoperative length of stay (PLOS) refers to a hospitalization duration equal to or exceeding the 75th percentile following spine surgery. To identify relevant variables, we employed several approaches, such as the least absolute shrinkage and selection operator (LASSO), recursive feature elimination (RFE) based on support vector machine classification (SVC), correlation analysis, and permutation importance value. Several models using implemented and some of them are ensembled using soft voting techniques. Models were constructed using grid search with nested cross-validation. The performance of each algorithm was assessed through various metrics, including the AUC value (area under the curve of receiver operating characteristics) and the Brier Score. Model interpretation involved utilizing methods such as Shapley additive explanations (SHAP), the Gini Impurity Index, permutation importance, and local interpretable model-agnostic explanations (LIME). Furthermore, to facilitate the practical application of the model, a web-based interface was developed and deployed.Results: The study included a cohort of 580 patients and 11 features include (CRP, transfusions, infusion volume, blood loss, X-ray bone bridge, X-ray osteophyte, CT-vertebral destruction, CT-paravertebral abscess, MRI-paravertebral abscess, MRI-epidural abscess, postoperative drainage) were selected. Most of the classifiers showed better performance, where the XGBoost model has a higher AUC value (0.86) and lower Brier Score (0.126). The XGBoost model was chosen as the optimal model. The results obtained from the calibration and decision curve analysis (DCA) plots demonstrate that XGBoost has achieved promising performance. After conducting tenfold cross-validation, the XGBoost model demonstrated a mean AUC of 0.85 ± 0.09. SHAP and LIME were used to display the variables' contributions to the predicted value. The stacked bar plots indicated that infusion volume was the primary contributor, as determined by Gini, permutation importance (PFI), and the LIME algorithm.Conclusions: Our methods not only effectively predicted extended PLOS but also identified risk factors that can be utilized for future treatments. The XGBoost model developed in this study is easily accessible through the deployed web application and can aid in clinical research.","PeriodicalId":11949,"journal":{"name":"European Journal of Medical Research","volume":null,"pages":null},"PeriodicalIF":2.8000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11270948/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40001-024-01988-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Tuberculosis spondylitis (TS), commonly known as Pott's disease, is a severe type of skeletal tuberculosis that typically requires surgical treatment. However, this treatment option has led to an increase in healthcare costs due to prolonged hospital stays (PLOS). Therefore, identifying risk factors associated with extended PLOS is necessary. In this research, we intended to develop an interpretable machine learning model that could predict extended PLOS, which can provide valuable insights for treatments and a web-based application was implemented.

Methods: We obtained patient data from the spine surgery department at our hospital. Extended postoperative length of stay (PLOS) refers to a hospitalization duration equal to or exceeding the 75th percentile following spine surgery. To identify relevant variables, we employed several approaches, such as the least absolute shrinkage and selection operator (LASSO), recursive feature elimination (RFE) based on support vector machine classification (SVC), correlation analysis, and permutation importance value. Several models using implemented and some of them are ensembled using soft voting techniques. Models were constructed using grid search with nested cross-validation. The performance of each algorithm was assessed through various metrics, including the AUC value (area under the curve of receiver operating characteristics) and the Brier Score. Model interpretation involved utilizing methods such as Shapley additive explanations (SHAP), the Gini Impurity Index, permutation importance, and local interpretable model-agnostic explanations (LIME). Furthermore, to facilitate the practical application of the model, a web-based interface was developed and deployed.

Results: The study included a cohort of 580 patients and 11 features include (CRP, transfusions, infusion volume, blood loss, X-ray bone bridge, X-ray osteophyte, CT-vertebral destruction, CT-paravertebral abscess, MRI-paravertebral abscess, MRI-epidural abscess, postoperative drainage) were selected. Most of the classifiers showed better performance, where the XGBoost model has a higher AUC value (0.86) and lower Brier Score (0.126). The XGBoost model was chosen as the optimal model. The results obtained from the calibration and decision curve analysis (DCA) plots demonstrate that XGBoost has achieved promising performance. After conducting tenfold cross-validation, the XGBoost model demonstrated a mean AUC of 0.85 ± 0.09. SHAP and LIME were used to display the variables' contributions to the predicted value. The stacked bar plots indicated that infusion volume was the primary contributor, as determined by Gini, permutation importance (PFI), and the LIME algorithm.

Conclusions: Our methods not only effectively predicted extended PLOS but also identified risk factors that can be utilized for future treatments. The XGBoost model developed in this study is easily accessible through the deployed web application and can aid in clinical research.

查看原文本刊更多论文

利用机器学习预测结核性脊柱炎患者手术后住院时间延长的不平衡数据：一种使用可解释人工智能（XAI）的新方法。

背景：结核性脊柱炎（TS）俗称波特氏病，是一种严重的骨骼结核，通常需要手术治疗。然而，这种治疗方法因住院时间延长（PLOS）而导致医疗费用增加。因此，有必要确定与延长住院时间相关的风险因素。在这项研究中，我们打算开发一个可解释的机器学习模型，该模型可以预测延长的住院时间，从而为治疗提供有价值的见解，并实现了一个基于网络的应用程序：我们从本院脊柱外科获得了患者数据。术后住院时间延长（PLOS）是指脊柱手术后住院时间等于或超过第 75 百分位数。为了确定相关变量，我们采用了多种方法，如最小绝对收缩和选择算子（LASSO）、基于支持向量机分类（SVC）的递归特征消除（RFE）、相关性分析和排列重要性值。使用软投票技术实现了多个模型，并对其中一些模型进行了组合。使用网格搜索和嵌套交叉验证构建模型。每种算法的性能都通过各种指标进行评估，包括 AUC 值（接收者操作特征曲线下面积）和 Brier 分数。对模型的解释则采用了夏普利加法解释（SHAP）、吉尼不纯指数（Gini Impurity Index）、置换重要性（permutation importance）和局部可解释模型失真解释（LIME）等方法。此外，为了便于模型的实际应用，还开发并部署了一个基于网络的界面：该研究纳入了 580 例患者，选择了 11 个特征（CRP、输血、输液量、失血、X 光骨桥、X 光骨质增生、CT-椎体破坏、CT-椎旁脓肿、MRI-椎旁脓肿、MRI-硬膜外脓肿、术后引流）。大多数分类器都显示出较好的性能，其中 XGBoost 模型的 AUC 值（0.86）较高，Brier Score 值（0.126）较低。XGBoost 模型被选为最佳模型。校准和决策曲线分析（DCA）图得出的结果表明，XGBoost 的性能很有前途。在进行十倍交叉验证后，XGBoost 模型的平均 AUC 为 0.85 ± 0.09。SHAP 和 LIME 用于显示变量对预测值的贡献。叠加条形图显示，输液量是主要的贡献因素，这是由基尼、置换重要性（PFI）和 LIME 算法决定的：我们的方法不仅能有效预测扩展的 PLOS，还能发现可用于未来治疗的风险因素。本研究开发的 XGBoost 模型可通过部署的网络应用程序轻松访问，有助于临床研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

European Journal of Medical Research 医学-医学：研究与实验

CiteScore

3.20

自引率

0.00%

发文量

247

审稿时长

>12 weeks

期刊介绍： European Journal of Medical Research publishes translational and clinical research of international interest across all medical disciplines, enabling clinicians and other researchers to learn about developments and innovations within these disciplines and across the boundaries between disciplines. The journal publishes high quality research and reviews and aims to ensure that the results of all well-conducted research are published, regardless of their outcome.