Pu Wang , Teng-Hui Chen , Mei-Yun Chang , Hai-Yen Hsia , Meng Dai , Yifan Liu , Yeong-Long Hsu , Feng Fu , Zhanqi Zhao
{"title":"An explainable artificial intelligence framework for weaning outcomes prediction using features from electrical impedance tomography","authors":"Pu Wang , Teng-Hui Chen , Mei-Yun Chang , Hai-Yen Hsia , Meng Dai , Yifan Liu , Yeong-Long Hsu , Feng Fu , Zhanqi Zhao","doi":"10.1016/j.cmpb.2025.108811","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Prolonged mechanical ventilation (PMV) might cause ventilator-associated pneumonia and diaphragmatic injury, and may lead to worsening clinical weaning outcomes. The present study proposes a comprehensive machine learning (ML) framework for predicting the weaning outcomes of patients with PMV, without relying on ventilator data, by utilizing features from electrical impedance tomography (EIT).</div></div><div><h3>Methods</h3><div>EIT data from 58 patients with PMV were analyzed. Extracted EIT image features were standardized using the min-max method. The Boruta method was employed to select significant features for the ML model. To balance the data, the SMOTE method was utilized. Ten ML algorithms commonly used in clinical prediction were compared. The SHAP and LIME methods were used to explain the ML models. Feature selection, data balancing, and hyperparameter adjustment all adopt the Leave-One-Out cross-validation method to avoid overfitting.</div></div><div><h3>Results</h3><div>The area under the receiver operating characteristic (AUC), specificity, and precision of the ML model with SMOTE balance were significantly improved (<em>p</em> < 0.05) compared to unbalanced data. However, the sensitivity was reduced considerably (<em>p</em> = 0.02). The optimal ML model, extreme gradient boost (XGBoost), demonstrated excellent performance: AUC = 0.862, sensitivity = 0.923, specificity = 0.800, accuracy = 0.889, precision = 0.923, and f-score = 0.923. Decision Curve Analysis and calibration curve evaluation indicated that the model has high clinical generality and reliability. The SHAP and LIME methods enabled model interpretation at both the global and individual sample levels.</div></div><div><h3>Conclusion</h3><div>The weaning outcome prediction model based on EIT data does not rely on ventilator data, which is suitable for a broader range of weaning scenarios. We proposed a comprehensive ML framework for weaning outcome prediction and incorporated the SHAP and LIME methods, which significantly improved the interpretability of the model.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"267 ","pages":"Article 108811"},"PeriodicalIF":4.9000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725002287","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Prolonged mechanical ventilation (PMV) might cause ventilator-associated pneumonia and diaphragmatic injury, and may lead to worsening clinical weaning outcomes. The present study proposes a comprehensive machine learning (ML) framework for predicting the weaning outcomes of patients with PMV, without relying on ventilator data, by utilizing features from electrical impedance tomography (EIT).
Methods
EIT data from 58 patients with PMV were analyzed. Extracted EIT image features were standardized using the min-max method. The Boruta method was employed to select significant features for the ML model. To balance the data, the SMOTE method was utilized. Ten ML algorithms commonly used in clinical prediction were compared. The SHAP and LIME methods were used to explain the ML models. Feature selection, data balancing, and hyperparameter adjustment all adopt the Leave-One-Out cross-validation method to avoid overfitting.
Results
The area under the receiver operating characteristic (AUC), specificity, and precision of the ML model with SMOTE balance were significantly improved (p < 0.05) compared to unbalanced data. However, the sensitivity was reduced considerably (p = 0.02). The optimal ML model, extreme gradient boost (XGBoost), demonstrated excellent performance: AUC = 0.862, sensitivity = 0.923, specificity = 0.800, accuracy = 0.889, precision = 0.923, and f-score = 0.923. Decision Curve Analysis and calibration curve evaluation indicated that the model has high clinical generality and reliability. The SHAP and LIME methods enabled model interpretation at both the global and individual sample levels.
Conclusion
The weaning outcome prediction model based on EIT data does not rely on ventilator data, which is suitable for a broader range of weaning scenarios. We proposed a comprehensive ML framework for weaning outcome prediction and incorporated the SHAP and LIME methods, which significantly improved the interpretability of the model.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.