Diabetic Tibial Neuropathy Prediction: Improving interpretability of Various Machine-Learning Models Based on Multimodal-Ultrasound Features Using SHAP Methodology.
Yanqiu Chen, Zhen Sun, Huohu Zhong, Yuwei Chen, Xiuming Wu, Liyang Su, Zhenhan Lai, Tao Zheng, Guorong Lyu, Qichen Su
{"title":"Diabetic Tibial Neuropathy Prediction: Improving interpretability of Various Machine-Learning Models Based on Multimodal-Ultrasound Features Using SHAP Methodology.","authors":"Yanqiu Chen, Zhen Sun, Huohu Zhong, Yuwei Chen, Xiuming Wu, Liyang Su, Zhenhan Lai, Tao Zheng, Guorong Lyu, Qichen Su","doi":"10.1016/j.ultrasmedbio.2025.06.016","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study aimed to develop and evaluate eight machine learning models based on multimodal ultrasound to precisely predict of diabetic tibial neuropathy (DTN) in patients. Additionally, the SHapley Additive exPlanations(SHAP)framework was introduced to quantify the importance of each feature variable, providing a precise and noninvasive assessment tool for DTN patients, optimizing clinical management strategies, and enhancing patient prognosis.</p><p><strong>Methods: </strong>A prospective analysis was conducted using multimodal ultrasound and clinical data from 255 suspected DTN patients who visited the Second Affiliated Hospital of Fujian Medical University between January 2024 and November 2024. Key features were selected using Least Absolute Shrinkage and Selection Operator (LASSO) regression. Predictive models were constructed using Extreme Gradient Boosting (XGB), Logistic Regression, Support Vector Machines, k-Nearest Neighbors, Random Forest, Decision Tree, Naïve Bayes, and Neural Network. The SHAP method was employed to refine model interpretability. Furthermore, in order to verify the generalization degree of the model, this study also collected 135 patients from three other tertiary hospitals for external test.</p><p><strong>Results: </strong>LASSO regression identified Echo intensity(EI), Cross-sectional area (CSA), Mean elasticity value(Emean), Superb microvascular imaging(SMI), and History of smoking were key features for DTN prediction. The XGB model achieved an Area Under the Curve (AUC) of 0.94, 0.83 and 0.79 in the training, internal test and external test sets, respectively. SHAP analysis highlighted the ranking significance of EI, CSA, Emean, SMI, and History of smoking. Personalized prediction explanations provided by theSHAP values demonstrated the contribution of each feature to the final prediction, and enhancing model interpretability. Furthermore, decision plots depicted how different features influenced mispredictions, thereby facilitating further model optimization or feature adjustment.</p><p><strong>Conclusion: </strong>This study proposed a DTN prediction model based on machine-learning algorithms applied to multimodal ultrasound data. The results indicated the superior performance of the XGB model and its interpretability was enhanced using SHAP analysis. This cost-effective and user-friendly approach provides potential support for personalized treatment and precision medicine for DTN.</p>","PeriodicalId":49399,"journal":{"name":"Ultrasound in Medicine and Biology","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ultrasound in Medicine and Biology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ultrasmedbio.2025.06.016","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: This study aimed to develop and evaluate eight machine learning models based on multimodal ultrasound to precisely predict of diabetic tibial neuropathy (DTN) in patients. Additionally, the SHapley Additive exPlanations(SHAP)framework was introduced to quantify the importance of each feature variable, providing a precise and noninvasive assessment tool for DTN patients, optimizing clinical management strategies, and enhancing patient prognosis.
Methods: A prospective analysis was conducted using multimodal ultrasound and clinical data from 255 suspected DTN patients who visited the Second Affiliated Hospital of Fujian Medical University between January 2024 and November 2024. Key features were selected using Least Absolute Shrinkage and Selection Operator (LASSO) regression. Predictive models were constructed using Extreme Gradient Boosting (XGB), Logistic Regression, Support Vector Machines, k-Nearest Neighbors, Random Forest, Decision Tree, Naïve Bayes, and Neural Network. The SHAP method was employed to refine model interpretability. Furthermore, in order to verify the generalization degree of the model, this study also collected 135 patients from three other tertiary hospitals for external test.
Results: LASSO regression identified Echo intensity(EI), Cross-sectional area (CSA), Mean elasticity value(Emean), Superb microvascular imaging(SMI), and History of smoking were key features for DTN prediction. The XGB model achieved an Area Under the Curve (AUC) of 0.94, 0.83 and 0.79 in the training, internal test and external test sets, respectively. SHAP analysis highlighted the ranking significance of EI, CSA, Emean, SMI, and History of smoking. Personalized prediction explanations provided by theSHAP values demonstrated the contribution of each feature to the final prediction, and enhancing model interpretability. Furthermore, decision plots depicted how different features influenced mispredictions, thereby facilitating further model optimization or feature adjustment.
Conclusion: This study proposed a DTN prediction model based on machine-learning algorithms applied to multimodal ultrasound data. The results indicated the superior performance of the XGB model and its interpretability was enhanced using SHAP analysis. This cost-effective and user-friendly approach provides potential support for personalized treatment and precision medicine for DTN.
目的:本研究旨在建立和评估8种基于多模态超声的机器学习模型,以准确预测糖尿病性胫神经病变(DTN)患者。此外,引入SHapley加性解释(SHAP)框架,量化各特征变量的重要性,为DTN患者提供精确、无创的评估工具,优化临床管理策略,提高患者预后。方法:对2024年1月至2024年11月在福建医科大学第二附属医院就诊的255例疑似DTN患者的多模态超声及临床资料进行前瞻性分析。使用最小绝对收缩和选择算子(LASSO)回归选择关键特征。使用极端梯度增强(XGB)、逻辑回归、支持向量机、k近邻、随机森林、决策树、Naïve贝叶斯和神经网络构建预测模型。采用SHAP方法改进模型可解释性。此外,为了验证模型的泛化程度,本研究还收集了另外三家三级医院的135例患者进行外部检验。结果:LASSO回归发现回声强度(EI)、横截面积(CSA)、平均弹性值(Emean)、微血管成像(SMI)和吸烟史是预测DTN的关键特征。XGB模型在训练集、内部测试集和外部测试集的曲线下面积(Area Under The Curve, AUC)分别为0.94、0.83和0.79。SHAP分析强调了EI、CSA、Emean、SMI和吸烟史的排序意义。shhap值提供的个性化预测解释展示了每个特征对最终预测的贡献,并增强了模型的可解释性。此外,决策图描述了不同的特征如何影响错误预测,从而促进了进一步的模型优化或特征调整。结论:本研究提出了一种应用于多模态超声数据的基于机器学习算法的DTN预测模型。结果表明,采用SHAP分析增强了XGB模型的可解释性。这种具有成本效益和用户友好的方法为DTN的个性化治疗和精准医疗提供了潜在的支持。
期刊介绍:
Ultrasound in Medicine and Biology is the official journal of the World Federation for Ultrasound in Medicine and Biology. The journal publishes original contributions that demonstrate a novel application of an existing ultrasound technology in clinical diagnostic, interventional and therapeutic applications, new and improved clinical techniques, the physics, engineering and technology of ultrasound in medicine and biology, and the interactions between ultrasound and biological systems, including bioeffects. Papers that simply utilize standard diagnostic ultrasound as a measuring tool will be considered out of scope. Extended critical reviews of subjects of contemporary interest in the field are also published, in addition to occasional editorial articles, clinical and technical notes, book reviews, letters to the editor and a calendar of forthcoming meetings. It is the aim of the journal fully to meet the information and publication requirements of the clinicians, scientists, engineers and other professionals who constitute the biomedical ultrasonic community.