The application of artificial intelligence models in predicting the risk of diabetic foot: a multicenter study.

IF 6.1 3区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biodata Mining Pub Date : 2025-08-21 DOI:10.1186/s13040-025-00477-2

Yao Li, Siyuan Zhou, Bichen Ren, Shuai Ju, Xiaoyan Li, Wenqiang Li, Bingzhe Li, Yunmin Cai, Chunlei Chang, Lihong Huang, Zhihui Dong

{"title":"The application of artificial intelligence models in predicting the risk of diabetic foot: a multicenter study.","authors":"Yao Li, Siyuan Zhou, Bichen Ren, Shuai Ju, Xiaoyan Li, Wenqiang Li, Bingzhe Li, Yunmin Cai, Chunlei Chang, Lihong Huang, Zhihui Dong","doi":"10.1186/s13040-025-00477-2","DOIUrl":null,"url":null,"abstract":"<p><p>This study explores diabetic foot (DF), a severe complication in diabetes, by combining deep learning (DL) and machine learning (ML) to develop a multi-model prediction tool. Early identification of high-risk DF patients can reduce disability and mortality. The research also aims to create an integrated application to assist clinicians in precise, efficient risk assessment for early intervention. In this multicenter retrospective study, 6,180 elderly diabetic patients (aged 60-85) were enrolled from 11 community hospitals in Shanghai in 2024. Lasso regression was used to identify 16 key DF risk factors, including age, MMSE score, lower limb discomfort, ABI, and hematocrit. Fourteen ML models (RF, XGBoost, CART, MLP, etc.) and three DL models (DNN, CNN, Transformer) were trained, with hyperparameters optimized via cross-validation and grid search. An application was developed integrating these models, offering both single and batch prediction options with visualization tools for clinical use.Experimental results showed the Logistic regression ensemble model achieved robust performance, with AUC values of 0.943 (validation set, 95% CI: 0.935-0.951) and 0.938 (test set, 95% CI: 0.929-0.947), along with high accuracy, precision, recall, and F1 scores. SHAP analysis revealed key predictive features including ABI results, lower limb discomfort, and MMSE score. The developed app integrates multiple models, compares their predictions for different clinical scenarios, and enhances prediction transparency and reliability.The multi-model approach demonstrates strong predictive performance for DF risk, offering clinicians an intuitive and accurate assessment tool tailored to individual patients. By combining multiple models, we enhance result stability and clinical applicability compared to single-model approaches. Future work will focus on algorithm optimization, expanded datasets, and real-time monitoring integration to enable more precise, dynamic risk evaluation for improved DF prevention and early intervention.</p>","PeriodicalId":48947,"journal":{"name":"Biodata Mining","volume":"18 1","pages":"57"},"PeriodicalIF":6.1000,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12372307/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biodata Mining","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13040-025-00477-2","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

This study explores diabetic foot (DF), a severe complication in diabetes, by combining deep learning (DL) and machine learning (ML) to develop a multi-model prediction tool. Early identification of high-risk DF patients can reduce disability and mortality. The research also aims to create an integrated application to assist clinicians in precise, efficient risk assessment for early intervention. In this multicenter retrospective study, 6,180 elderly diabetic patients (aged 60-85) were enrolled from 11 community hospitals in Shanghai in 2024. Lasso regression was used to identify 16 key DF risk factors, including age, MMSE score, lower limb discomfort, ABI, and hematocrit. Fourteen ML models (RF, XGBoost, CART, MLP, etc.) and three DL models (DNN, CNN, Transformer) were trained, with hyperparameters optimized via cross-validation and grid search. An application was developed integrating these models, offering both single and batch prediction options with visualization tools for clinical use.Experimental results showed the Logistic regression ensemble model achieved robust performance, with AUC values of 0.943 (validation set, 95% CI: 0.935-0.951) and 0.938 (test set, 95% CI: 0.929-0.947), along with high accuracy, precision, recall, and F1 scores. SHAP analysis revealed key predictive features including ABI results, lower limb discomfort, and MMSE score. The developed app integrates multiple models, compares their predictions for different clinical scenarios, and enhances prediction transparency and reliability.The multi-model approach demonstrates strong predictive performance for DF risk, offering clinicians an intuitive and accurate assessment tool tailored to individual patients. By combining multiple models, we enhance result stability and clinical applicability compared to single-model approaches. Future work will focus on algorithm optimization, expanded datasets, and real-time monitoring integration to enable more precise, dynamic risk evaluation for improved DF prevention and early intervention.

Abstract Image

查看原文本刊更多论文

人工智能模型在预测糖尿病足风险中的应用：一项多中心研究。

本研究将深度学习（DL）和机器学习（ML）相结合，开发了一种多模型预测工具，探讨糖尿病严重并发症糖尿病足（DF）。早期发现高危DF患者可降低致残率和死亡率。该研究还旨在创建一个综合应用程序，以帮助临床医生对早期干预进行精确、有效的风险评估。在这项多中心回顾性研究中，于2024年从上海11家社区医院招募了6180例老年糖尿病患者（60-85岁）。使用Lasso回归确定16个关键DF危险因素，包括年龄、MMSE评分、下肢不适、ABI和红细胞压积。训练了14个ML模型（RF、XGBoost、CART、MLP等）和3个DL模型（DNN、CNN、Transformer），并通过交叉验证和网格搜索对超参数进行了优化。开发了一个集成这些模型的应用程序，提供单个和批量预测选项以及用于临床使用的可视化工具。实验结果表明，Logistic回归集成模型具有较好的稳健性，AUC值分别为0.943（验证集，95% CI: 0.935-0.951）和0.938（检验集，95% CI: 0.929-0.947），具有较高的正确率、精密度、召回率和F1分数。SHAP分析揭示了关键的预测特征，包括ABI结果、下肢不适和MMSE评分。开发的应用程序集成了多个模型，比较了不同临床情况下的预测，提高了预测的透明度和可靠性。多模型方法显示了对DF风险的强大预测性能，为临床医生提供了针对个体患者的直观和准确的评估工具。与单一模型方法相比，通过组合多个模型，我们提高了结果的稳定性和临床适用性。未来的工作将侧重于算法优化、扩展数据集和实时监测集成，以实现更精确、动态的风险评估，以改进DF预防和早期干预。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Biodata Mining MATHEMATICAL & COMPUTATIONAL BIOLOGY-

CiteScore

7.90

自引率

0.00%

发文量

审稿时长

23 weeks

期刊介绍： BioData Mining is an open access, open peer-reviewed journal encompassing research on all aspects of data mining applied to high-dimensional biological and biomedical data, focusing on computational aspects of knowledge discovery from large-scale genetic, transcriptomic, genomic, proteomic, and metabolomic data. Topical areas include, but are not limited to: -Development, evaluation, and application of novel data mining and machine learning algorithms. -Adaptation, evaluation, and application of traditional data mining and machine learning algorithms. -Open-source software for the application of data mining and machine learning algorithms. -Design, development and integration of databases, software and web services for the storage, management, retrieval, and analysis of data from large scale studies. -Pre-processing, post-processing, modeling, and interpretation of data mining and machine learning results for biological interpretation and knowledge discovery.