{"title":"人工智能模型在预测糖尿病足风险中的应用:一项多中心研究。","authors":"Yao Li, Siyuan Zhou, Bichen Ren, Shuai Ju, Xiaoyan Li, Wenqiang Li, Bingzhe Li, Yunmin Cai, Chunlei Chang, Lihong Huang, Zhihui Dong","doi":"10.1186/s13040-025-00477-2","DOIUrl":null,"url":null,"abstract":"<p><p>This study explores diabetic foot (DF), a severe complication in diabetes, by combining deep learning (DL) and machine learning (ML) to develop a multi-model prediction tool. Early identification of high-risk DF patients can reduce disability and mortality. The research also aims to create an integrated application to assist clinicians in precise, efficient risk assessment for early intervention. In this multicenter retrospective study, 6,180 elderly diabetic patients (aged 60-85) were enrolled from 11 community hospitals in Shanghai in 2024. Lasso regression was used to identify 16 key DF risk factors, including age, MMSE score, lower limb discomfort, ABI, and hematocrit. Fourteen ML models (RF, XGBoost, CART, MLP, etc.) and three DL models (DNN, CNN, Transformer) were trained, with hyperparameters optimized via cross-validation and grid search. An application was developed integrating these models, offering both single and batch prediction options with visualization tools for clinical use.Experimental results showed the Logistic regression ensemble model achieved robust performance, with AUC values of 0.943 (validation set, 95% CI: 0.935-0.951) and 0.938 (test set, 95% CI: 0.929-0.947), along with high accuracy, precision, recall, and F1 scores. SHAP analysis revealed key predictive features including ABI results, lower limb discomfort, and MMSE score. The developed app integrates multiple models, compares their predictions for different clinical scenarios, and enhances prediction transparency and reliability.The multi-model approach demonstrates strong predictive performance for DF risk, offering clinicians an intuitive and accurate assessment tool tailored to individual patients. By combining multiple models, we enhance result stability and clinical applicability compared to single-model approaches. Future work will focus on algorithm optimization, expanded datasets, and real-time monitoring integration to enable more precise, dynamic risk evaluation for improved DF prevention and early intervention.</p>","PeriodicalId":48947,"journal":{"name":"Biodata Mining","volume":"18 1","pages":"57"},"PeriodicalIF":6.1000,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12372307/pdf/","citationCount":"0","resultStr":"{\"title\":\"The application of artificial intelligence models in predicting the risk of diabetic foot: a multicenter study.\",\"authors\":\"Yao Li, Siyuan Zhou, Bichen Ren, Shuai Ju, Xiaoyan Li, Wenqiang Li, Bingzhe Li, Yunmin Cai, Chunlei Chang, Lihong Huang, Zhihui Dong\",\"doi\":\"10.1186/s13040-025-00477-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This study explores diabetic foot (DF), a severe complication in diabetes, by combining deep learning (DL) and machine learning (ML) to develop a multi-model prediction tool. Early identification of high-risk DF patients can reduce disability and mortality. The research also aims to create an integrated application to assist clinicians in precise, efficient risk assessment for early intervention. In this multicenter retrospective study, 6,180 elderly diabetic patients (aged 60-85) were enrolled from 11 community hospitals in Shanghai in 2024. Lasso regression was used to identify 16 key DF risk factors, including age, MMSE score, lower limb discomfort, ABI, and hematocrit. Fourteen ML models (RF, XGBoost, CART, MLP, etc.) and three DL models (DNN, CNN, Transformer) were trained, with hyperparameters optimized via cross-validation and grid search. An application was developed integrating these models, offering both single and batch prediction options with visualization tools for clinical use.Experimental results showed the Logistic regression ensemble model achieved robust performance, with AUC values of 0.943 (validation set, 95% CI: 0.935-0.951) and 0.938 (test set, 95% CI: 0.929-0.947), along with high accuracy, precision, recall, and F1 scores. SHAP analysis revealed key predictive features including ABI results, lower limb discomfort, and MMSE score. The developed app integrates multiple models, compares their predictions for different clinical scenarios, and enhances prediction transparency and reliability.The multi-model approach demonstrates strong predictive performance for DF risk, offering clinicians an intuitive and accurate assessment tool tailored to individual patients. By combining multiple models, we enhance result stability and clinical applicability compared to single-model approaches. Future work will focus on algorithm optimization, expanded datasets, and real-time monitoring integration to enable more precise, dynamic risk evaluation for improved DF prevention and early intervention.</p>\",\"PeriodicalId\":48947,\"journal\":{\"name\":\"Biodata Mining\",\"volume\":\"18 1\",\"pages\":\"57\"},\"PeriodicalIF\":6.1000,\"publicationDate\":\"2025-08-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12372307/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biodata Mining\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s13040-025-00477-2\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATHEMATICAL & COMPUTATIONAL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biodata Mining","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13040-025-00477-2","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
The application of artificial intelligence models in predicting the risk of diabetic foot: a multicenter study.
This study explores diabetic foot (DF), a severe complication in diabetes, by combining deep learning (DL) and machine learning (ML) to develop a multi-model prediction tool. Early identification of high-risk DF patients can reduce disability and mortality. The research also aims to create an integrated application to assist clinicians in precise, efficient risk assessment for early intervention. In this multicenter retrospective study, 6,180 elderly diabetic patients (aged 60-85) were enrolled from 11 community hospitals in Shanghai in 2024. Lasso regression was used to identify 16 key DF risk factors, including age, MMSE score, lower limb discomfort, ABI, and hematocrit. Fourteen ML models (RF, XGBoost, CART, MLP, etc.) and three DL models (DNN, CNN, Transformer) were trained, with hyperparameters optimized via cross-validation and grid search. An application was developed integrating these models, offering both single and batch prediction options with visualization tools for clinical use.Experimental results showed the Logistic regression ensemble model achieved robust performance, with AUC values of 0.943 (validation set, 95% CI: 0.935-0.951) and 0.938 (test set, 95% CI: 0.929-0.947), along with high accuracy, precision, recall, and F1 scores. SHAP analysis revealed key predictive features including ABI results, lower limb discomfort, and MMSE score. The developed app integrates multiple models, compares their predictions for different clinical scenarios, and enhances prediction transparency and reliability.The multi-model approach demonstrates strong predictive performance for DF risk, offering clinicians an intuitive and accurate assessment tool tailored to individual patients. By combining multiple models, we enhance result stability and clinical applicability compared to single-model approaches. Future work will focus on algorithm optimization, expanded datasets, and real-time monitoring integration to enable more precise, dynamic risk evaluation for improved DF prevention and early intervention.
期刊介绍:
BioData Mining is an open access, open peer-reviewed journal encompassing research on all aspects of data mining applied to high-dimensional biological and biomedical data, focusing on computational aspects of knowledge discovery from large-scale genetic, transcriptomic, genomic, proteomic, and metabolomic data.
Topical areas include, but are not limited to:
-Development, evaluation, and application of novel data mining and machine learning algorithms.
-Adaptation, evaluation, and application of traditional data mining and machine learning algorithms.
-Open-source software for the application of data mining and machine learning algorithms.
-Design, development and integration of databases, software and web services for the storage, management, retrieval, and analysis of data from large scale studies.
-Pre-processing, post-processing, modeling, and interpretation of data mining and machine learning results for biological interpretation and knowledge discovery.