Xing Yang , Jianyuan Liu , Xiaozhi Huang , Hao Liang , Ping Cui , Shiran He , Heng Zhang , Wenping Liao , Guangkun Zhang , Qianqian Huang , Huan Ning , Tingyan Luo , Yinghua Luo , Wei Li , Jiegang Huang
{"title":"Machine learning-driven clinical decision support for low bone mineral density: A web-based prediction model with explainable AI integration","authors":"Xing Yang , Jianyuan Liu , Xiaozhi Huang , Hao Liang , Ping Cui , Shiran He , Heng Zhang , Wenping Liao , Guangkun Zhang , Qianqian Huang , Huan Ning , Tingyan Luo , Yinghua Luo , Wei Li , Jiegang Huang","doi":"10.1016/j.bone.2025.117592","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Low bone mineral density (LBMD), which includes osteopenia and osteoporosis, is associated with substantial health care costs. However, current diagnostic methods for LBMD are limited in terms of accuracy and accessibility. This study aims to develop an interpretable machine learning model for LBMD risk assessment and implement it as a web-based clinical decision support tool.</div></div><div><h3>Methods</h3><div>Data from subjects who underwent dual-energy X-ray absorptiometry (DXA) at the People's Hospital of Guangxi Zhuang Autonomous Region were collected and randomly divided into a training set (70 %) and an internal validation set (30 %). An external validation set was sourced from the National Health and Nutrition Examination Survey (NHANES) database. Least absolute shrinkage and selection operator (LASSO) regression and multiple logistic regression were used for feature selection. Ten common machine learning models were conducted based on the selected features. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), Matthews correlation coefficient (MCC), Brier score, and decision curve analysis (DCA). The decision mechanisms of the best-performing model were explained using SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME). The optimal model was deployed as a web application using Streamlit.</div></div><div><h3>Results</h3><div>A total of 16,274 participants were included in this study. Age, body mass index (BMI), alkaline phosphatase, and total cholesterol were identified as key predictors of LBMD. The logistic regression (LR) model demonstrated superior prediction performance (internal validation set [AUC = 0.902, MCC = 0.684, Brier score = 0.123], external validation set [0.812, 0.358, 0.265]). DCA confirmed its clinical utility. Both SHAP and LIME showed consistent results in identifying predictive factors. The LR model was deployed as a web application to predict LBMD.</div></div><div><h3>Conclusion</h3><div>Our interpretable machine learning model and web-based implementation provide a free and reliable tool for predicting LBMD, which represents a significant advancement in making LBMD screening more accessible and cost-effective.</div></div>","PeriodicalId":9301,"journal":{"name":"Bone","volume":"200 ","pages":"Article 117592"},"PeriodicalIF":3.6000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bone","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S8756328225002042","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Low bone mineral density (LBMD), which includes osteopenia and osteoporosis, is associated with substantial health care costs. However, current diagnostic methods for LBMD are limited in terms of accuracy and accessibility. This study aims to develop an interpretable machine learning model for LBMD risk assessment and implement it as a web-based clinical decision support tool.
Methods
Data from subjects who underwent dual-energy X-ray absorptiometry (DXA) at the People's Hospital of Guangxi Zhuang Autonomous Region were collected and randomly divided into a training set (70 %) and an internal validation set (30 %). An external validation set was sourced from the National Health and Nutrition Examination Survey (NHANES) database. Least absolute shrinkage and selection operator (LASSO) regression and multiple logistic regression were used for feature selection. Ten common machine learning models were conducted based on the selected features. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), Matthews correlation coefficient (MCC), Brier score, and decision curve analysis (DCA). The decision mechanisms of the best-performing model were explained using SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME). The optimal model was deployed as a web application using Streamlit.
Results
A total of 16,274 participants were included in this study. Age, body mass index (BMI), alkaline phosphatase, and total cholesterol were identified as key predictors of LBMD. The logistic regression (LR) model demonstrated superior prediction performance (internal validation set [AUC = 0.902, MCC = 0.684, Brier score = 0.123], external validation set [0.812, 0.358, 0.265]). DCA confirmed its clinical utility. Both SHAP and LIME showed consistent results in identifying predictive factors. The LR model was deployed as a web application to predict LBMD.
Conclusion
Our interpretable machine learning model and web-based implementation provide a free and reliable tool for predicting LBMD, which represents a significant advancement in making LBMD screening more accessible and cost-effective.
期刊介绍:
BONE is an interdisciplinary forum for the rapid publication of original articles and reviews on basic, translational, and clinical aspects of bone and mineral metabolism. The Journal also encourages submissions related to interactions of bone with other organ systems, including cartilage, endocrine, muscle, fat, neural, vascular, gastrointestinal, hematopoietic, and immune systems. Particular attention is placed on the application of experimental studies to clinical practice.