An explainable machine learning-based prediction model for sarcopenia in elderly Chinese people with knee osteoarthritis
Ziyan Wang, Yuqin Zhou, Xing Zeng, Yi Zhou, Tao Yang, Kongfa Hu
Aging Clinical and Experimental Research, 37(1). Published 2025-03-07. DOI: 10.1007/s40520-025-02931-x
Full text: https://link.springer.com/article/10.1007/s40520-025-02931-x
Abstract
Background
Sarcopenia is an age-related progressive skeletal muscle disease that leads to loss of muscle mass and function, resulting in adverse health outcomes such as falls, functional decline, and death. Knee osteoarthritis (KOA) is a common chronic degenerative joint disease among elderly individuals that causes joint pain and functional impairment. The two conditions often coexist in elderly individuals and are closely related. Early identification of the risk of sarcopenia in KOA patients is crucial for developing intervention strategies and improving patient health.
Methods
This study utilized data from the China Health and Retirement Longitudinal Study (CHARLS), selecting symptomatic KOA patients aged 65 years and above and analyzing a total of 95 variables. Predictive factors were screened via least absolute shrinkage and selection operator (LASSO) regression and logistic regression. Eight machine learning algorithms were employed to construct predictive models, with internal cross-validation and independent test validation performed. The final selected model was analyzed via the SHapley Additive exPlanations (SHAP) method to enhance interpretability and clinical applicability. To facilitate clinical use, we developed a web application based on this model (http://106.54.231.169/).
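As a rough illustration of what SHAP attributes to each predictor, the sketch below computes exact Shapley values for a single prediction by enumerating feature coalitions. The scoring function `toy_risk`, the instance, and the all-zero baseline are hypothetical and are not taken from the study. Brute-force enumeration is only practical for a handful of features, so this shows the underlying idea rather than how the final model was analyzed.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one instance x relative to a baseline.

    Features outside a coalition are replaced by their baseline value.
    """
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight for coalitions of this size
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                with_i = [x[j] if (j in subset or j == i) else baseline[j]
                          for j in range(n)]
                without_i = [x[j] if j in subset else baseline[j]
                             for j in range(n)]
                # Marginal contribution of feature i to this coalition
                phi[i] += weight * (predict(with_i) - predict(without_i))
    return phi


def toy_risk(v):
    """Hypothetical score with one interaction term (not the study's model)."""
    return 0.5 * v[0] + 2.0 * v[1] + v[0] * v[2]


instance = [1.0, 1.0, 1.0]   # hypothetical patient, already scaled
baseline = [0.0, 0.0, 0.0]   # hypothetical reference point
phi = shapley_values(toy_risk, instance, baseline)
print(phi)
```

The attributions sum to the difference between the prediction for the instance and the prediction for the baseline, and the interaction term is split evenly between the two features involved.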
Results
The results indicate that six predictive factors—body mass index, upper arm length, marital status, total cholesterol, cystatin C, and shoulder pain—are closely associated with the risk of sarcopenia in KOA patients. CatBoost demonstrated excellent overall performance in both calibration analyses and probability estimates, reflecting accurate and dependable predictions. The final results on the independent test set (accuracy = 0.8902; F1 = 0.8627; AUC = 0.9697; Brier score = 0.0691) indicate that the model possesses strong predictive performance and excellent generalization ability, with predicted probabilities closely aligning with actual occurrence rates and thereby underscoring its reliability.
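For reference, the four reported test-set metrics can be computed from true labels and predicted probabilities as in the following sketch. The labels and probabilities are made-up toy values, not study data, and the 0.5 decision threshold is an assumption, since the abstract does not state the threshold used.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall for the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true, y_prob):
    """Probability that a random positive is ranked above a random negative."""
    pos = [p for t, p in zip(y_true, y_prob) if t == 1]
    neg = [p for t, p in zip(y_true, y_prob) if t == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0
               for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def brier_score(y_true, y_prob):
    """Mean squared error of predicted probabilities (lower is better)."""
    return sum((p - t) ** 2 for t, p in zip(y_true, y_prob)) / len(y_true)


# Toy values for illustration only (1 = sarcopenia, 0 = no sarcopenia)
y_true = [1, 0, 1, 0, 1, 0, 0, 1]
y_prob = [0.9, 0.2, 0.7, 0.4, 0.3, 0.1, 0.6, 0.8]
y_pred = [1 if p >= 0.5 else 0 for p in y_prob]  # assumed 0.5 threshold

acc = accuracy(y_true, y_pred)
f1 = f1_score(y_true, y_pred)
auc = roc_auc(y_true, y_prob)
brier = brier_score(y_true, y_prob)
print(acc, f1, auc, brier)
```

Accuracy and F1 depend on the chosen threshold, whereas AUC and the Brier score are computed directly from the probabilities. The Brier score is the one that reflects how closely predicted probabilities match observed outcomes.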
Conclusion
From the perspective of public health and aging, this study constructed an interpretable sarcopenia risk prediction model on the basis of routine clinical data. This model can be used for early screening and risk assessment of symptomatic KOA patients, assisting health departments and clinicians in the early detection and follow-up of relevant populations, thereby improving the quality of life and health outcomes of elderly individuals.
About the journal:
Aging Clinical and Experimental Research offers a multidisciplinary forum on the progressing field of gerontology and geriatrics. The areas covered by the journal include biogerontology, neurosciences, epidemiology, clinical gerontology and geriatric assessment, and social, economic, and behavioral gerontology. The journal appears bimonthly and publishes review articles, original papers, and case reports.