中国老年膝关节骨关节炎患者肌肉减少症的可解释机器学习预测模型

IF 3.4 3区 医学 Q2 GERIATRICS & GERONTOLOGY
Ziyan Wang, Yuqin Zhou, Xing Zeng, Yi Zhou, Tao Yang, Kongfa Hu
{"title":"中国老年膝关节骨关节炎患者肌肉减少症的可解释机器学习预测模型","authors":"Ziyan Wang,&nbsp;Yuqin Zhou,&nbsp;Xing Zeng,&nbsp;Yi Zhou,&nbsp;Tao Yang,&nbsp;Kongfa Hu","doi":"10.1007/s40520-025-02931-x","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Sarcopenia is an age-related progressive skeletal muscle disease that leads to loss of muscle mass and function, resulting in adverse health outcomes such as falls, functional decline, and death. Knee osteoarthritis (KOA) is a common chronic degenerative joint disease among elderly individuals who causes joint pain and functional impairment. These two conditions often coexist in elderly individuals and are closely related. Early identification of the risk of sarcopenia in KOA patients is crucial for developing intervention strategies and improving patient health.</p><h3>Methods</h3><p>This study utilized data from the China Health and Retirement Longitudinal Study (CHARLS), selecting symptomatic KOA patients aged 65 years and above and analyzing a total of 95 variables. Predictive factors were screened via least absolute shrinkage and selection operator (LASSO) regression and logistic regression. Eight machine learning algorithms were employed to construct predictive models, with internal cross-validation and independent test validation performed. The final selected model was analyzed via the SHapley Additive exPlanations (SHAP) method to enhance interpretability and clinical applicability. To facilitate clinical use, we developed a web application based on this model (http://106.54.231.169/).</p><h3>Results</h3><p>The results indicate that six predictive factors—body mass index, upper arm length, marital status, total cholesterol, cystatin C, and shoulder pain—are closely associated with the risk of sarcopenia in KOA patients. CatBoost demonstrated excellent overall performance in both calibration analyses and probability estimates, reflecting accurate and dependable predictions. The final results on the independent test set (accuracy = 0.8902; F1 = 0.8627; AUC = 0.9697; Brier score = 0.0691) indicate that the model possesses strong predictive performance and excellent generalization ability, with predicted probabilities closely aligning with actual occurrence rates and thereby underscoring its reliability.</p><h3>Conclusion</h3><p>From the perspective of public health and aging, this study constructed an interpretable sarcopenia risk prediction model on the basis of routine clinical data. This model can be used for early screening and risk assessment of symptomatic KOA patients, assisting health departments and clinicians in the early detection and follow-up of relevant populations, thereby improving the quality of life and health outcomes of elderly individuals.</p></div>","PeriodicalId":7720,"journal":{"name":"Aging Clinical and Experimental Research","volume":"37 1","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s40520-025-02931-x.pdf","citationCount":"0","resultStr":"{\"title\":\"An explainable machine learning-based prediction model for sarcopenia in elderly Chinese people with knee osteoarthritis\",\"authors\":\"Ziyan Wang,&nbsp;Yuqin Zhou,&nbsp;Xing Zeng,&nbsp;Yi Zhou,&nbsp;Tao Yang,&nbsp;Kongfa Hu\",\"doi\":\"10.1007/s40520-025-02931-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>Sarcopenia is an age-related progressive skeletal muscle disease that leads to loss of muscle mass and function, resulting in adverse health outcomes such as falls, functional decline, and death. Knee osteoarthritis (KOA) is a common chronic degenerative joint disease among elderly individuals who causes joint pain and functional impairment. These two conditions often coexist in elderly individuals and are closely related. Early identification of the risk of sarcopenia in KOA patients is crucial for developing intervention strategies and improving patient health.</p><h3>Methods</h3><p>This study utilized data from the China Health and Retirement Longitudinal Study (CHARLS), selecting symptomatic KOA patients aged 65 years and above and analyzing a total of 95 variables. Predictive factors were screened via least absolute shrinkage and selection operator (LASSO) regression and logistic regression. Eight machine learning algorithms were employed to construct predictive models, with internal cross-validation and independent test validation performed. The final selected model was analyzed via the SHapley Additive exPlanations (SHAP) method to enhance interpretability and clinical applicability. To facilitate clinical use, we developed a web application based on this model (http://106.54.231.169/).</p><h3>Results</h3><p>The results indicate that six predictive factors—body mass index, upper arm length, marital status, total cholesterol, cystatin C, and shoulder pain—are closely associated with the risk of sarcopenia in KOA patients. CatBoost demonstrated excellent overall performance in both calibration analyses and probability estimates, reflecting accurate and dependable predictions. The final results on the independent test set (accuracy = 0.8902; F1 = 0.8627; AUC = 0.9697; Brier score = 0.0691) indicate that the model possesses strong predictive performance and excellent generalization ability, with predicted probabilities closely aligning with actual occurrence rates and thereby underscoring its reliability.</p><h3>Conclusion</h3><p>From the perspective of public health and aging, this study constructed an interpretable sarcopenia risk prediction model on the basis of routine clinical data. This model can be used for early screening and risk assessment of symptomatic KOA patients, assisting health departments and clinicians in the early detection and follow-up of relevant populations, thereby improving the quality of life and health outcomes of elderly individuals.</p></div>\",\"PeriodicalId\":7720,\"journal\":{\"name\":\"Aging Clinical and Experimental Research\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-03-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s40520-025-02931-x.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Aging Clinical and Experimental Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s40520-025-02931-x\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GERIATRICS & GERONTOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aging Clinical and Experimental Research","FirstCategoryId":"3","ListUrlMain":"https://link.springer.com/article/10.1007/s40520-025-02931-x","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

骨骼肌减少症是一种与年龄相关的进行性骨骼肌疾病,可导致肌肉质量和功能丧失,导致跌倒、功能下降和死亡等不良健康结果。膝关节骨性关节炎(KOA)是老年人常见的慢性退行性关节疾病,可引起关节疼痛和功能损害。这两种情况在老年人中经常共存,并且密切相关。早期识别KOA患者肌肉减少症的风险对于制定干预策略和改善患者健康至关重要。方法采用中国健康与退休纵向研究(CHARLS)数据,选取65岁及以上有症状的KOA患者,分析95个变量。通过最小绝对收缩和选择算子(LASSO)回归和逻辑回归筛选预测因素。采用8种机器学习算法构建预测模型,并进行内部交叉验证和独立测试验证。最终选择的模型通过SHapley加性解释(SHAP)方法进行分析,以提高可解释性和临床适用性。为了便于临床使用,我们基于该模型开发了一个web应用程序(http://106.54.231.169/)。结果体重指数、上臂长、婚姻状况、总胆固醇、胱抑素C、肩痛6个预测因素与KOA患者发生肌少症的风险密切相关。CatBoost在校准分析和概率估计方面均表现出出色的整体性能,反映了准确可靠的预测。独立测试集上的最终结果(准确率= 0.8902;f1 = 0.8627;auc = 0.9697;Brier评分= 0.0691)表明该模型具有较强的预测性能和较好的泛化能力,预测概率与实际发生概率基本一致,可靠性较强。结论本研究从公共卫生和老龄化的角度出发,在常规临床数据的基础上构建了可解释的肌少症风险预测模型。该模型可用于有症状的KOA患者的早期筛查和风险评估,协助卫生部门和临床医生对相关人群进行早期发现和随访,从而改善老年人的生活质量和健康结局。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An explainable machine learning-based prediction model for sarcopenia in elderly Chinese people with knee osteoarthritis

Background

Sarcopenia is an age-related progressive skeletal muscle disease that leads to loss of muscle mass and function, resulting in adverse health outcomes such as falls, functional decline, and death. Knee osteoarthritis (KOA) is a common chronic degenerative joint disease among elderly individuals who causes joint pain and functional impairment. These two conditions often coexist in elderly individuals and are closely related. Early identification of the risk of sarcopenia in KOA patients is crucial for developing intervention strategies and improving patient health.

Methods

This study utilized data from the China Health and Retirement Longitudinal Study (CHARLS), selecting symptomatic KOA patients aged 65 years and above and analyzing a total of 95 variables. Predictive factors were screened via least absolute shrinkage and selection operator (LASSO) regression and logistic regression. Eight machine learning algorithms were employed to construct predictive models, with internal cross-validation and independent test validation performed. The final selected model was analyzed via the SHapley Additive exPlanations (SHAP) method to enhance interpretability and clinical applicability. To facilitate clinical use, we developed a web application based on this model (http://106.54.231.169/).

Results

The results indicate that six predictive factors—body mass index, upper arm length, marital status, total cholesterol, cystatin C, and shoulder pain—are closely associated with the risk of sarcopenia in KOA patients. CatBoost demonstrated excellent overall performance in both calibration analyses and probability estimates, reflecting accurate and dependable predictions. The final results on the independent test set (accuracy = 0.8902; F1 = 0.8627; AUC = 0.9697; Brier score = 0.0691) indicate that the model possesses strong predictive performance and excellent generalization ability, with predicted probabilities closely aligning with actual occurrence rates and thereby underscoring its reliability.

Conclusion

From the perspective of public health and aging, this study constructed an interpretable sarcopenia risk prediction model on the basis of routine clinical data. This model can be used for early screening and risk assessment of symptomatic KOA patients, assisting health departments and clinicians in the early detection and follow-up of relevant populations, thereby improving the quality of life and health outcomes of elderly individuals.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.90
自引率
5.00%
发文量
283
审稿时长
1 months
期刊介绍: Aging clinical and experimental research offers a multidisciplinary forum on the progressing field of gerontology and geriatrics. The areas covered by the journal include: biogerontology, neurosciences, epidemiology, clinical gerontology and geriatric assessment, social, economical and behavioral gerontology. “Aging clinical and experimental research” appears bimonthly and publishes review articles, original papers and case reports.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信