{"title":"Development of a Longitudinal Model for Disability Prediction in Older Adults in China: Analysis of CHARLS Data (2015-2020).","authors":"Jingjing Chu, Ying Li, Xinyi Wang, Qun Xu, Zherong Xu","doi":"10.2196/66723","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Disability profoundly affects older adults' quality of life and imposes considerable burdens on health care systems in China's aging society. Timely predictive models are essential for early intervention.</p><p><strong>Objective: </strong>We aimed to build effective predictive models of disability for early intervention and management in older adults in China, integrating physical, cognitive, physiological, and psychological factors.</p><p><strong>Methods: </strong>Data from the China Health and Retirement Longitudinal Study (CHARLS), spanning from 2015 to 2020 and involving 2450 older individuals initially in good health, were analyzed. The dataset was randomly divided into a training set with 70% data and a testing set with 30% data. LASSO regression with 10-fold cross-validation identified key predictors, which were then used to develop an Extreme Gradient Boosting (XGBoost) model. Model performance was evaluated using receiever operating characteristic curves, calibration curves, and clinical decision and impact curves. Variable contributions were interpreted using SHapley Additive exPlanations (SHAP) values.</p><p><strong>Results: </strong>LASSO regression was used to evaluate 36 potential predictors, resulting in a model incorporating 9 key variables: age, hand grip strength, standing balance, the 5-repetition chair stand test (CS-5), pain, depression, cognition, respiratory function, and comorbidities. The XGBoost model demonstrated an area under the curve of 0.846 (95% CI 0.825-0.866) for the training set and 0.698 (95% CI 0.654-0.743) for the testing set. Calibration curves demonstrated reliable predictive accuracy, with mean absolute errors of 0.001 and 0.011 for the training and testing sets, respectively. Clinical decision and impact curves demonstrated significant utility across risk thresholds. SHAP analysis identified pain, respiratory function, and age as top predictors, highlighting their substantial roles in disability risk. Hand grip and the CS-5 also significantly influenced the model. A web-based application was developed for personalized risk assessment and decision-making.</p><p><strong>Conclusions: </strong>A reliable predictive model for 5-year disability risk in Chinese older adults was developed and validated. This model enables the identification of high-risk individuals, supports early interventions, and optimizes resource allocation. Future efforts will focus on updating the model with new CHARLS data and validating it with external datasets.</p>","PeriodicalId":36245,"journal":{"name":"JMIR Aging","volume":"8 ","pages":"e66723"},"PeriodicalIF":5.0000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12021300/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Aging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/66723","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Disability profoundly affects older adults' quality of life and imposes considerable burdens on health care systems in China's aging society. Timely predictive models are essential for early intervention.
Objective: We aimed to build effective predictive models of disability for early intervention and management in older adults in China, integrating physical, cognitive, physiological, and psychological factors.
Methods: Data from the China Health and Retirement Longitudinal Study (CHARLS), spanning from 2015 to 2020 and involving 2450 older individuals initially in good health, were analyzed. The dataset was randomly divided into a training set with 70% data and a testing set with 30% data. LASSO regression with 10-fold cross-validation identified key predictors, which were then used to develop an Extreme Gradient Boosting (XGBoost) model. Model performance was evaluated using receiever operating characteristic curves, calibration curves, and clinical decision and impact curves. Variable contributions were interpreted using SHapley Additive exPlanations (SHAP) values.
Results: LASSO regression was used to evaluate 36 potential predictors, resulting in a model incorporating 9 key variables: age, hand grip strength, standing balance, the 5-repetition chair stand test (CS-5), pain, depression, cognition, respiratory function, and comorbidities. The XGBoost model demonstrated an area under the curve of 0.846 (95% CI 0.825-0.866) for the training set and 0.698 (95% CI 0.654-0.743) for the testing set. Calibration curves demonstrated reliable predictive accuracy, with mean absolute errors of 0.001 and 0.011 for the training and testing sets, respectively. Clinical decision and impact curves demonstrated significant utility across risk thresholds. SHAP analysis identified pain, respiratory function, and age as top predictors, highlighting their substantial roles in disability risk. Hand grip and the CS-5 also significantly influenced the model. A web-based application was developed for personalized risk assessment and decision-making.
Conclusions: A reliable predictive model for 5-year disability risk in Chinese older adults was developed and validated. This model enables the identification of high-risk individuals, supports early interventions, and optimizes resource allocation. Future efforts will focus on updating the model with new CHARLS data and validating it with external datasets.
背景:在中国老龄化社会中,残疾严重影响老年人的生活质量,并给医疗保健系统带来相当大的负担。及时的预测模型对于早期干预至关重要。目的:综合身体、认知、生理和心理等因素,建立有效的残疾预测模型,为中国老年人早期干预和管理提供依据。方法:分析2015年至2020年中国健康与退休纵向研究(CHARLS)的数据,涉及2450名最初健康状况良好的老年人。数据集随机分为包含70%数据的训练集和包含30%数据的测试集。LASSO回归与10倍交叉验证确定了关键预测因子,然后用于开发极端梯度提升(XGBoost)模型。使用受试者工作特征曲线、校准曲线、临床决策和影响曲线评估模型的性能。变量贡献使用SHapley加性解释(SHAP)值进行解释。结果:LASSO回归评估了36个潜在的预测因素,建立了一个包含9个关键变量的模型:年龄、手握力量、站立平衡、5次重复椅子站立测试(CS-5)、疼痛、抑郁、认知、呼吸功能和合并症。XGBoost模型显示,训练集曲线下面积为0.846 (95% CI 0.825-0.866),测试集曲线下面积为0.698 (95% CI 0.654-0.743)。校准曲线显示出可靠的预测精度,训练集和测试集的平均绝对误差分别为0.001和0.011。临床决策和影响曲线显示了跨风险阈值的显著效用。SHAP分析发现疼痛、呼吸功能和年龄是最重要的预测因素,突出了它们在残疾风险中的重要作用。握力和CS-5对模型也有显著影响。开发了一个基于网络的应用程序,用于个性化风险评估和决策。结论:建立并验证了中国老年人5年残疾风险的可靠预测模型。该模型能够识别高风险个体,支持早期干预,并优化资源分配。未来的工作将集中在用新的CHARLS数据更新模型,并用外部数据集验证模型。