[Prediction of depression symptoms in seniors and analysis of influencing factors based on explainable machine learning].

Q1 Medicine
W Y Su, S H Dong, H J Ge, Q Yu, G F Ma
{"title":"[Prediction of depression symptoms in seniors and analysis of influencing factors based on explainable machine learning].","authors":"W Y Su, S H Dong, H J Ge, Q Yu, G F Ma","doi":"10.3760/cma.j.cn112338-20240809-00488","DOIUrl":null,"url":null,"abstract":"<p><p><b>Objective:</b> This study aims to construct a machine learning model to predict depression symptoms in the elderly and analyze the key influencing factors of depression in the elderly using the shapley additive interpretation (SHAP) method. <b>Methods:</b> Based on entries from the 2018 China Health and Retirement Longitudinal Study database, a sample of 5 954 elderly individuals was selected. Feature selection using Support Vector Machine Recursive Feature Elimination, Extreme Gradient Boosting (XGBoost) - Recursive Feature Elimination (RFE), and the Lasso algorithm, which was combined with five classifiers-logistic regression, decision trees, random forests, support vector machines, and XGBoost-to explore the classification effectiveness for depressive symptoms in the elderly. Finally, the SHAP method was used to interpret the analysis of the model with the highest receiver operating characteristic curve areas under the curve (AUC). <b>Results:</b> The accuracy of 15 prediction models ranged from 0.702 to 0.743, with AUC between 0.730 and 0.795. Sensitivity was reported at 0.546 to 0.588, while specificity ranges from 0.783 to 0.865. The model XGBoost-RFE-XGBoost presented the highest AUC. Based on SHAP values, the top four factors influencing depressive symptoms in older adults were life satisfaction, duration of nighttime sleep, disability status, and self-rated health. <b>Conclusion:</b> This study developed a highly efficient and interpretable risk prediction model for depressive symptoms in older adults, which could help identify high-risk older adults and give personalized interventions.</p>","PeriodicalId":23968,"journal":{"name":"中华流行病学杂志","volume":"46 2","pages":"316-324"},"PeriodicalIF":0.0000,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华流行病学杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/cma.j.cn112338-20240809-00488","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: This study aims to construct a machine learning model to predict depression symptoms in the elderly and analyze the key influencing factors of depression in the elderly using the shapley additive interpretation (SHAP) method. Methods: Based on entries from the 2018 China Health and Retirement Longitudinal Study database, a sample of 5 954 elderly individuals was selected. Feature selection using Support Vector Machine Recursive Feature Elimination, Extreme Gradient Boosting (XGBoost) - Recursive Feature Elimination (RFE), and the Lasso algorithm, which was combined with five classifiers-logistic regression, decision trees, random forests, support vector machines, and XGBoost-to explore the classification effectiveness for depressive symptoms in the elderly. Finally, the SHAP method was used to interpret the analysis of the model with the highest receiver operating characteristic curve areas under the curve (AUC). Results: The accuracy of 15 prediction models ranged from 0.702 to 0.743, with AUC between 0.730 and 0.795. Sensitivity was reported at 0.546 to 0.588, while specificity ranges from 0.783 to 0.865. The model XGBoost-RFE-XGBoost presented the highest AUC. Based on SHAP values, the top four factors influencing depressive symptoms in older adults were life satisfaction, duration of nighttime sleep, disability status, and self-rated health. Conclusion: This study developed a highly efficient and interpretable risk prediction model for depressive symptoms in older adults, which could help identify high-risk older adults and give personalized interventions.

[基于可解释机器学习的老年人抑郁症状预测及影响因素分析]。
目的:构建预测老年人抑郁症状的机器学习模型,运用shapley加性解释(SHAP)方法分析老年人抑郁的关键影响因素。方法:基于2018年中国健康与退休纵向研究数据库的条目,选择5 954名老年人作为样本。采用支持向量机递归特征消除、极端梯度增强(XGBoost) -递归特征消除(RFE)和Lasso算法结合逻辑回归、决策树、随机森林、支持向量机和XGBoost五种分类器进行特征选择,探讨老年人抑郁症状的分类效果。最后,采用SHAP方法对具有最高受者工作特征曲线下面积(AUC)的模型进行解释分析。结果:15个预测模型的预测精度在0.702 ~ 0.743之间,AUC在0.730 ~ 0.795之间。敏感度为0.546 ~ 0.588,特异度为0.783 ~ 0.865。模型XGBoost-RFE-XGBoost的AUC最高。根据SHAP值,影响老年人抑郁症状的前四个因素是生活满意度、夜间睡眠时间、残疾状况和自评健康状况。结论:本研究建立了一种高效、可解释的老年人抑郁症状风险预测模型,有助于识别高危老年人并给予个性化干预。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
中华流行病学杂志
中华流行病学杂志 Medicine-Medicine (all)
CiteScore
5.60
自引率
0.00%
发文量
8981
期刊介绍: Chinese Journal of Epidemiology, established in 1981, is an advanced academic periodical in epidemiology and related disciplines in China, which, according to the principle of integrating theory with practice, mainly reports the major progress in epidemiological research. The columns of the journal include commentary, expert forum, original article, field investigation, disease surveillance, laboratory research, clinical epidemiology, basic theory or method and review, etc.  The journal is included by more than ten major biomedical databases and index systems worldwide, such as been indexed in Scopus, PubMed/MEDLINE, PubMed Central (PMC), Europe PubMed Central, Embase, Chemical Abstract, Chinese Science and Technology Paper and Citation Database (CSTPCD), Chinese core journal essentials overview, Chinese Science Citation Database (CSCD) core database, Chinese Biological Medical Disc (CBMdisc), and Chinese Medical Citation Index (CMCI), etc. It is one of the core academic journals and carefully selected core journals in preventive and basic medicine in China.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信