Predicting rapid kidney function decline in middle-aged and elderly Chinese adults using machine learning techniques.

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Yang Li, Kun Zou, Yixuan Wang, Yucheng Zhang, Jingtao Zhong, Wu Zhou, Fang Tang, Lu Peng, Xusheng Liu, Lili Deng
{"title":"Predicting rapid kidney function decline in middle-aged and elderly Chinese adults using machine learning techniques.","authors":"Yang Li, Kun Zou, Yixuan Wang, Yucheng Zhang, Jingtao Zhong, Wu Zhou, Fang Tang, Lu Peng, Xusheng Liu, Lili Deng","doi":"10.1186/s12911-025-03043-2","DOIUrl":null,"url":null,"abstract":"<p><p>The rapid decline of kidney function in middle-aged and elderly people has become an increasingly serious public health problem. Machine learning (ML) technology has substantial potential to disease prediction. The present study use dataset from the Chinese Health and Retirement Longitudinal Study (CHARLS) and utilizes advanced Gradient Boosting algorithms to develop predictive models. Least Absolute Shrinkage and Selection Operator (LASSO) regression was used to identify the key predictors, and multivariate logistic regression was utilized to validate the independent predictive power of the variables. Furthermore, the study integrated SHapley Additive exPlanations (SHAP) to boost the interpretability of the model. The findings show that the Gradient Boosting Model demonstrated robust performance across both the training and test datasets. Specifically, it attained AUC values of 0.8 and 0.765 in the training and test sets, respectively, while achieving accuracy scores of 0.736 and 0.728 in these two datasets. LASSO regression identified key influencing factors, including estimated glomerular filtration rate (eGFR), age, hemoglobin (Hb), glucose, and systolic blood pressure (SBP). Multivariate linear regression further confirmed the independent associations between these variables and rapid kidney function deterioration (P < 0.05). This study developed a risk assessment model for rapid kidney function deterioration that is applicable to middle-aged and elderly populations in China.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"210"},"PeriodicalIF":3.3000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12144772/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03043-2","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

The rapid decline of kidney function in middle-aged and elderly people has become an increasingly serious public health problem. Machine learning (ML) technology has substantial potential to disease prediction. The present study use dataset from the Chinese Health and Retirement Longitudinal Study (CHARLS) and utilizes advanced Gradient Boosting algorithms to develop predictive models. Least Absolute Shrinkage and Selection Operator (LASSO) regression was used to identify the key predictors, and multivariate logistic regression was utilized to validate the independent predictive power of the variables. Furthermore, the study integrated SHapley Additive exPlanations (SHAP) to boost the interpretability of the model. The findings show that the Gradient Boosting Model demonstrated robust performance across both the training and test datasets. Specifically, it attained AUC values of 0.8 and 0.765 in the training and test sets, respectively, while achieving accuracy scores of 0.736 and 0.728 in these two datasets. LASSO regression identified key influencing factors, including estimated glomerular filtration rate (eGFR), age, hemoglobin (Hb), glucose, and systolic blood pressure (SBP). Multivariate linear regression further confirmed the independent associations between these variables and rapid kidney function deterioration (P < 0.05). This study developed a risk assessment model for rapid kidney function deterioration that is applicable to middle-aged and elderly populations in China.

使用机器学习技术预测中国中老年成年人肾功能快速下降。
中老年人肾功能迅速下降已成为日益严重的公共卫生问题。机器学习(ML)技术在疾病预测方面具有巨大的潜力。本研究使用来自中国健康与退休纵向研究(CHARLS)的数据集,并利用先进的梯度增强算法建立预测模型。采用最小绝对收缩和选择算子(LASSO)回归识别关键预测因子,并采用多元逻辑回归验证变量的独立预测能力。此外,本研究还整合了SHapley加性解释(SHAP),以提高模型的可解释性。研究结果表明,梯度增强模型在训练和测试数据集上都表现出稳健的性能。其中,在训练集和测试集的AUC值分别为0.8和0.765,在这两个数据集的准确率得分分别为0.736和0.728。LASSO回归确定了关键的影响因素,包括估计的肾小球滤过率(eGFR)、年龄、血红蛋白(Hb)、葡萄糖和收缩压(SBP)。多元线性回归进一步证实了这些变量与肾功能快速恶化之间的独立关联(P
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信