Applying stacking ensemble method to predict chronic kidney disease progression in Chinese population based on laboratory information system: a retrospective study.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS
ACS Applied Bio Materials Pub Date : 2024-11-01 eCollection Date: 2024-01-01 DOI:10.7717/peerj.18436
Jialin Du, Jie Gao, Jie Guan, Bo Jin, Nan Duan, Lu Pang, Haiming Huang, Qian Ma, Chenwei Huang, Haixia Li
{"title":"Applying stacking ensemble method to predict chronic kidney disease progression in Chinese population based on laboratory information system: a retrospective study.","authors":"Jialin Du, Jie Gao, Jie Guan, Bo Jin, Nan Duan, Lu Pang, Haiming Huang, Qian Ma, Chenwei Huang, Haixia Li","doi":"10.7717/peerj.18436","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and objective: </strong>Chronic kidney disease (CKD) is a major public health issue, and accurate prediction of the progression of kidney failure is critical for clinical decision-making and helps improve patient outcomes. As such, we aimed to develop and externally validate a machine-learned model to predict the progression of CKD using common laboratory variables, demographic characteristics, and an electronic health records database.</p><p><strong>Methods: </strong>We developed a predictive model using longitudinal clinical data from a single center for Chinese CKD patients. The cohort included 987 patients who were followed up for more than 24 months. Fifty-three laboratory features were considered for inclusion in the model. The primary outcome in our study was an estimated glomerular filtration rate ≤15 mL/min/1.73 m<sup>2</sup> or kidney failure. Machine learning algorithms were applied to the modeling dataset (<i>n</i> = 296), and an external dataset (<i>n</i> = 71) was used for model validation. We assessed model discrimination <i>via</i> area under the curve (AUC) values, accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score.</p><p><strong>Results: </strong>Over a median follow-up period of 3.75 years, 148 patients experienced kidney failure. The optimal model was based on stacking different classifier algorithms with six laboratory features, including 24-h urine protein, potassium, glucose, urea, prealbumin and total protein. The model had considerable predictive power, with AUC values of 0.896 and 0.771 in the validation and external datasets, respectively. This model also accurately predicted the progression of renal function in patients over different follow-up periods after their initial assessment.</p><p><strong>Conclusions: </strong>A prediction model that leverages routinely collected laboratory features in the Chinese population can accurately identify patients with CKD at high risk of progressing to kidney failure. An online version of the model can be easily and quickly applied in clinical management and treatment.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11533905/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.7717/peerj.18436","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

Background and objective: Chronic kidney disease (CKD) is a major public health issue, and accurate prediction of the progression of kidney failure is critical for clinical decision-making and helps improve patient outcomes. As such, we aimed to develop and externally validate a machine-learned model to predict the progression of CKD using common laboratory variables, demographic characteristics, and an electronic health records database.

Methods: We developed a predictive model using longitudinal clinical data from a single center for Chinese CKD patients. The cohort included 987 patients who were followed up for more than 24 months. Fifty-three laboratory features were considered for inclusion in the model. The primary outcome in our study was an estimated glomerular filtration rate ≤15 mL/min/1.73 m2 or kidney failure. Machine learning algorithms were applied to the modeling dataset (n = 296), and an external dataset (n = 71) was used for model validation. We assessed model discrimination via area under the curve (AUC) values, accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score.

Results: Over a median follow-up period of 3.75 years, 148 patients experienced kidney failure. The optimal model was based on stacking different classifier algorithms with six laboratory features, including 24-h urine protein, potassium, glucose, urea, prealbumin and total protein. The model had considerable predictive power, with AUC values of 0.896 and 0.771 in the validation and external datasets, respectively. This model also accurately predicted the progression of renal function in patients over different follow-up periods after their initial assessment.

Conclusions: A prediction model that leverages routinely collected laboratory features in the Chinese population can accurately identify patients with CKD at high risk of progressing to kidney failure. An online version of the model can be easily and quickly applied in clinical management and treatment.

基于实验室信息系统的堆叠集合法预测中国人群慢性肾病进展:一项回顾性研究。
背景和目的:慢性肾脏病(CKD)是一个重大的公共卫生问题,准确预测肾衰竭的进展对临床决策至关重要,有助于改善患者的预后。因此,我们旨在利用常见的实验室变量、人口统计学特征和电子健康记录数据库,开发并从外部验证一个机器学习模型来预测 CKD 的进展:方法:我们利用来自一个中心的纵向临床数据,为中国的 CKD 患者开发了一个预测模型。研究对象包括随访超过 24 个月的 987 名患者。模型中考虑了 53 项实验室特征。我们研究的主要结果是估计肾小球滤过率≤15 mL/min/1.73 m2或肾衰竭。建模数据集(n = 296)采用机器学习算法,模型验证采用外部数据集(n = 71)。我们通过曲线下面积(AUC)值、准确性、灵敏度、特异性、阳性预测值、阴性预测值和 F1 分数来评估模型的区分度:中位随访期为 3.75 年,148 名患者出现肾衰竭。最佳模型基于不同分类器算法与六种实验室特征的叠加,包括 24 小时尿蛋白、钾、葡萄糖、尿素、前白蛋白和总蛋白。该模型具有相当高的预测能力,在验证数据集和外部数据集中的AUC值分别为0.896和0.771。该模型还能准确预测患者在初次评估后不同随访期内肾功能的进展情况:结论:利用在中国人群中常规收集的实验室特征建立的预测模型可以准确地识别出有进展到肾衰竭高风险的慢性肾功能衰竭患者。该模型的在线版本可方便快捷地应用于临床管理和治疗。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信