Establishment and comparison of prediction models for early-stage diabetic kidney disease.

IF 2.9 3区医学 Q2 HEALTH CARE SCIENCES & SERVICES

DIGITAL HEALTH Pub Date : 2025-06-27 eCollection Date: 2025-01-01 DOI:10.1177/20552076251355448

Yingda Sheng, Jianguo Cheng, Caimei Zhang, Feifei Ma, Qian Xiao, Dan Wang, Jianwen Zhang, Xiaoqin Ha

{"title":"Establishment and comparison of prediction models for early-stage diabetic kidney disease.","authors":"Yingda Sheng, Jianguo Cheng, Caimei Zhang, Feifei Ma, Qian Xiao, Dan Wang, Jianwen Zhang, Xiaoqin Ha","doi":"10.1177/20552076251355448","DOIUrl":null,"url":null,"abstract":"Background: The construction of a model to estimate patients' status in early-stage diabetic kidney disease (ES-DKD) is needed. Thus, the risk factors playing a role in the disease diagnosis can be determined when routine examination outcomes are collected.Objective: Routine examination outcomes can also be used to predict patients' ES-DKD. A first-stage study is conducted on how successful conventional statistical models (CSMs) perform when sample sizes are small when compared to machine learning methods (MLMs).Methods: A total of 268 observations were collected from two tertiary hospitals in Lanzhou with demographic information, basic medical history, and routine laboratory tests such as blood routine, common biochemical tests, and urine routine. Then, conventional statistical methods and MLMs are applied to establish models separately to determine optimal prediction models. In addition, machine learning has also been applied to establish fused models to explore new modeling methods.Results: The validation set can better represent the actual performance of the models in clinical practice. Therefore, the comparisons are made based on the predictive performance of the two methods using the validation set. Ultimately, it was concluded that the ensemble model outperforms in terms of performance metrics. The CSMs perform poorly in terms of area under curve values. Compared to various MLMs, the performance of others is not inferior.Conclusion: This article establishes multiple ES-DKD prediction models using CSMs and MLMs. New ideas and methods for the diagnosis, treatment, and prevention of ES-DKD in clinical practice are presented. This article also compares two modeling methods. A comprehensive model was established, which has excellent predictive and generalization ability and stability. Therefore, the integration of the advantages of MLMs based on CSMs is a very fruitful attempt. Fused models have a high chance of being the main research direction for future research to develop better models.","PeriodicalId":51333,"journal":{"name":"DIGITAL HEALTH","volume":"11 ","pages":"20552076251355448"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12205200/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DIGITAL HEALTH","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/20552076251355448","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Background: The construction of a model to estimate patients' status in early-stage diabetic kidney disease (ES-DKD) is needed. Thus, the risk factors playing a role in the disease diagnosis can be determined when routine examination outcomes are collected.

Objective: Routine examination outcomes can also be used to predict patients' ES-DKD. A first-stage study is conducted on how successful conventional statistical models (CSMs) perform when sample sizes are small when compared to machine learning methods (MLMs).

Methods: A total of 268 observations were collected from two tertiary hospitals in Lanzhou with demographic information, basic medical history, and routine laboratory tests such as blood routine, common biochemical tests, and urine routine. Then, conventional statistical methods and MLMs are applied to establish models separately to determine optimal prediction models. In addition, machine learning has also been applied to establish fused models to explore new modeling methods.

Results: The validation set can better represent the actual performance of the models in clinical practice. Therefore, the comparisons are made based on the predictive performance of the two methods using the validation set. Ultimately, it was concluded that the ensemble model outperforms in terms of performance metrics. The CSMs perform poorly in terms of area under curve values. Compared to various MLMs, the performance of others is not inferior.

Conclusion: This article establishes multiple ES-DKD prediction models using CSMs and MLMs. New ideas and methods for the diagnosis, treatment, and prevention of ES-DKD in clinical practice are presented. This article also compares two modeling methods. A comprehensive model was established, which has excellent predictive and generalization ability and stability. Therefore, the integration of the advantages of MLMs based on CSMs is a very fruitful attempt. Fused models have a high chance of being the main research direction for future research to develop better models.

查看原文本刊更多论文

早期糖尿病肾病预测模型的建立与比较。

背景：需要建立早期糖尿病肾病（ES-DKD）患者状态评估模型。因此，当收集常规检查结果时，可以确定在疾病诊断中起作用的危险因素。目的：常规检查结果也可用于预测患者ES-DKD。第一阶段的研究是关于与机器学习方法（mlm）相比，当样本量较小时，传统统计模型（csm）的成功表现。方法：收集兰州市两所三级医院人口统计资料、基本病史、血常规、常见生化、尿常规等实验室检查结果268例。然后分别运用传统统计方法和传销模型建立模型，确定最优预测模型。此外，机器学习也被应用于建立融合模型，探索新的建模方法。结果：验证集能较好地代表模型在临床实践中的实际性能。因此，使用验证集对两种方法的预测性能进行比较。最后得出的结论是，集成模型在性能度量方面优于集成模型。csm在曲线值下的面积方面表现不佳。与各种传销相比，其他传销的业绩也不逊色。结论：本文建立了基于csm和MLMs的ES-DKD预测模型。为ES-DKD的诊断、治疗和预防提供了新的思路和方法。本文还比较了两种建模方法。建立的综合模型具有良好的预测、泛化能力和稳定性。因此，在csm的基础上整合传销的优势是一个非常有成效的尝试。融合模型很有可能成为未来研究的主要研究方向，以开发出更好的模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

DIGITAL HEALTH Multiple-

CiteScore

2.90

自引率

7.70%

发文量

302