超学习者方法在医院T2DM初诊时预测糖尿病肾病的应用

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Xiaomeng Lin, Chao Liu, Huaiyu Wang, Xiaohui Fan, Linfeng Li, Jiming Xu, Changlin Li, Yao Wang, Xudong Cai, Xin Peng
{"title":"超学习者方法在医院T2DM初诊时预测糖尿病肾病的应用","authors":"Xiaomeng Lin, Chao Liu, Huaiyu Wang, Xiaohui Fan, Linfeng Li, Jiming Xu, Changlin Li, Yao Wang, Xudong Cai, Xin Peng","doi":"10.1186/s12911-025-02977-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Diabetic kidney disease (DKD) is a serious complication of diabetes mellitus (DM), with patients typically remaining asymptomatic until reaching an advanced stage. We aimed to develop and validate a predictive model for DKD in patients with an initial diagnosis of type 2 diabetes mellitus (T2DM) using real-world data.</p><p><strong>Methods: </strong>We retrospectively examined data from 3,291 patients (1740 men, 1551 women) newly diagnosed with T2DM at Ningbo Municipal Hospital of Traditional Chinese Medicine (2011-2023). The dataset was randomly divided into training and validation cohorts. Forty-six readily available medical characteristics at initial diagnosis of T2DM from the electronic medical records were used to develop prediction models based on linear, non-linear, and SuperLearner approaches. Model performance was evaluated using the area under the curve (AUC). SHapley Additive exPlanation (SHAP) was used to interpret the best-performing models.</p><p><strong>Results: </strong>Among 3291 participants, 563 (17.1%) were diagnosed with DKD during median follow-up of 2.53 years. The SuperLearner model exhibited the highest AUC (0.7138, 95% confidence interval: [0.673, 0.7546]) for the holdout internal validation set in predicting any DKD stage. Top-ranked features were WBC_Cnt*, Neut_Cnt, Hct, and Hb. High WBC_Cnt, low Neut_Cnt, high Hct, and low Hb levels were associated with an increased risk of DKD.</p><p><strong>Conclusions: </strong>We developed and validated a DKD risk prediction model for patients with newly diagnosed T2DM. Using routinely available clinical measurements, the SuperLearner model could predict DKD during hospital visits. Prediction accuracy and SHAP-based model interpretability may help improve early detection, targeted interventions, and prognosis of patients with DM.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"148"},"PeriodicalIF":3.3000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11948915/pdf/","citationCount":"0","resultStr":"{\"title\":\"A SuperLearner approach for predicting diabetic kidney disease upon the initial diagnosis of T2DM in hospital.\",\"authors\":\"Xiaomeng Lin, Chao Liu, Huaiyu Wang, Xiaohui Fan, Linfeng Li, Jiming Xu, Changlin Li, Yao Wang, Xudong Cai, Xin Peng\",\"doi\":\"10.1186/s12911-025-02977-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Diabetic kidney disease (DKD) is a serious complication of diabetes mellitus (DM), with patients typically remaining asymptomatic until reaching an advanced stage. We aimed to develop and validate a predictive model for DKD in patients with an initial diagnosis of type 2 diabetes mellitus (T2DM) using real-world data.</p><p><strong>Methods: </strong>We retrospectively examined data from 3,291 patients (1740 men, 1551 women) newly diagnosed with T2DM at Ningbo Municipal Hospital of Traditional Chinese Medicine (2011-2023). The dataset was randomly divided into training and validation cohorts. Forty-six readily available medical characteristics at initial diagnosis of T2DM from the electronic medical records were used to develop prediction models based on linear, non-linear, and SuperLearner approaches. Model performance was evaluated using the area under the curve (AUC). SHapley Additive exPlanation (SHAP) was used to interpret the best-performing models.</p><p><strong>Results: </strong>Among 3291 participants, 563 (17.1%) were diagnosed with DKD during median follow-up of 2.53 years. The SuperLearner model exhibited the highest AUC (0.7138, 95% confidence interval: [0.673, 0.7546]) for the holdout internal validation set in predicting any DKD stage. Top-ranked features were WBC_Cnt*, Neut_Cnt, Hct, and Hb. High WBC_Cnt, low Neut_Cnt, high Hct, and low Hb levels were associated with an increased risk of DKD.</p><p><strong>Conclusions: </strong>We developed and validated a DKD risk prediction model for patients with newly diagnosed T2DM. Using routinely available clinical measurements, the SuperLearner model could predict DKD during hospital visits. Prediction accuracy and SHAP-based model interpretability may help improve early detection, targeted interventions, and prognosis of patients with DM.</p>\",\"PeriodicalId\":9340,\"journal\":{\"name\":\"BMC Medical Informatics and Decision Making\",\"volume\":\"25 1\",\"pages\":\"148\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-03-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11948915/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Informatics and Decision Making\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12911-025-02977-x\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-02977-x","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

摘要

背景:糖尿病肾病(DKD)是糖尿病(DM)的一种严重并发症,患者通常在进入晚期之前没有症状。我们的目的是利用真实世界的数据,开发并验证初始诊断为2型糖尿病(T2DM)患者的DKD预测模型。方法:回顾性分析宁波市中医医院2011-2023年新诊断为T2DM的3291例患者(男性1740例,女性1551例)的资料。数据集随机分为训练组和验证组。利用电子病历中46个T2DM初始诊断时可获得的医学特征,建立基于线性、非线性和超级学习者方法的预测模型。使用曲线下面积(AUC)评估模型性能。使用SHapley加性解释(SHAP)来解释表现最好的模型。结果:在3291名参与者中,563名(17.1%)在中位随访2.53年期间被诊断为DKD。在预测任何DKD阶段时,SuperLearner模型在holdout内部验证集中显示出最高的AUC(0.7138, 95%置信区间:[0.673,0.7546])。排名靠前的特征是WBC_Cnt*、Neut_Cnt、Hct和Hb。高WBC_Cnt、低Neut_Cnt、高Hct和低Hb水平与DKD风险增加相关。结论:我们开发并验证了新诊断T2DM患者的DKD风险预测模型。使用常规可用的临床测量,超级学习者模型可以预测医院就诊期间的DKD。预测准确性和基于shap的模型可解释性可能有助于改善糖尿病患者的早期发现、有针对性的干预和预后。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A SuperLearner approach for predicting diabetic kidney disease upon the initial diagnosis of T2DM in hospital.

Background: Diabetic kidney disease (DKD) is a serious complication of diabetes mellitus (DM), with patients typically remaining asymptomatic until reaching an advanced stage. We aimed to develop and validate a predictive model for DKD in patients with an initial diagnosis of type 2 diabetes mellitus (T2DM) using real-world data.

Methods: We retrospectively examined data from 3,291 patients (1740 men, 1551 women) newly diagnosed with T2DM at Ningbo Municipal Hospital of Traditional Chinese Medicine (2011-2023). The dataset was randomly divided into training and validation cohorts. Forty-six readily available medical characteristics at initial diagnosis of T2DM from the electronic medical records were used to develop prediction models based on linear, non-linear, and SuperLearner approaches. Model performance was evaluated using the area under the curve (AUC). SHapley Additive exPlanation (SHAP) was used to interpret the best-performing models.

Results: Among 3291 participants, 563 (17.1%) were diagnosed with DKD during median follow-up of 2.53 years. The SuperLearner model exhibited the highest AUC (0.7138, 95% confidence interval: [0.673, 0.7546]) for the holdout internal validation set in predicting any DKD stage. Top-ranked features were WBC_Cnt*, Neut_Cnt, Hct, and Hb. High WBC_Cnt, low Neut_Cnt, high Hct, and low Hb levels were associated with an increased risk of DKD.

Conclusions: We developed and validated a DKD risk prediction model for patients with newly diagnosed T2DM. Using routinely available clinical measurements, the SuperLearner model could predict DKD during hospital visits. Prediction accuracy and SHAP-based model interpretability may help improve early detection, targeted interventions, and prognosis of patients with DM.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信