基于Boruta特征选择和随机森林的增殖性糖尿病视网膜病变继发新生血管性青光眼风险预测模型

IF 4.6 2区 生物学 Q2 CELL BIOLOGY
Frontiers in Cell and Developmental Biology Pub Date : 2025-06-27 eCollection Date: 2025-01-01 DOI:10.3389/fcell.2025.1604832
Zihan Huang, Di Gong, Cuicui Tang, Jinghui Wang, Chenchen Zhang, Kuanrong Dang, Xiaoyan Chai, Jiantao Wang, Zhichao Yan
{"title":"基于Boruta特征选择和随机森林的增殖性糖尿病视网膜病变继发新生血管性青光眼风险预测模型","authors":"Zihan Huang, Di Gong, Cuicui Tang, Jinghui Wang, Chenchen Zhang, Kuanrong Dang, Xiaoyan Chai, Jiantao Wang, Zhichao Yan","doi":"10.3389/fcell.2025.1604832","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Neovascular glaucoma (NVG) is one of the most severe complications of proliferative diabetic retinopathy (PDR), carrying a high risk of blindness. Establishing an effective risk prediction model can assist clinicians in early identification of high-risk patients and implementing personalized interventions to reduce the incidence of vision impairment. This study aimed to develop and evaluate a risk prediction model for NVG in PDR patients based on the Boruta feature selection method and random forest algorithm to improve clinical predictive performance.</p><p><strong>Methods: </strong>This retrospective study included 365 PDR patients treated at Shenzhen Eye Hospital between January 2019 and December 2024, comprising 269 controls (non-NVG) and 96 cases (NVG). The Boruta feature selection method was employed to identify key features associated with NVG development in PDR. A risk prediction model was then constructed using the random forest algorithm. Model performance was evaluated based on accuracy, sensitivity, specificity, and area under the curve (AUC). Additionally, calibration curves and decision curve analysis (DCA) were used to assess clinical utility. All data analyses and modeling were performed in R (version 4.2.3).</p><p><strong>Results: </strong>The Boruta algorithm selected 12 significant predictive features. The random forest-based model achieved an accuracy of 90.74%, sensitivity of 82.14%, specificity of 93.75%, and an AUC of 0.87, demonstrating strong predictive performance. Calibration curves indicated reliable prediction probabilities within the 0.4-0.8 range. Decision curve analysis revealed substantial clinical net benefit across threshold probabilities of 0.2-0.8.</p><p><strong>Conclusion: </strong>The Boruta-guided random forest model developed in this study exhibits excellent predictive performance and clinical applicability for assessing NVG risk in PDR patients.</p>","PeriodicalId":12448,"journal":{"name":"Frontiers in Cell and Developmental Biology","volume":"13 ","pages":"1604832"},"PeriodicalIF":4.6000,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247303/pdf/","citationCount":"0","resultStr":"{\"title\":\"A risk prediction model for neovascular glaucoma secondary to proliferative diabetic retinopathy based on Boruta feature selection and random forest.\",\"authors\":\"Zihan Huang, Di Gong, Cuicui Tang, Jinghui Wang, Chenchen Zhang, Kuanrong Dang, Xiaoyan Chai, Jiantao Wang, Zhichao Yan\",\"doi\":\"10.3389/fcell.2025.1604832\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Neovascular glaucoma (NVG) is one of the most severe complications of proliferative diabetic retinopathy (PDR), carrying a high risk of blindness. Establishing an effective risk prediction model can assist clinicians in early identification of high-risk patients and implementing personalized interventions to reduce the incidence of vision impairment. This study aimed to develop and evaluate a risk prediction model for NVG in PDR patients based on the Boruta feature selection method and random forest algorithm to improve clinical predictive performance.</p><p><strong>Methods: </strong>This retrospective study included 365 PDR patients treated at Shenzhen Eye Hospital between January 2019 and December 2024, comprising 269 controls (non-NVG) and 96 cases (NVG). The Boruta feature selection method was employed to identify key features associated with NVG development in PDR. A risk prediction model was then constructed using the random forest algorithm. Model performance was evaluated based on accuracy, sensitivity, specificity, and area under the curve (AUC). Additionally, calibration curves and decision curve analysis (DCA) were used to assess clinical utility. All data analyses and modeling were performed in R (version 4.2.3).</p><p><strong>Results: </strong>The Boruta algorithm selected 12 significant predictive features. The random forest-based model achieved an accuracy of 90.74%, sensitivity of 82.14%, specificity of 93.75%, and an AUC of 0.87, demonstrating strong predictive performance. Calibration curves indicated reliable prediction probabilities within the 0.4-0.8 range. Decision curve analysis revealed substantial clinical net benefit across threshold probabilities of 0.2-0.8.</p><p><strong>Conclusion: </strong>The Boruta-guided random forest model developed in this study exhibits excellent predictive performance and clinical applicability for assessing NVG risk in PDR patients.</p>\",\"PeriodicalId\":12448,\"journal\":{\"name\":\"Frontiers in Cell and Developmental Biology\",\"volume\":\"13 \",\"pages\":\"1604832\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247303/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Cell and Developmental Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.3389/fcell.2025.1604832\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"CELL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cell and Developmental Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fcell.2025.1604832","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:新生血管性青光眼(NVG)是增殖性糖尿病视网膜病变(PDR)最严重的并发症之一,具有较高的致盲风险。建立有效的风险预测模型可以帮助临床医生早期识别高危患者,实施个性化的干预措施,降低视力损害的发生率。本研究旨在建立基于Boruta特征选择方法和随机森林算法的PDR患者NVG风险预测模型并进行评估,以提高临床预测性能。方法:回顾性研究纳入2019年1月至2024年12月在深圳眼科医院就诊的365例PDR患者,其中对照组(非NVG) 269例,NVG组(NVG) 96例。采用Boruta特征选择方法识别与PDR中NVG发展相关的关键特征。然后利用随机森林算法构建了风险预测模型。根据准确性、敏感性、特异性和曲线下面积(AUC)来评估模型的性能。此外,采用校准曲线和决策曲线分析(DCA)评估临床效用。所有数据分析和建模均在R(4.2.3版本)中进行。结果:Boruta算法选择了12个显著的预测特征。基于随机森林的模型准确率为90.74%,灵敏度为82.14%,特异性为93.75%,AUC为0.87,具有较强的预测效果。校正曲线显示,预测概率在0.4 ~ 0.8范围内可靠。决策曲线分析显示,在阈值概率为0.2-0.8的范围内,临床净收益显著。结论:本研究建立的boruta引导随机森林模型在评估PDR患者NVG风险方面具有良好的预测性能和临床适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A risk prediction model for neovascular glaucoma secondary to proliferative diabetic retinopathy based on Boruta feature selection and random forest.

Background: Neovascular glaucoma (NVG) is one of the most severe complications of proliferative diabetic retinopathy (PDR), carrying a high risk of blindness. Establishing an effective risk prediction model can assist clinicians in early identification of high-risk patients and implementing personalized interventions to reduce the incidence of vision impairment. This study aimed to develop and evaluate a risk prediction model for NVG in PDR patients based on the Boruta feature selection method and random forest algorithm to improve clinical predictive performance.

Methods: This retrospective study included 365 PDR patients treated at Shenzhen Eye Hospital between January 2019 and December 2024, comprising 269 controls (non-NVG) and 96 cases (NVG). The Boruta feature selection method was employed to identify key features associated with NVG development in PDR. A risk prediction model was then constructed using the random forest algorithm. Model performance was evaluated based on accuracy, sensitivity, specificity, and area under the curve (AUC). Additionally, calibration curves and decision curve analysis (DCA) were used to assess clinical utility. All data analyses and modeling were performed in R (version 4.2.3).

Results: The Boruta algorithm selected 12 significant predictive features. The random forest-based model achieved an accuracy of 90.74%, sensitivity of 82.14%, specificity of 93.75%, and an AUC of 0.87, demonstrating strong predictive performance. Calibration curves indicated reliable prediction probabilities within the 0.4-0.8 range. Decision curve analysis revealed substantial clinical net benefit across threshold probabilities of 0.2-0.8.

Conclusion: The Boruta-guided random forest model developed in this study exhibits excellent predictive performance and clinical applicability for assessing NVG risk in PDR patients.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Frontiers in Cell and Developmental Biology
Frontiers in Cell and Developmental Biology Biochemistry, Genetics and Molecular Biology-Cell Biology
CiteScore
9.70
自引率
3.60%
发文量
2531
审稿时长
12 weeks
期刊介绍: Frontiers in Cell and Developmental Biology is a broad-scope, interdisciplinary open-access journal, focusing on the fundamental processes of life, led by Prof Amanda Fisher and supported by a geographically diverse, high-quality editorial board. The journal welcomes submissions on a wide spectrum of cell and developmental biology, covering intracellular and extracellular dynamics, with sections focusing on signaling, adhesion, migration, cell death and survival and membrane trafficking. Additionally, the journal offers sections dedicated to the cutting edge of fundamental and translational research in molecular medicine and stem cell biology. With a collaborative, rigorous and transparent peer-review, the journal produces the highest scientific quality in both fundamental and applied research, and advanced article level metrics measure the real-time impact and influence of each publication.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信