A machine learning-based prediction of diabetic retinopathy using the Korea national health and nutrition examination survey (2008-2012, 2017-2021).

IF 3.1 3区 医学 Q1 MEDICINE, GENERAL & INTERNAL
Frontiers in Medicine Pub Date : 2025-05-30 eCollection Date: 2025-01-01 DOI:10.3389/fmed.2025.1542860
Min Seok Kim, Young Wook Choi, Borghare Shubham Prakash, Youngju Lee, Soo Lim, Se Joon Woo
{"title":"A machine learning-based prediction of diabetic retinopathy using the Korea national health and nutrition examination survey (2008-2012, 2017-2021).","authors":"Min Seok Kim, Young Wook Choi, Borghare Shubham Prakash, Youngju Lee, Soo Lim, Se Joon Woo","doi":"10.3389/fmed.2025.1542860","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Machine learning technology that uses available clinical data to predict diabetic retinopathy (DR) can be highly valuable in medical settings where fundus cameras are not accessible.</p><p><strong>Objective: </strong>This study aimed to develop and compare machine learning algorithms for predicting DR without fundus image.</p><p><strong>Methods: </strong>We used data from Korea National Health and Nutrition Examination Survey (2008-2012 and 2017-2021) and enrolled individuals aged ≥ 20 years with diabetes who received fundus examination. Predictive models for DR were developed using logistic regression and three machine learning algorithms: extreme gradient boosting, decision tree, and random forest. Model performance was evaluated using area under the receiver operating characteristic curve (AUC) and accuracy for the diagnosis of DR, and feature importance was determined using Shapley Additive Explanations (SHAP).</p><p><strong>Results: </strong>Among the 3,026 diabetic participants (male, 50.7%; mean age, 63.7 ± 10.5 years), 671 (22.2%) had DR. The random forest model, using 16 variables, achieved the highest AUC of 0.748 (95% confidence interval, 0.705-0.790) with a sensitivity 0.669, specificity of 0.729 and an accuracy of 0.715. As interpreted by SHAP, HbA1c, fasting glucose levels, duration of diabetes, and body mass index were identified as common key determinants influencing the model's outcomes.</p><p><strong>Conclusion: </strong>The DR prediction models using machine learning techniques demonstrated reliable performance even without fundus imaging, with the random forest model showing particularly strong results. These models could assist in managing DR by identifying high-risk patients, enabling timely ophthalmic referrals.</p>","PeriodicalId":12488,"journal":{"name":"Frontiers in Medicine","volume":"12 ","pages":"1542860"},"PeriodicalIF":3.1000,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12163237/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fmed.2025.1542860","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Machine learning technology that uses available clinical data to predict diabetic retinopathy (DR) can be highly valuable in medical settings where fundus cameras are not accessible.

Objective: This study aimed to develop and compare machine learning algorithms for predicting DR without fundus image.

Methods: We used data from Korea National Health and Nutrition Examination Survey (2008-2012 and 2017-2021) and enrolled individuals aged ≥ 20 years with diabetes who received fundus examination. Predictive models for DR were developed using logistic regression and three machine learning algorithms: extreme gradient boosting, decision tree, and random forest. Model performance was evaluated using area under the receiver operating characteristic curve (AUC) and accuracy for the diagnosis of DR, and feature importance was determined using Shapley Additive Explanations (SHAP).

Results: Among the 3,026 diabetic participants (male, 50.7%; mean age, 63.7 ± 10.5 years), 671 (22.2%) had DR. The random forest model, using 16 variables, achieved the highest AUC of 0.748 (95% confidence interval, 0.705-0.790) with a sensitivity 0.669, specificity of 0.729 and an accuracy of 0.715. As interpreted by SHAP, HbA1c, fasting glucose levels, duration of diabetes, and body mass index were identified as common key determinants influencing the model's outcomes.

Conclusion: The DR prediction models using machine learning techniques demonstrated reliable performance even without fundus imaging, with the random forest model showing particularly strong results. These models could assist in managing DR by identifying high-risk patients, enabling timely ophthalmic referrals.

使用韩国国家健康和营养检查调查(2008-2012,2017-2021)的基于机器学习的糖尿病视网膜病变预测。
背景:利用现有临床数据预测糖尿病视网膜病变(DR)的机器学习技术在无法获得眼底相机的医疗环境中具有很高的价值。目的:本研究旨在开发和比较无眼底图像预测DR的机器学习算法。方法:我们使用韩国国家健康与营养调查(2008-2012年和2017-2021年)的数据,并招募了年龄≥20岁并接受眼底检查的糖尿病患者。DR的预测模型使用逻辑回归和三种机器学习算法:极端梯度增强、决策树和随机森林。使用受试者工作特征曲线下面积(AUC)和DR诊断准确率评估模型性能,使用Shapley加性解释(SHAP)确定特征重要性。结果:在3026名糖尿病参与者中(男性50.7%;随机森林模型采用16个变量,AUC最高,为0.748(95%可信区间为0.705 ~ 0.790),灵敏度为0.669,特异性为0.729,准确率为0.715。根据SHAP的解释,HbA1c、空腹血糖水平、糖尿病持续时间和体重指数被确定为影响模型结果的常见关键决定因素。结论:使用机器学习技术的DR预测模型即使在没有眼底成像的情况下也表现出可靠的性能,其中随机森林模型显示出特别强的结果。这些模型可以通过识别高风险患者,及时进行眼科转诊,从而帮助管理DR。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Frontiers in Medicine
Frontiers in Medicine Medicine-General Medicine
CiteScore
5.10
自引率
5.10%
发文量
3710
审稿时长
12 weeks
期刊介绍: Frontiers in Medicine publishes rigorously peer-reviewed research linking basic research to clinical practice and patient care, as well as translating scientific advances into new therapies and diagnostic tools. Led by an outstanding Editorial Board of international experts, this multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide. In addition to papers that provide a link between basic research and clinical practice, a particular emphasis is given to studies that are directly relevant to patient care. In this spirit, the journal publishes the latest research results and medical knowledge that facilitate the translation of scientific advances into new therapies or diagnostic tools. The full listing of the Specialty Sections represented by Frontiers in Medicine is as listed below. As well as the established medical disciplines, Frontiers in Medicine is launching new sections that together will facilitate - the use of patient-reported outcomes under real world conditions - the exploitation of big data and the use of novel information and communication tools in the assessment of new medicines - the scientific bases for guidelines and decisions from regulatory authorities - access to medicinal products and medical devices worldwide - addressing the grand health challenges around the world
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信