Developing a Simple Non-Laboratory-Based Machine Learning Tool for Prediabetes Screening in a Target Population: A Proof-of-Concept Study.

IF 3.7 Q2 ENDOCRINOLOGY & METABOLISM
Tanja Fredensborg Holm, Thomas Kronborg, Morten Hasselstrøm Jensen, Stine Hangaard
{"title":"Developing a Simple Non-Laboratory-Based Machine Learning Tool for Prediabetes Screening in a Target Population: A Proof-of-Concept Study.","authors":"Tanja Fredensborg Holm, Thomas Kronborg, Morten Hasselstrøm Jensen, Stine Hangaard","doi":"10.1177/19322968251376380","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Progression from prediabetes to type 2 diabetes (T2D) can be delayed with early detection and intervention. Current detection methods, relying on costly blood glucose tests, limit widespread screening. Machine learning models offer the potential for non-laboratory-based tools. However, existing prediabetes detection models lack validation in their intended target populations. Thus, this study aimed to develop and validate a non-laboratory-based machine learning tool for prediabetes detection in a specific target population.</p><p><strong>Methods: </strong>Based on 501 adults from a prediabetes screening project, a decision tree model was developed. Twelve potential non-laboratory-based features were extracted. The target variable was categorized into prediabetes (hemoglobin A1c [HbA<sub>1c</sub>] ≥39 mmol/mol and <48 mmol/mol) and normoglycemia (HbA<sub>1c</sub> <39 mmol/mol). The data set was divided into 70% for training and 30% for validation, and forward feature selection was used to identify the most relevant features.</p><p><strong>Results: </strong>Out of 501 participants, 88 were identified with prediabetes. The mean age and body mass index (BMI) were approximately 50 years and 27 in both the training and validation sets. Forward selection identified age and waist circumference as the most important features to include in the model. The model achieved an area under the receiver operating characteristic curve (ROC AUC) of 0.8297 and 0.7961 on the training and validation sets.</p><p><strong>Conclusion: </strong>A machine learning screening tool using age and waist circumference was developed with promising results. Its simplicity, by only requiring two non-laboratory features, allows for easy implementation. However, to verify the model's generalizability and external validity, it needs to be evaluated using additional data.</p>","PeriodicalId":15475,"journal":{"name":"Journal of Diabetes Science and Technology","volume":" ","pages":"19322968251376380"},"PeriodicalIF":3.7000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12460283/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Diabetes Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/19322968251376380","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Progression from prediabetes to type 2 diabetes (T2D) can be delayed with early detection and intervention. Current detection methods, relying on costly blood glucose tests, limit widespread screening. Machine learning models offer the potential for non-laboratory-based tools. However, existing prediabetes detection models lack validation in their intended target populations. Thus, this study aimed to develop and validate a non-laboratory-based machine learning tool for prediabetes detection in a specific target population.

Methods: Based on 501 adults from a prediabetes screening project, a decision tree model was developed. Twelve potential non-laboratory-based features were extracted. The target variable was categorized into prediabetes (hemoglobin A1c [HbA1c] ≥39 mmol/mol and <48 mmol/mol) and normoglycemia (HbA1c <39 mmol/mol). The data set was divided into 70% for training and 30% for validation, and forward feature selection was used to identify the most relevant features.

Results: Out of 501 participants, 88 were identified with prediabetes. The mean age and body mass index (BMI) were approximately 50 years and 27 in both the training and validation sets. Forward selection identified age and waist circumference as the most important features to include in the model. The model achieved an area under the receiver operating characteristic curve (ROC AUC) of 0.8297 and 0.7961 on the training and validation sets.

Conclusion: A machine learning screening tool using age and waist circumference was developed with promising results. Its simplicity, by only requiring two non-laboratory features, allows for easy implementation. However, to verify the model's generalizability and external validity, it needs to be evaluated using additional data.

开发一种简单的非实验室机器学习工具,用于目标人群的糖尿病前期筛查:一项概念验证研究。
背景:早期发现和干预可以延缓糖尿病前期向2型糖尿病(T2D)的进展。目前的检测方法依赖于昂贵的血糖测试,限制了广泛的筛查。机器学习模型为非实验室工具提供了潜力。然而,现有的糖尿病前期检测模型在其预期的目标人群中缺乏验证。因此,本研究旨在开发和验证一种非实验室的机器学习工具,用于在特定目标人群中检测糖尿病前期。方法:基于前驱糖尿病筛查项目的501名成年人,建立决策树模型。提取了12个潜在的非实验室特征。目标变量被分类为糖尿病前期(血红蛋白A1c [HbA1c]≥39 mmol/mol和1c)。结果:在501名参与者中,88人被确定为糖尿病前期。训练组和验证组的平均年龄和体重指数(BMI)分别约为50岁和27岁。前向选择将年龄和腰围确定为模型中最重要的特征。该模型在训练集和验证集上的受试者工作特征曲线下面积(ROC AUC)分别为0.8297和0.7961。结论:开发了一种基于年龄和腰围的机器学习筛选工具,效果良好。它的简单性,只需要两个非实验室功能,允许容易实现。然而,为了验证模型的泛化性和外部有效性,需要使用额外的数据对其进行评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Diabetes Science and Technology
Journal of Diabetes Science and Technology Medicine-Internal Medicine
CiteScore
7.50
自引率
12.00%
发文量
148
期刊介绍: The Journal of Diabetes Science and Technology (JDST) is a bi-monthly, peer-reviewed scientific journal published by the Diabetes Technology Society. JDST covers scientific and clinical aspects of diabetes technology including glucose monitoring, insulin and metabolic peptide delivery, the artificial pancreas, digital health, precision medicine, social media, cybersecurity, software for modeling, physiologic monitoring, technology for managing obesity, and diagnostic tests of glycation. The journal also covers the development and use of mobile applications and wireless communication, as well as bioengineered tools such as MEMS, new biomaterials, and nanotechnology to develop new sensors. Articles in JDST cover both basic research and clinical applications of technologies being developed to help people with diabetes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信