Tanja Fredensborg Holm, Thomas Kronborg, Morten Hasselstrøm Jensen, Stine Hangaard
{"title":"开发一种简单的非实验室机器学习工具,用于目标人群的糖尿病前期筛查:一项概念验证研究。","authors":"Tanja Fredensborg Holm, Thomas Kronborg, Morten Hasselstrøm Jensen, Stine Hangaard","doi":"10.1177/19322968251376380","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Progression from prediabetes to type 2 diabetes (T2D) can be delayed with early detection and intervention. Current detection methods, relying on costly blood glucose tests, limit widespread screening. Machine learning models offer the potential for non-laboratory-based tools. However, existing prediabetes detection models lack validation in their intended target populations. Thus, this study aimed to develop and validate a non-laboratory-based machine learning tool for prediabetes detection in a specific target population.</p><p><strong>Methods: </strong>Based on 501 adults from a prediabetes screening project, a decision tree model was developed. Twelve potential non-laboratory-based features were extracted. The target variable was categorized into prediabetes (hemoglobin A1c [HbA<sub>1c</sub>] ≥39 mmol/mol and <48 mmol/mol) and normoglycemia (HbA<sub>1c</sub> <39 mmol/mol). The data set was divided into 70% for training and 30% for validation, and forward feature selection was used to identify the most relevant features.</p><p><strong>Results: </strong>Out of 501 participants, 88 were identified with prediabetes. The mean age and body mass index (BMI) were approximately 50 years and 27 in both the training and validation sets. Forward selection identified age and waist circumference as the most important features to include in the model. The model achieved an area under the receiver operating characteristic curve (ROC AUC) of 0.8297 and 0.7961 on the training and validation sets.</p><p><strong>Conclusion: </strong>A machine learning screening tool using age and waist circumference was developed with promising results. Its simplicity, by only requiring two non-laboratory features, allows for easy implementation. However, to verify the model's generalizability and external validity, it needs to be evaluated using additional data.</p>","PeriodicalId":15475,"journal":{"name":"Journal of Diabetes Science and Technology","volume":" ","pages":"19322968251376380"},"PeriodicalIF":3.7000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12460283/pdf/","citationCount":"0","resultStr":"{\"title\":\"Developing a Simple Non-Laboratory-Based Machine Learning Tool for Prediabetes Screening in a Target Population: A Proof-of-Concept Study.\",\"authors\":\"Tanja Fredensborg Holm, Thomas Kronborg, Morten Hasselstrøm Jensen, Stine Hangaard\",\"doi\":\"10.1177/19322968251376380\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Progression from prediabetes to type 2 diabetes (T2D) can be delayed with early detection and intervention. Current detection methods, relying on costly blood glucose tests, limit widespread screening. Machine learning models offer the potential for non-laboratory-based tools. However, existing prediabetes detection models lack validation in their intended target populations. Thus, this study aimed to develop and validate a non-laboratory-based machine learning tool for prediabetes detection in a specific target population.</p><p><strong>Methods: </strong>Based on 501 adults from a prediabetes screening project, a decision tree model was developed. Twelve potential non-laboratory-based features were extracted. The target variable was categorized into prediabetes (hemoglobin A1c [HbA<sub>1c</sub>] ≥39 mmol/mol and <48 mmol/mol) and normoglycemia (HbA<sub>1c</sub> <39 mmol/mol). The data set was divided into 70% for training and 30% for validation, and forward feature selection was used to identify the most relevant features.</p><p><strong>Results: </strong>Out of 501 participants, 88 were identified with prediabetes. The mean age and body mass index (BMI) were approximately 50 years and 27 in both the training and validation sets. Forward selection identified age and waist circumference as the most important features to include in the model. The model achieved an area under the receiver operating characteristic curve (ROC AUC) of 0.8297 and 0.7961 on the training and validation sets.</p><p><strong>Conclusion: </strong>A machine learning screening tool using age and waist circumference was developed with promising results. Its simplicity, by only requiring two non-laboratory features, allows for easy implementation. However, to verify the model's generalizability and external validity, it needs to be evaluated using additional data.</p>\",\"PeriodicalId\":15475,\"journal\":{\"name\":\"Journal of Diabetes Science and Technology\",\"volume\":\" \",\"pages\":\"19322968251376380\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12460283/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Diabetes Science and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/19322968251376380\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENDOCRINOLOGY & METABOLISM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Diabetes Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/19322968251376380","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
Developing a Simple Non-Laboratory-Based Machine Learning Tool for Prediabetes Screening in a Target Population: A Proof-of-Concept Study.
Background: Progression from prediabetes to type 2 diabetes (T2D) can be delayed with early detection and intervention. Current detection methods, relying on costly blood glucose tests, limit widespread screening. Machine learning models offer the potential for non-laboratory-based tools. However, existing prediabetes detection models lack validation in their intended target populations. Thus, this study aimed to develop and validate a non-laboratory-based machine learning tool for prediabetes detection in a specific target population.
Methods: Based on 501 adults from a prediabetes screening project, a decision tree model was developed. Twelve potential non-laboratory-based features were extracted. The target variable was categorized into prediabetes (hemoglobin A1c [HbA1c] ≥39 mmol/mol and <48 mmol/mol) and normoglycemia (HbA1c <39 mmol/mol). The data set was divided into 70% for training and 30% for validation, and forward feature selection was used to identify the most relevant features.
Results: Out of 501 participants, 88 were identified with prediabetes. The mean age and body mass index (BMI) were approximately 50 years and 27 in both the training and validation sets. Forward selection identified age and waist circumference as the most important features to include in the model. The model achieved an area under the receiver operating characteristic curve (ROC AUC) of 0.8297 and 0.7961 on the training and validation sets.
Conclusion: A machine learning screening tool using age and waist circumference was developed with promising results. Its simplicity, by only requiring two non-laboratory features, allows for easy implementation. However, to verify the model's generalizability and external validity, it needs to be evaluated using additional data.
期刊介绍:
The Journal of Diabetes Science and Technology (JDST) is a bi-monthly, peer-reviewed scientific journal published by the Diabetes Technology Society. JDST covers scientific and clinical aspects of diabetes technology including glucose monitoring, insulin and metabolic peptide delivery, the artificial pancreas, digital health, precision medicine, social media, cybersecurity, software for modeling, physiologic monitoring, technology for managing obesity, and diagnostic tests of glycation. The journal also covers the development and use of mobile applications and wireless communication, as well as bioengineered tools such as MEMS, new biomaterials, and nanotechnology to develop new sensors. Articles in JDST cover both basic research and clinical applications of technologies being developed to help people with diabetes.