2051-LB:使用公开索赔数据进行症状前1型糖尿病检测的预测建模

IF 7.5 1区 医学 Q1 ENDOCRINOLOGY & METABOLISM
Diabetes Pub Date : 2025-06-20 DOI:10.2337/db25-2051-lb
DANIEL EINOR, OLEKSANDR BUIKO, KENI C.S. LEE, LUKAS ADAMEK, BRANDON RUFINO, INNA YAROVA, JARED JOSLEYN
{"title":"2051-LB:使用公开索赔数据进行症状前1型糖尿病检测的预测建模","authors":"DANIEL EINOR, OLEKSANDR BUIKO, KENI C.S. LEE, LUKAS ADAMEK, BRANDON RUFINO, INNA YAROVA, JARED JOSLEYN","doi":"10.2337/db25-2051-lb","DOIUrl":null,"url":null,"abstract":"Introduction and Objective: Type 1 Diabetes mellitus (T1D) progresses through distinct stages before clinical diagnosis, with early stages being asymptomatic. Integrating pre-symptomatic T1D detection can enable the proactive identification of patients at higher risk and facilitate earlier interventions. This study employed machine learning approaches on a U.S. open claims dataset to identify people at risk of presymptomatic T1D. Methods: A retrospective cohort analysis was conducted on 89,453 patients selected based on the modified Klompas algorithm with initial T1D diagnosis or insulin prescription between Jan 2016 and Dec 2023. The study utilized tree-based (XGBoost) and transformer-based (BERT) models. The performance was evaluated using Precision, Recall, Bayes factor, and Number Needed to Test (NNT). Feature importance was assessed using Shapley values. Results: BERT was found to have better performance than tree-based models featuring Precision of 0.09 vs 0.05; Recall of 0.8 vs 0.72; Bayes factor of 19.92 vs 10.75; NNT values of 11.6 vs 22.7, respectfully. In both models, elevated risk of T1D was most associated with lower age and male gender. Tree-based model showed that elevated occurrence of diagnoses related to digestive system and thyroid tests were also associated with presymptomatic T1D. Conclusion: Our findings suggest the potential for using claims records to identify presymptomatic T1D individuals. The models improved the detection rate from the standard prevalence of 1:200 to less than 1:23, significantly enhancing early detection efforts. Existing demographic, frequent digestive system diagnoses and routine endocrine tests could signal T1D risk. Utilizing the recency and frequency of medical events may enhance the predictability of early T1D onset. Such models could enable more targeted screening and access to timely interventions for improved clinical outcomes. Disclosure D. Einor: None. O. Buiko: None. K.C.S. Lee: Employee; Sanofi. L. Adamek: Employee; Sanofi. B. Rufino: Employee; Sanofi. I. Yarova: None. J. Josleyn: Employee; Sanofi.","PeriodicalId":11376,"journal":{"name":"Diabetes","volume":"7 1","pages":""},"PeriodicalIF":7.5000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"2051-LB: Predictive Modeling for Presymptomatic Type 1 Diabetes Detection Using Open Claims Data\",\"authors\":\"DANIEL EINOR, OLEKSANDR BUIKO, KENI C.S. LEE, LUKAS ADAMEK, BRANDON RUFINO, INNA YAROVA, JARED JOSLEYN\",\"doi\":\"10.2337/db25-2051-lb\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction and Objective: Type 1 Diabetes mellitus (T1D) progresses through distinct stages before clinical diagnosis, with early stages being asymptomatic. Integrating pre-symptomatic T1D detection can enable the proactive identification of patients at higher risk and facilitate earlier interventions. This study employed machine learning approaches on a U.S. open claims dataset to identify people at risk of presymptomatic T1D. Methods: A retrospective cohort analysis was conducted on 89,453 patients selected based on the modified Klompas algorithm with initial T1D diagnosis or insulin prescription between Jan 2016 and Dec 2023. The study utilized tree-based (XGBoost) and transformer-based (BERT) models. The performance was evaluated using Precision, Recall, Bayes factor, and Number Needed to Test (NNT). Feature importance was assessed using Shapley values. Results: BERT was found to have better performance than tree-based models featuring Precision of 0.09 vs 0.05; Recall of 0.8 vs 0.72; Bayes factor of 19.92 vs 10.75; NNT values of 11.6 vs 22.7, respectfully. In both models, elevated risk of T1D was most associated with lower age and male gender. Tree-based model showed that elevated occurrence of diagnoses related to digestive system and thyroid tests were also associated with presymptomatic T1D. Conclusion: Our findings suggest the potential for using claims records to identify presymptomatic T1D individuals. The models improved the detection rate from the standard prevalence of 1:200 to less than 1:23, significantly enhancing early detection efforts. Existing demographic, frequent digestive system diagnoses and routine endocrine tests could signal T1D risk. Utilizing the recency and frequency of medical events may enhance the predictability of early T1D onset. Such models could enable more targeted screening and access to timely interventions for improved clinical outcomes. Disclosure D. Einor: None. O. Buiko: None. K.C.S. Lee: Employee; Sanofi. L. Adamek: Employee; Sanofi. B. Rufino: Employee; Sanofi. I. Yarova: None. J. Josleyn: Employee; Sanofi.\",\"PeriodicalId\":11376,\"journal\":{\"name\":\"Diabetes\",\"volume\":\"7 1\",\"pages\":\"\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Diabetes\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2337/db25-2051-lb\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENDOCRINOLOGY & METABOLISM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diabetes","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2337/db25-2051-lb","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

摘要

简介与目的:1型糖尿病(Type 1 Diabetes, T1D)在临床诊断前有不同的分期,早期无症状。综合症状前T1D检测可以主动识别高风险患者并促进早期干预。这项研究在美国公开索赔数据集上使用机器学习方法来识别有症状前T1D风险的人。方法:对2016年1月至2023年12月期间,根据改进的Klompas算法选择的89,453例首次诊断为T1D或处方胰岛素的患者进行回顾性队列分析。该研究使用了基于树(XGBoost)和基于变压器(BERT)的模型。使用Precision, Recall, Bayes factor和Number to Test (NNT)来评估性能。使用Shapley值评估特征重要性。结果:BERT模型的准确率分别为0.09和0.05,优于基于树的模型;召回率0.8 vs 0.72;贝叶斯因子为19.92 vs 10.75;NNT值为11.6 vs 22.7。在这两种模型中,T1D的风险升高与年龄较小和男性性别最相关。基于树的模型显示,消化系统和甲状腺检查相关诊断的发生率升高也与症状前T1D相关。结论:我们的研究结果表明,使用索赔记录识别症状前T1D个体的潜力。这些模型将检出率从1:20 00的标准患病率提高到1:23以下,显著提高了早期检测工作。现有的人口统计学、频繁的消化系统诊断和常规内分泌检查可能预示着T1D的风险。利用医疗事件的近代性和频率可以提高T1D早期发病的可预测性。这种模型可以使更有针对性的筛查和获得及时的干预措施,以改善临床结果。D.埃诺:没有。O. Buiko:没有。李家诚:雇员;赛诺菲。L. Adamek:雇员;赛诺菲。B. Rufino:员工;赛诺菲。亚罗娃:没有。乔斯林:雇员;赛诺菲。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
2051-LB: Predictive Modeling for Presymptomatic Type 1 Diabetes Detection Using Open Claims Data
Introduction and Objective: Type 1 Diabetes mellitus (T1D) progresses through distinct stages before clinical diagnosis, with early stages being asymptomatic. Integrating pre-symptomatic T1D detection can enable the proactive identification of patients at higher risk and facilitate earlier interventions. This study employed machine learning approaches on a U.S. open claims dataset to identify people at risk of presymptomatic T1D. Methods: A retrospective cohort analysis was conducted on 89,453 patients selected based on the modified Klompas algorithm with initial T1D diagnosis or insulin prescription between Jan 2016 and Dec 2023. The study utilized tree-based (XGBoost) and transformer-based (BERT) models. The performance was evaluated using Precision, Recall, Bayes factor, and Number Needed to Test (NNT). Feature importance was assessed using Shapley values. Results: BERT was found to have better performance than tree-based models featuring Precision of 0.09 vs 0.05; Recall of 0.8 vs 0.72; Bayes factor of 19.92 vs 10.75; NNT values of 11.6 vs 22.7, respectfully. In both models, elevated risk of T1D was most associated with lower age and male gender. Tree-based model showed that elevated occurrence of diagnoses related to digestive system and thyroid tests were also associated with presymptomatic T1D. Conclusion: Our findings suggest the potential for using claims records to identify presymptomatic T1D individuals. The models improved the detection rate from the standard prevalence of 1:200 to less than 1:23, significantly enhancing early detection efforts. Existing demographic, frequent digestive system diagnoses and routine endocrine tests could signal T1D risk. Utilizing the recency and frequency of medical events may enhance the predictability of early T1D onset. Such models could enable more targeted screening and access to timely interventions for improved clinical outcomes. Disclosure D. Einor: None. O. Buiko: None. K.C.S. Lee: Employee; Sanofi. L. Adamek: Employee; Sanofi. B. Rufino: Employee; Sanofi. I. Yarova: None. J. Josleyn: Employee; Sanofi.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Diabetes
Diabetes 医学-内分泌学与代谢
CiteScore
12.50
自引率
2.60%
发文量
1968
审稿时长
1 months
期刊介绍: Diabetes is a scientific journal that publishes original research exploring the physiological and pathophysiological aspects of diabetes mellitus. We encourage submissions of manuscripts pertaining to laboratory, animal, or human research, covering a wide range of topics. Our primary focus is on investigative reports investigating various aspects such as the development and progression of diabetes, along with its associated complications. We also welcome studies delving into normal and pathological pancreatic islet function and intermediary metabolism, as well as exploring the mechanisms of drug and hormone action from a pharmacological perspective. Additionally, we encourage submissions that delve into the biochemical and molecular aspects of both normal and abnormal biological processes. However, it is important to note that we do not publish studies relating to diabetes education or the application of accepted therapeutic and diagnostic approaches to patients with diabetes mellitus. Our aim is to provide a platform for research that contributes to advancing our understanding of the underlying mechanisms and processes of diabetes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信