Craig Vandervelden, Brent Lockee, Mitchell Barnes, Erin M Tallon, David D Williams, Anna Kahkoska, Angelica Cristello Sarteau, Susana R Patton, Rona Y Sonabend, Jacob D Kohlenberg, Mark A Clements
{"title":"Predicting and Ranking Diabetic Ketoacidosis Risk among Youth with Type 1 Diabetes with a Clinic-to-Clinic Transferrable Machine Learning Model.","authors":"Craig Vandervelden, Brent Lockee, Mitchell Barnes, Erin M Tallon, David D Williams, Anna Kahkoska, Angelica Cristello Sarteau, Susana R Patton, Rona Y Sonabend, Jacob D Kohlenberg, Mark A Clements","doi":"10.1089/dia.2024.0484","DOIUrl":null,"url":null,"abstract":"<p><p><b><i>Aim:</i></b> To use electronic health record (EHR) data to develop a scalable and transferrable model to predict 6-month risk for diabetic ketoacidosis (DKA)-related hospitalization or emergency care in youth with type 1 diabetes (T1D). <b><i>Method:</i></b> To achieve a sharable predictive model, we engineered features using EHR data mapped to the T1D Exchange Quality Improvement Collaborative's (T1DX-QI) data schema used by 60+ U.S. diabetes centers and chose a compact set of 15 features (e.g., demographics, factors related to diabetes management, etc.) to yield \"explainable AI\" predictions for DKA risk on a 6-month horizon. We used an ensemble of gradient-boosted, tree-based models trained on data collected from September 1, 2017 to November 1, 2022 (3097 unique patients; 24,638 clinical encounters) from a tertiary care pediatric diabetes clinic network in the Midwest USA. <b><i>Results:</i></b> We rank-ordered the top 10, 25, 50, and 100 highest-risk youth in an out-of-sample testing set, which yielded an average precision of 0.96, 0.81, 0.75, and 0.70, respectively. The lift of the model (relative to random selection) for the top 100 individuals is 19. The model identified average time between DKA episodes, time since the last DKA episode, and T1D duration as the top three features for predicting DKA risk. <b><i>Conclusions:</i></b> Our DKA risk model effectively predicts youths' relative risk of experiencing hospitalization for DKA and is readily deployable to other diabetes centers that map diabetes data to the T1DX-QI schema. This model may facilitate the development of targeted interventions for youths at the highest risk for DKA. Future work will add novel features such as device data, social determinants of health, and diabetes self-management behaviors.</p>","PeriodicalId":11159,"journal":{"name":"Diabetes technology & therapeutics","volume":" ","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diabetes technology & therapeutics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1089/dia.2024.0484","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0
Abstract
Aim: To use electronic health record (EHR) data to develop a scalable and transferrable model to predict 6-month risk for diabetic ketoacidosis (DKA)-related hospitalization or emergency care in youth with type 1 diabetes (T1D). Method: To achieve a sharable predictive model, we engineered features using EHR data mapped to the T1D Exchange Quality Improvement Collaborative's (T1DX-QI) data schema used by 60+ U.S. diabetes centers and chose a compact set of 15 features (e.g., demographics, factors related to diabetes management, etc.) to yield "explainable AI" predictions for DKA risk on a 6-month horizon. We used an ensemble of gradient-boosted, tree-based models trained on data collected from September 1, 2017 to November 1, 2022 (3097 unique patients; 24,638 clinical encounters) from a tertiary care pediatric diabetes clinic network in the Midwest USA. Results: We rank-ordered the top 10, 25, 50, and 100 highest-risk youth in an out-of-sample testing set, which yielded an average precision of 0.96, 0.81, 0.75, and 0.70, respectively. The lift of the model (relative to random selection) for the top 100 individuals is 19. The model identified average time between DKA episodes, time since the last DKA episode, and T1D duration as the top three features for predicting DKA risk. Conclusions: Our DKA risk model effectively predicts youths' relative risk of experiencing hospitalization for DKA and is readily deployable to other diabetes centers that map diabetes data to the T1DX-QI schema. This model may facilitate the development of targeted interventions for youths at the highest risk for DKA. Future work will add novel features such as device data, social determinants of health, and diabetes self-management behaviors.
期刊介绍:
Diabetes Technology & Therapeutics is the only peer-reviewed journal providing healthcare professionals with information on new devices, drugs, drug delivery systems, and software for managing patients with diabetes. This leading international journal delivers practical information and comprehensive coverage of cutting-edge technologies and therapeutics in the field, and each issue highlights new pharmacological and device developments to optimize patient care.