Alyce S Adams, Catherine Lee, Gabriel Escobar, Elizabeth A Bayliss, Brian Callaghan, Michael Horberg, Julie A Schmittdiel, Connie Trinacty, Lisa K Gilliam, Eileen Kim, Nima S Hejazi, Lin Ma, Romain Neugebauer
{"title":"Estimating the Risk of Lower Extremity Complications in Adults Newly Diagnosed With Diabetic Polyneuropathy: Retrospective Cohort Study.","authors":"Alyce S Adams, Catherine Lee, Gabriel Escobar, Elizabeth A Bayliss, Brian Callaghan, Michael Horberg, Julie A Schmittdiel, Connie Trinacty, Lisa K Gilliam, Eileen Kim, Nima S Hejazi, Lin Ma, Romain Neugebauer","doi":"10.2196/60141","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Diabetes-related lower extremity complications, such as foot ulceration and amputation, are on the rise, currently affecting nearly 131 million people worldwide. Methods for early detection of individuals at high risk remain elusive. While data-driven diabetic polyneuropathy algorithms exist, high-performing, clinically useful tools to assess risk are needed to improve clinical care.</p><p><strong>Objective: </strong>This study aimed to develop an electronic medical record-based machine learning algorithm that would predict lower extremity complications.</p><p><strong>Methods: </strong>We conducted a retrospective longitudinal cohort study to predict the risk of lower extremity complications within 24 months of an initial diagnosis of diabetic polyneuropathy. From an initial cohort of 468,162 individuals with at least 1 diagnosis of diabetic polyneuropathy at one of 2 multispecialty health care systems (based in northern California and Colorado) between April 2012 and December 2016, we created an analytic cohort of 48,209 adults with continuous enrollment, who were newly diagnosed with no evidence of end-of-life care. The outcome was any lower extremity complication, including foot ulceration, osteomyelitis, gangrene, or lower extremity amputation. We randomly split the data into training (38,569/48209; 80%) and testing (9,640/48209; 20%) datasets. In the training dataset, we used super Learner (SL), an ensemble learning method that employs cross-validation and combines multiple candidate risk predictors, into a single risk predictor. We evaluated the performance of the SL risk predictor in the testing dataset using the receiver operating characteristic curve and a calibration plot.</p><p><strong>Results: </strong>Of the 48,209 individuals in the cohort, 2327 developed a lower extremity complication during follow-up. The SL risk estimator exhibited good discrimination (AUC=0.845, 95% CI 0.826-0.863) and calibration. A modified version of our SL algorithm, simplified to facilitate real-world adoption, had only slightly reduced discrimination (AUC=0.817, 95%CI 0.797-0.837). The modified version slightly outperformed the naïve logistic regression model (AUC=0.804, 95% CI 0.783-0.825) in terms of precision gained relative to the frequency of alerts and number of patients that needed to be evaluated.</p><p><strong>Conclusions: </strong>We have built a machine learning-based risk estimator with the potential to improve clinical detection of diabetic patients at high risk for lower extremity complications at the time of an initial diabetic polyneuropathy diagnosis. The algorithm exhibited good discriminant validity and calibration using only data from the electronic medical record. Additional research will be needed to identify optimal contexts and strategies for maximizing algorithmic fairness in both interpretation and deployment.</p>","PeriodicalId":52371,"journal":{"name":"JMIR Diabetes","volume":"10 ","pages":"e60141"},"PeriodicalIF":2.6000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12140504/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Diabetes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/60141","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Diabetes-related lower extremity complications, such as foot ulceration and amputation, are on the rise, currently affecting nearly 131 million people worldwide. Methods for early detection of individuals at high risk remain elusive. While data-driven diabetic polyneuropathy algorithms exist, high-performing, clinically useful tools to assess risk are needed to improve clinical care.
Objective: This study aimed to develop an electronic medical record-based machine learning algorithm that would predict lower extremity complications.
Methods: We conducted a retrospective longitudinal cohort study to predict the risk of lower extremity complications within 24 months of an initial diagnosis of diabetic polyneuropathy. From an initial cohort of 468,162 individuals with at least 1 diagnosis of diabetic polyneuropathy at one of 2 multispecialty health care systems (based in northern California and Colorado) between April 2012 and December 2016, we created an analytic cohort of 48,209 adults with continuous enrollment, who were newly diagnosed with no evidence of end-of-life care. The outcome was any lower extremity complication, including foot ulceration, osteomyelitis, gangrene, or lower extremity amputation. We randomly split the data into training (38,569/48209; 80%) and testing (9,640/48209; 20%) datasets. In the training dataset, we used super Learner (SL), an ensemble learning method that employs cross-validation and combines multiple candidate risk predictors, into a single risk predictor. We evaluated the performance of the SL risk predictor in the testing dataset using the receiver operating characteristic curve and a calibration plot.
Results: Of the 48,209 individuals in the cohort, 2327 developed a lower extremity complication during follow-up. The SL risk estimator exhibited good discrimination (AUC=0.845, 95% CI 0.826-0.863) and calibration. A modified version of our SL algorithm, simplified to facilitate real-world adoption, had only slightly reduced discrimination (AUC=0.817, 95%CI 0.797-0.837). The modified version slightly outperformed the naïve logistic regression model (AUC=0.804, 95% CI 0.783-0.825) in terms of precision gained relative to the frequency of alerts and number of patients that needed to be evaluated.
Conclusions: We have built a machine learning-based risk estimator with the potential to improve clinical detection of diabetic patients at high risk for lower extremity complications at the time of an initial diabetic polyneuropathy diagnosis. The algorithm exhibited good discriminant validity and calibration using only data from the electronic medical record. Additional research will be needed to identify optimal contexts and strategies for maximizing algorithmic fairness in both interpretation and deployment.
背景:与糖尿病相关的下肢并发症,如足部溃疡和截肢,呈上升趋势,目前影响全球近1.31亿人。早期发现高危个体的方法仍然难以捉摸。虽然存在数据驱动的糖尿病多发性神经病变算法,但需要高效、临床有用的工具来评估风险,以改善临床护理。目的:本研究旨在开发一种基于电子病历的机器学习算法来预测下肢并发症。方法:我们进行了一项回顾性纵向队列研究,以预测首次诊断为糖尿病多发性神经病变的患者在24个月内下肢并发症的风险。从2012年4月至2016年12月期间在2个多专业医疗保健系统(北加州和科罗拉多州)中至少诊断为1种糖尿病多发性神经病变的468,162人的初始队列中,我们创建了一个连续登记的48,209名成年人的分析队列,他们是新诊断的,没有临终关怀的证据。结果是任何下肢并发症,包括足部溃疡、骨髓炎、坏疽或下肢截肢。我们将数据随机分成训练组(38,569/48209;80%)和测试(9,640/48209;20%)的数据集。在训练数据集中,我们使用了超级学习者(SL),这是一种集成学习方法,采用交叉验证并将多个候选风险预测因子组合为单个风险预测因子。我们使用接收者工作特征曲线和校准图来评估测试数据集中SL风险预测器的性能。结果:在该队列的48,209人中,有2327人在随访期间出现了下肢并发症。SL风险估计量具有良好的判别性(AUC=0.845, 95% CI 0.826-0.863)和校准。我们的SL算法的修改版本,简化为便于现实世界的采用,仅略微降低了歧视(AUC=0.817, 95%CI 0.797-0.837)。修改后的版本在相对于警报频率和需要评估的患者数量获得的精度方面略优于naïve逻辑回归模型(AUC=0.804, 95% CI 0.783-0.825)。结论:我们已经建立了一个基于机器学习的风险评估器,它有可能在糖尿病多发性神经病变的初始诊断时提高对下肢并发症高风险糖尿病患者的临床检测。该算法仅使用电子病历数据就具有良好的判别有效性和校准性。需要进一步的研究来确定在解释和部署中最大化算法公平性的最佳环境和策略。