Helene Bei Thomsen, Livie Yumeng Li, Anders Aasted Isaksen, Benjamin Lebiecka-Johansen, Charline Bour, Guy Fagherazzi, William P T M van Doorn, Tibor V Varga, Adam Hulman
{"title":"基于持续血糖监测的60分钟1型糖尿病患者血糖预测的种族差异","authors":"Helene Bei Thomsen, Livie Yumeng Li, Anders Aasted Isaksen, Benjamin Lebiecka-Johansen, Charline Bour, Guy Fagherazzi, William P T M van Doorn, Tibor V Varga, Adam Hulman","doi":"10.1371/journal.pdig.0000918","DOIUrl":null,"url":null,"abstract":"<p><p>Non-Hispanic white (White) populations are overrepresented in medical studies. Potential healthcare disparities can happen when machine learning models, used in diabetes technologies, are trained on data from primarily White patients. We aimed to evaluate algorithmic fairness in glucose predictions. This study utilized continuous glucose monitoring (CGM) data from 101 White and 104 Black participants with type 1 diabetes collected by the JAEB Center for Health Research, US. Long short-term memory (LSTM) deep learning models were trained on 11 datasets of different proportions of White and Black participants and tailored to each individual using transfer learning to predict glucose 60 minutes ahead based on 60-minute windows. Root mean squared errors (RMSE) were calculated for each participant. Linear mixed-effect models were used to investigate the association between racial composition and RMSE while accounting for age, sex, and training data size. A median of 9 weeks (IQR: 7, 10) of CGM data was available per participant. The divergence in performance (RMSE slope by proportion) was not statistically significant for either group. However, the slope difference (from 0% White and 100% Black to 100% White and 0% Black) between groups was statistically significant (p = 0.02), meaning the RMSE increased 0.04 [0.01, 0.08] mmol/L more for Black participants compared to White participants when the proportion of White participants increased from 0 to 100% in the training data. This difference was attenuated in the transfer learned models (RMSE: 0.02 [-0.01, 0.05] mmol/L, p = 0.20). The racial composition of training data created a small statistically significant difference in the performance of the models, which was not present after using transfer learning. This demonstrates the importance of diversity in datasets and the potential value of transfer learning for developing more fair prediction models.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"4 6","pages":"e0000918"},"PeriodicalIF":7.7000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12208448/pdf/","citationCount":"0","resultStr":"{\"title\":\"Racial disparities in continuous glucose monitoring-based 60-min glucose predictions among people with type 1 diabetes.\",\"authors\":\"Helene Bei Thomsen, Livie Yumeng Li, Anders Aasted Isaksen, Benjamin Lebiecka-Johansen, Charline Bour, Guy Fagherazzi, William P T M van Doorn, Tibor V Varga, Adam Hulman\",\"doi\":\"10.1371/journal.pdig.0000918\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Non-Hispanic white (White) populations are overrepresented in medical studies. Potential healthcare disparities can happen when machine learning models, used in diabetes technologies, are trained on data from primarily White patients. We aimed to evaluate algorithmic fairness in glucose predictions. This study utilized continuous glucose monitoring (CGM) data from 101 White and 104 Black participants with type 1 diabetes collected by the JAEB Center for Health Research, US. 
Long short-term memory (LSTM) deep learning models were trained on 11 datasets of different proportions of White and Black participants and tailored to each individual using transfer learning to predict glucose 60 minutes ahead based on 60-minute windows. Root mean squared errors (RMSE) were calculated for each participant. Linear mixed-effect models were used to investigate the association between racial composition and RMSE while accounting for age, sex, and training data size. A median of 9 weeks (IQR: 7, 10) of CGM data was available per participant. The divergence in performance (RMSE slope by proportion) was not statistically significant for either group. However, the slope difference (from 0% White and 100% Black to 100% White and 0% Black) between groups was statistically significant (p = 0.02), meaning the RMSE increased 0.04 [0.01, 0.08] mmol/L more for Black participants compared to White participants when the proportion of White participants increased from 0 to 100% in the training data. This difference was attenuated in the transfer learned models (RMSE: 0.02 [-0.01, 0.05] mmol/L, p = 0.20). The racial composition of training data created a small statistically significant difference in the performance of the models, which was not present after using transfer learning. This demonstrates the importance of diversity in datasets and the potential value of transfer learning for developing more fair prediction models.</p>\",\"PeriodicalId\":74465,\"journal\":{\"name\":\"PLOS digital health\",\"volume\":\"4 6\",\"pages\":\"e0000918\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2025-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12208448/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PLOS digital health\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1371/journal.pdig.0000918\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLOS digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1371/journal.pdig.0000918","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
Racial disparities in continuous glucose monitoring-based 60-min glucose predictions among people with type 1 diabetes.
Non-Hispanic White (White) populations are overrepresented in medical studies. Healthcare disparities can arise when machine learning models used in diabetes technologies are trained primarily on data from White patients. We aimed to evaluate algorithmic fairness in glucose prediction. This study used continuous glucose monitoring (CGM) data from 101 White and 104 Black participants with type 1 diabetes, collected by the Jaeb Center for Health Research, US. Long short-term memory (LSTM) deep learning models were trained on 11 datasets with different proportions of White and Black participants and then tailored to each individual using transfer learning, predicting glucose 60 minutes ahead from 60-minute input windows. Root mean squared error (RMSE) was calculated for each participant. Linear mixed-effects models were used to investigate the association between racial composition and RMSE while accounting for age, sex, and training data size. A median of 9 weeks (IQR: 7, 10) of CGM data was available per participant. The change in performance with training-data composition (RMSE slope by proportion) was not statistically significant for either group. However, the difference in slopes between groups (moving from 0% White and 100% Black to 100% White and 0% Black training data) was statistically significant (p = 0.02): RMSE increased by 0.04 [0.01, 0.08] mmol/L more for Black participants than for White participants as the proportion of White participants in the training data increased from 0% to 100%. This difference was attenuated in the transfer-learned models (RMSE: 0.02 [-0.01, 0.05] mmol/L, p = 0.20). The racial composition of the training data produced a small but statistically significant difference in model performance, which was no longer present after transfer learning. This demonstrates the importance of diversity in datasets and the potential value of transfer learning for developing fairer prediction models.
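The abstract only outlines the modelling pipeline. The sketch below illustrates the general approach it describes: slicing a participant's CGM trace into 60-minute input windows with a 60-minute-ahead target, fitting a small LSTM regressor, optionally fine-tuning the population model on an individual's data (transfer learning), and reporting that participant's RMSE. It is a minimal illustration, not the authors' implementation: the CGM sampling interval (assumed 5 minutes, so 12 readings per hour), layer sizes, and training settings are placeholders not specified in the abstract.

```python
# Illustrative sketch only; architecture and hyperparameters are assumptions,
# not the paper's. Assumes 5-minute CGM sampling: a 60-minute window = 12
# readings, and the 60-minute-ahead target is the reading 12 steps later.
import numpy as np
import tensorflow as tf
from tensorflow import keras

WINDOW = 12   # 60-minute input window (assumed 5-min sampling)
HORIZON = 12  # predict 60 minutes ahead

def make_windows(series: np.ndarray) -> tuple[np.ndarray, np.ndarray]:
    """Slice one participant's CGM trace (mmol/L) into (X, y) pairs."""
    X, y = [], []
    for start in range(len(series) - WINDOW - HORIZON + 1):
        X.append(series[start:start + WINDOW])
        y.append(series[start + WINDOW + HORIZON - 1])
    return np.array(X)[..., np.newaxis], np.array(y)

def build_lstm() -> keras.Model:
    """Small LSTM regressor; layer sizes are placeholders."""
    model = keras.Sequential([
        keras.layers.Input(shape=(WINDOW, 1)),
        keras.layers.LSTM(64),
        keras.layers.Dense(1),
    ])
    model.compile(optimizer="adam", loss="mse")
    return model

def personal_rmse(base_model: keras.Model,
                  train_series: np.ndarray,
                  test_series: np.ndarray,
                  fine_tune: bool = True) -> float:
    """Optionally fine-tune the population model on one participant's data
    (transfer learning), then return that participant's RMSE in mmol/L."""
    model = keras.models.clone_model(base_model)
    model.set_weights(base_model.get_weights())
    model.compile(optimizer=keras.optimizers.Adam(1e-4), loss="mse")
    if fine_tune:
        X_tr, y_tr = make_windows(train_series)
        model.fit(X_tr, y_tr, epochs=5, batch_size=64, verbose=0)
    X_te, y_te = make_windows(test_series)
    preds = model.predict(X_te, verbose=0).ravel()
    return float(np.sqrt(np.mean((preds - y_te) ** 2)))
```

In the study's design, the base model would be trained on pooled windows from one of the 11 datasets with a given White/Black participant mix, and the per-participant RMSEs from a step like `personal_rmse` would then feed the linear mixed-effects analysis of RMSE against training-data composition, adjusted for age, sex, and training data size.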