Yinan Mao, Kyle Xin Quan Tan, Augustin Seng, Peter Wong, Sue-Anne Toh, Alex R Cook
{"title":"使用连续血糖监测资料和机器学习对糖尿病患者进行分层","authors":"Yinan Mao, Kyle Xin Quan Tan, Augustin Seng, Peter Wong, Sue-Anne Toh, Alex R Cook","doi":"10.34133/2022/9892340","DOIUrl":null,"url":null,"abstract":"<p><p><i>Background.</i> Continuous glucose monitoring (CGM) offers an opportunity for patients with diabetes to modify their lifestyle to better manage their condition and for clinicians to provide personalized healthcare and lifestyle advice. However, analytic tools are needed to standardize and analyze the rich data that emerge from CGM devices. This would allow glucotypes of patients to be identified to aid clinical decision-making.<i>Methods.</i> In this paper, we develop an analysis pipeline for CGM data and apply it to 148 diabetic patients with a total of 8632 days of follow up. The pipeline projects CGM data to a lower-dimensional space of features representing centrality, spread, size, and duration of glycemic excursions and the circadian cycle. We then use principal components analysis and <math><mi>k</mi></math>-means to cluster patients' records into one of four glucotypes and analyze cluster membership using multinomial logistic regression.<i>Results.</i> Glucotypes differ in the degree of control, amount of time spent in range, and on the presence and timing of hyper- and hypoglycemia. Patients on the program had statistically significant improvements in their glucose levels.<i>Conclusions.</i> This pipeline provides a fast automatic function to label raw CGM data without manual input.</p>","PeriodicalId":73207,"journal":{"name":"Health data science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10880155/pdf/","citationCount":"0","resultStr":"{\"title\":\"Stratification of Patients with Diabetes Using Continuous Glucose Monitoring Profiles and Machine Learning.\",\"authors\":\"Yinan Mao, Kyle Xin Quan Tan, Augustin Seng, Peter Wong, Sue-Anne Toh, Alex R Cook\",\"doi\":\"10.34133/2022/9892340\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><i>Background.</i> Continuous glucose monitoring (CGM) offers an opportunity for patients with diabetes to modify their lifestyle to better manage their condition and for clinicians to provide personalized healthcare and lifestyle advice. However, analytic tools are needed to standardize and analyze the rich data that emerge from CGM devices. This would allow glucotypes of patients to be identified to aid clinical decision-making.<i>Methods.</i> In this paper, we develop an analysis pipeline for CGM data and apply it to 148 diabetic patients with a total of 8632 days of follow up. The pipeline projects CGM data to a lower-dimensional space of features representing centrality, spread, size, and duration of glycemic excursions and the circadian cycle. We then use principal components analysis and <math><mi>k</mi></math>-means to cluster patients' records into one of four glucotypes and analyze cluster membership using multinomial logistic regression.<i>Results.</i> Glucotypes differ in the degree of control, amount of time spent in range, and on the presence and timing of hyper- and hypoglycemia. Patients on the program had statistically significant improvements in their glucose levels.<i>Conclusions.</i> This pipeline provides a fast automatic function to label raw CGM data without manual input.</p>\",\"PeriodicalId\":73207,\"journal\":{\"name\":\"Health data science\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10880155/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health data science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.34133/2022/9892340\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2022/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health data science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34133/2022/9892340","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
Stratification of Patients with Diabetes Using Continuous Glucose Monitoring Profiles and Machine Learning.
Background. Continuous glucose monitoring (CGM) offers an opportunity for patients with diabetes to modify their lifestyle to better manage their condition and for clinicians to provide personalized healthcare and lifestyle advice. However, analytic tools are needed to standardize and analyze the rich data that emerge from CGM devices. This would allow glucotypes of patients to be identified to aid clinical decision-making.Methods. In this paper, we develop an analysis pipeline for CGM data and apply it to 148 diabetic patients with a total of 8632 days of follow up. The pipeline projects CGM data to a lower-dimensional space of features representing centrality, spread, size, and duration of glycemic excursions and the circadian cycle. We then use principal components analysis and -means to cluster patients' records into one of four glucotypes and analyze cluster membership using multinomial logistic regression.Results. Glucotypes differ in the degree of control, amount of time spent in range, and on the presence and timing of hyper- and hypoglycemia. Patients on the program had statistically significant improvements in their glucose levels.Conclusions. This pipeline provides a fast automatic function to label raw CGM data without manual input.