Modeling the fasting blood glucose response to basal insulin adjustment in type 2 diabetes: An explainable machine learning approach on real-world data
IF 3.7 2区 医学Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS
Camilla Heisel Nyholm Thomsen , Thomas Kronborg , Stine Hangaard , Peter Vestergaard , Ole Hejlesen , Morten Hasselstrøm Jensen
{"title":"Modeling the fasting blood glucose response to basal insulin adjustment in type 2 diabetes: An explainable machine learning approach on real-world data","authors":"Camilla Heisel Nyholm Thomsen , Thomas Kronborg , Stine Hangaard , Peter Vestergaard , Ole Hejlesen , Morten Hasselstrøm Jensen","doi":"10.1016/j.ijmedinf.2024.105758","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>Optimal basal insulin titration for people with type 2 diabetes is vital to effectively reducing the risk of complications. However, a sizeable proportion of people (30–50 %) remain in suboptimal glycemic control six months post-initiation of basal insulin. This indicates a clear need for novel titration methods that account for individual patient variability in real-world settings.</div></div><div><h3>Objective</h3><div>This study aims to investigate the use of real-world data and explainable machine learning in modeling fasting glucose responses to basal insulin adjustments, focusing on identifying factors influencing fasting glucose variability.</div></div><div><h3>Methods</h3><div>A three-step explanatory approach was used to develop models using multiple linear regression, forward feature selection, and three-fold cross-validation. The models were built progressively, starting with a baseline model incorporating fasting blood glucose and insulin dose adjustments, followed by iterative models that in turn included biometric data, social factors, and biochemistry data, and lastly, a comprehensive model without constraints on the feature pool.</div></div><div><h3>Results</h3><div>The baseline model yielded an average root mean squared error (RMSE) of 1.52 [95% CI: 1.33–1.71]. The iterative models resulted in an average RMSE of 1.49 [95% CI: 1.35–1.62] (biometric data), 1.47 [95% CI: 1.36–1.58] (social factors), and 1.52 [95% CI: 1.34–1.70] (biochemistry data). The comprehensive model yielded an average RMSE of 1.44 [95% CI: 1.41–1.48].</div></div><div><h3>Conclusion</h3><div>Developing explainable machine learning models using real-world data is possible for basal insulin titration. However, model performance is influenced by data’s ability to capture everyday behavior, underscoring the need for incorporating more detailed behavioral and social data to optimize future titration models.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"195 ","pages":"Article 105758"},"PeriodicalIF":3.7000,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505624004210","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction
Optimal basal insulin titration for people with type 2 diabetes is vital to effectively reducing the risk of complications. However, a sizeable proportion of people (30–50 %) remain in suboptimal glycemic control six months post-initiation of basal insulin. This indicates a clear need for novel titration methods that account for individual patient variability in real-world settings.
Objective
This study aims to investigate the use of real-world data and explainable machine learning in modeling fasting glucose responses to basal insulin adjustments, focusing on identifying factors influencing fasting glucose variability.
Methods
A three-step explanatory approach was used to develop models using multiple linear regression, forward feature selection, and three-fold cross-validation. The models were built progressively, starting with a baseline model incorporating fasting blood glucose and insulin dose adjustments, followed by iterative models that in turn included biometric data, social factors, and biochemistry data, and lastly, a comprehensive model without constraints on the feature pool.
Results
The baseline model yielded an average root mean squared error (RMSE) of 1.52 [95% CI: 1.33–1.71]. The iterative models resulted in an average RMSE of 1.49 [95% CI: 1.35–1.62] (biometric data), 1.47 [95% CI: 1.36–1.58] (social factors), and 1.52 [95% CI: 1.34–1.70] (biochemistry data). The comprehensive model yielded an average RMSE of 1.44 [95% CI: 1.41–1.48].
Conclusion
Developing explainable machine learning models using real-world data is possible for basal insulin titration. However, model performance is influenced by data’s ability to capture everyday behavior, underscoring the need for incorporating more detailed behavioral and social data to optimize future titration models.
期刊介绍:
International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings.
The scope of journal covers:
Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.;
Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc.
Educational computer based programs pertaining to medical informatics or medicine in general;
Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.