Data augmentation and evolutionary algorithms to improve the prediction of blood glucose levels in scarcity of training data

2017 IEEE Congress on Evolutionary Computation (CEC) Pub Date : 2017-06-05 DOI:10.1109/CEC.2017.7969570

J. M. Velasco, O. Garnica, Sergio Contador, J. Lanchares, E. Maqueda, M. Botella, J. Hidalgo

{"title":"Data augmentation and evolutionary algorithms to improve the prediction of blood glucose levels in scarcity of training data","authors":"J. M. Velasco, O. Garnica, Sergio Contador, J. Lanchares, E. Maqueda, M. Botella, J. Hidalgo","doi":"10.1109/CEC.2017.7969570","DOIUrl":null,"url":null,"abstract":"Diabetes Mellitus Type 1 patients are waiting for the arrival of the Artificial Pancreas. Artificial Pancreas systems will control the blood glucose of patients, improving their quality of life and reducing the risks they face daily. At the core of the Artificial Pancreas, an algorithm will forecast future glucose levels and estimate insulin bolus sizes. Grammatical Evolution has been proved as a suitable algorithm for predicting glucose levels. Nevertheless, one of the main obstacles that researches have found for training the Grammatical Evolution models is the lack of significant amounts of data. As in many other fields in medicine, the collection of data from real patients is very complex along with the fact that the patient's response can vary in a high degree due to a lot of personal factors which can be seen as different scenarios. In this paper, we propose both a classification system for scenario selection and a data augmentation algorithm that generates synthetic glucose time series from real data. Our experimental results show that, in a scarce data context, Grammatical Evolution models can get more accurate and robust predictions using scenario selection and data augmentation.","PeriodicalId":335123,"journal":{"name":"2017 IEEE Congress on Evolutionary Computation (CEC)","volume":"396 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Congress on Evolutionary Computation (CEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CEC.2017.7969570","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

Abstract

Diabetes Mellitus Type 1 patients are waiting for the arrival of the Artificial Pancreas. Artificial Pancreas systems will control the blood glucose of patients, improving their quality of life and reducing the risks they face daily. At the core of the Artificial Pancreas, an algorithm will forecast future glucose levels and estimate insulin bolus sizes. Grammatical Evolution has been proved as a suitable algorithm for predicting glucose levels. Nevertheless, one of the main obstacles that researches have found for training the Grammatical Evolution models is the lack of significant amounts of data. As in many other fields in medicine, the collection of data from real patients is very complex along with the fact that the patient's response can vary in a high degree due to a lot of personal factors which can be seen as different scenarios. In this paper, we propose both a classification system for scenario selection and a data augmentation algorithm that generates synthetic glucose time series from real data. Our experimental results show that, in a scarce data context, Grammatical Evolution models can get more accurate and robust predictions using scenario selection and data augmentation.

查看原文本刊更多论文

在训练数据匮乏的情况下，改进血糖水平预测的数据增强和进化算法

1型糖尿病患者正在等待人工胰腺的到来。人工胰腺系统将控制患者的血糖，提高他们的生活质量，减少他们每天面临的风险。在人工胰腺的核心，一种算法将预测未来的血糖水平并估计胰岛素剂量。语法进化已被证明是预测血糖水平的合适算法。然而，研究人员发现，训练语法进化模型的主要障碍之一是缺乏大量的数据。正如在医学的许多其他领域一样，从真实患者身上收集数据是非常复杂的，而且由于许多个人因素，患者的反应可能在很大程度上有所不同，这些因素可以被视为不同的情况。在本文中，我们提出了一个用于场景选择的分类系统和一个从真实数据生成合成葡萄糖时间序列的数据增强算法。我们的实验结果表明，在数据稀缺的情况下，使用场景选择和数据增强，语法进化模型可以获得更准确和稳健的预测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE Congress on Evolutionary Computation (CEC)

自引率

0.00%

发文量