The use of block maxima method of extreme value statistics to characterise blood glucose curves

M. Szigeti, T. Ferenci, L. Kovács
{"title":"The use of block maxima method of extreme value statistics to characterise blood glucose curves","authors":"M. Szigeti, T. Ferenci, L. Kovács","doi":"10.1109/SoSE50414.2020.9130427","DOIUrl":null,"url":null,"abstract":"In contrast to regular statistics where the focus is on the most typical part of the data and the used metrics are describing that part (usually with the mean or median and variance and interquartile range) where most of the observations came from, there is a branch of statistics which focuses on the extreme events, i.e, the tails of the distributions. These are not simple outliers, like data entry errors, but real part of the data which are far from the central tendency and occur rarely, yet, have relevance and impact. Thus, in many application, they can’t be simply neglected. The use of extreme value statistics allows us to fit models on this part of the data and like “regular” statistics, enables us to calculate estimates and predictions, but in this case for extreme values. These methods are frequently used in fields like meteorology and finance where the extreme events have large impact despite their rarity. Because of this rarity, however, only a small fraction of the data can be used so much higher sample size is required for such analysis. This factor limited the use of extreme value statistics in biomedical field where available technology and costs are strong limitations at frequently measuring most of the biomarkers until recently. Blood glucose level is one of the exceptions nowadays, as with recent advancements it can be monitored for relatively long time and with high frequency for a patient. Additionally, extreme values of blood glucose levels (both high and low) are associated with- chronic or acute- complications of diabetes. This paper aims to demonstrate that the use of extreme value statistics, in particular the block maxima approach could be a possible way to characterize blood glucose curves. In addition to providing a metric for the state of the patient and therefore hopefully the associated risks, it allows the comparison of the performance of artificial pancreas systems. Block maxima method was used to model extreme values of a dataset containing measurements of a single patient with 476 complete days of data acquired with sampling frequency of 15 minutes. Probabilities for exceeding the clinically relevant levels of 270 mg/dl (cognitive symptoms expected) and 600 mg/dl (diabetic hyperosmolar syndrome) were calculated and were 3.47% and 4.96.10-7% respectively. Through these estimates it is possible to characterise each patient’s status and compare different controllers.","PeriodicalId":121664,"journal":{"name":"2020 IEEE 15th International Conference of System of Systems Engineering (SoSE)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 15th International Conference of System of Systems Engineering (SoSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SoSE50414.2020.9130427","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

In contrast to regular statistics where the focus is on the most typical part of the data and the used metrics are describing that part (usually with the mean or median and variance and interquartile range) where most of the observations came from, there is a branch of statistics which focuses on the extreme events, i.e, the tails of the distributions. These are not simple outliers, like data entry errors, but real part of the data which are far from the central tendency and occur rarely, yet, have relevance and impact. Thus, in many application, they can’t be simply neglected. The use of extreme value statistics allows us to fit models on this part of the data and like “regular” statistics, enables us to calculate estimates and predictions, but in this case for extreme values. These methods are frequently used in fields like meteorology and finance where the extreme events have large impact despite their rarity. Because of this rarity, however, only a small fraction of the data can be used so much higher sample size is required for such analysis. This factor limited the use of extreme value statistics in biomedical field where available technology and costs are strong limitations at frequently measuring most of the biomarkers until recently. Blood glucose level is one of the exceptions nowadays, as with recent advancements it can be monitored for relatively long time and with high frequency for a patient. Additionally, extreme values of blood glucose levels (both high and low) are associated with- chronic or acute- complications of diabetes. This paper aims to demonstrate that the use of extreme value statistics, in particular the block maxima approach could be a possible way to characterize blood glucose curves. In addition to providing a metric for the state of the patient and therefore hopefully the associated risks, it allows the comparison of the performance of artificial pancreas systems. Block maxima method was used to model extreme values of a dataset containing measurements of a single patient with 476 complete days of data acquired with sampling frequency of 15 minutes. Probabilities for exceeding the clinically relevant levels of 270 mg/dl (cognitive symptoms expected) and 600 mg/dl (diabetic hyperosmolar syndrome) were calculated and were 3.47% and 4.96.10-7% respectively. Through these estimates it is possible to characterise each patient’s status and compare different controllers.
利用极值统计的块极大值法来表征血糖曲线
常规统计关注的是数据的最典型部分,使用的指标描述的是大部分观测结果来自的那部分(通常是平均值或中位数、方差和四分位数范围),与之相反,统计学的一个分支关注的是极端事件,即分布的尾部。这些不是简单的异常值,如数据输入错误,而是远离集中趋势的真实数据部分,很少发生,但具有相关性和影响。因此,在许多应用中,它们不能被简单地忽略。极值统计的使用使我们能够在这部分数据上拟合模型,并且像“常规”统计一样,使我们能够计算估计和预测,但在这种情况下是针对极值的。这些方法经常用于气象和金融等领域,在这些领域,极端事件尽管罕见,但影响很大。然而,由于这种稀有性,只有一小部分数据可以使用,因此需要更大的样本量来进行这种分析。这一因素限制了极端值统计在生物医学领域的使用,因为直到最近,现有的技术和成本都是频繁测量大多数生物标志物的强大限制。血糖水平是当今的一个例外,随着最近的进步,它可以监测相对较长时间和高频率的病人。此外,血糖水平的极端值(无论是高还是低)都与糖尿病的慢性或急性并发症有关。本文旨在证明使用极值统计,特别是块极大值方法可能是表征血糖曲线的一种可能方法。除了为病人的状态提供一个指标,因此有希望提供相关的风险,它允许对人工胰腺系统的性能进行比较。使用块最大值方法对包含单个患者的测量数据集的极值进行建模,该数据集具有476完整天的数据,采样频率为15分钟。计算出超过临床相关水平270 mg/dl(预期认知症状)和600 mg/dl(糖尿病高渗综合征)的概率,分别为3.47%和4.96.10-7%。通过这些估计,有可能描述每个病人的状态,并比较不同的控制器。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信