Genetic algorithm with a Bayesian approach for multiple change-point detection in time series of counting exceedances for specific thresholds

Pub Date : 2023-10-09 DOI:10.1007/s42952-023-00227-2
Biviana Marcela Suárez-Sierra, Arrigo Coen, Carlos Alberto Taimal
{"title":"Genetic algorithm with a Bayesian approach for multiple change-point detection in time series of counting exceedances for specific thresholds","authors":"Biviana Marcela Suárez-Sierra, Arrigo Coen, Carlos Alberto Taimal","doi":"10.1007/s42952-023-00227-2","DOIUrl":null,"url":null,"abstract":"Abstract Although the applications of Non-Homogeneous Poisson Processes (NHPP) to model and study the threshold overshoots of interest in different time series of measurements have proven to provide good results, they needed to be complemented with an efficient and automatic diagnostic technique to establish the location of the change-points, which, when taken into account, make the estimated model fit poorly in regards of the information contained in the real one. Because of this, a new method is proposed to solve the segmentation uncertainty of the time series of measurements, where the generating distribution of exceedances of a specific threshold is the focus of investigation. One of the great contributions of the present algorithm is that all the days that trespassed are candidates to be a change-point, so all the possible configurations of overflow days under the heuristics of a genetic algorithm are the possible chromosomes, which will unite to produce new solutions. Also, such methods will be guarantee to non-local and the best possible one solution, reducing wasted machine time evaluating the least likely chromosomes to be a feasible solution. The analytical evaluation technique will be by means of the Minimum Description Length ( MDL ) as the objective function, which is the joint posterior distribution function of the parameters of the NHPP of each regime and the change-points that determines them and which account as well for the influence of the presence of said times. Thus, one of the practical implications of the present work comes in terms of overcoming the need of modeling the time series of measurements, where the distributions of exceedances of certain thresholds, or where the counting of certain events involving abrupt changes, is the main focus with applications in phenomena such as climate change, information security and epidemiology, to name a few.","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s42952-023-00227-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Although the applications of Non-Homogeneous Poisson Processes (NHPP) to model and study the threshold overshoots of interest in different time series of measurements have proven to provide good results, they needed to be complemented with an efficient and automatic diagnostic technique to establish the location of the change-points, which, when taken into account, make the estimated model fit poorly in regards of the information contained in the real one. Because of this, a new method is proposed to solve the segmentation uncertainty of the time series of measurements, where the generating distribution of exceedances of a specific threshold is the focus of investigation. One of the great contributions of the present algorithm is that all the days that trespassed are candidates to be a change-point, so all the possible configurations of overflow days under the heuristics of a genetic algorithm are the possible chromosomes, which will unite to produce new solutions. Also, such methods will be guarantee to non-local and the best possible one solution, reducing wasted machine time evaluating the least likely chromosomes to be a feasible solution. The analytical evaluation technique will be by means of the Minimum Description Length ( MDL ) as the objective function, which is the joint posterior distribution function of the parameters of the NHPP of each regime and the change-points that determines them and which account as well for the influence of the presence of said times. Thus, one of the practical implications of the present work comes in terms of overcoming the need of modeling the time series of measurements, where the distributions of exceedances of certain thresholds, or where the counting of certain events involving abrupt changes, is the main focus with applications in phenomena such as climate change, information security and epidemiology, to name a few.

Abstract Image

分享
查看原文
遗传算法与贝叶斯方法多变化点检测的时间序列计数超过特定阈值
虽然应用非齐次泊松过程(Non-Homogeneous Poisson Processes, NHPP)来建模和研究不同时间序列测量的感兴趣的阈值超调已被证明提供了良好的结果,但它们需要补充一种有效的自动诊断技术来确定变化点的位置,当考虑到这一点时,估计模型与真实模型中包含的信息拟合较差。因此,提出了一种新的方法来解决测量时间序列的分割不确定性,其中研究了超过特定阈值的生成分布。该算法的一个重要贡献是,所有的溢出天数都是一个候选的改变点,因此在遗传算法的启发式下,所有可能的溢出天数配置都是可能的染色体,它们将联合起来产生新的解。此外,这种方法将保证非局部和最佳可能的一个解决方案,减少浪费的机器时间来评估最不可能的染色体是一个可行的解决方案。分析评价技术将通过最小描述长度(MDL)作为目标函数,这是每个制度的NHPP参数和决定它们的变化点的联合后验分布函数,也说明了上述时间存在的影响。因此,目前工作的实际意义之一在于克服对测量时间序列建模的需要,在这些时间序列中,超出某些阈值的分布,或涉及突变的某些事件的计数,是气候变化、信息安全和流行病学等现象应用的主要焦点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信