时间序列基础设施度量数据的异常检测

Stefan Bucur, F. Moldoveanu
{"title":"时间序列基础设施度量数据的异常检测","authors":"Stefan Bucur, F. Moldoveanu","doi":"10.1109/CSCS.2019.00036","DOIUrl":null,"url":null,"abstract":"Time series metric data is used by monitoring systems to analyze the performance of hardware and software from a data center. This data can be easily displayed using line graphs and, as such, can be manually checked by operators for anomalies. Manual reviews have some major disadvantages, since they are time consuming and error prone. In this paper, we describe our approach for automatically detecting outliers in metric data, which have some unique properties like high seasonality and extreme seasonal spikes. We use the \"Prophet\" library to create a model of the time series data, which is then automatically analyzed with our own distance formula for detecting anomalies in the time series data. A part of our metrics data have a propensity for extreme recurrent spikes, for example network usage during daily backups that is millions of times higher than the average. To handle such special cases we have also implemented a seasonal spike detector.","PeriodicalId":352411,"journal":{"name":"2019 22nd International Conference on Control Systems and Computer Science (CSCS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Anomaly Detection for Time Series Infrastructure Metric Data\",\"authors\":\"Stefan Bucur, F. Moldoveanu\",\"doi\":\"10.1109/CSCS.2019.00036\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Time series metric data is used by monitoring systems to analyze the performance of hardware and software from a data center. This data can be easily displayed using line graphs and, as such, can be manually checked by operators for anomalies. Manual reviews have some major disadvantages, since they are time consuming and error prone. In this paper, we describe our approach for automatically detecting outliers in metric data, which have some unique properties like high seasonality and extreme seasonal spikes. We use the \\\"Prophet\\\" library to create a model of the time series data, which is then automatically analyzed with our own distance formula for detecting anomalies in the time series data. A part of our metrics data have a propensity for extreme recurrent spikes, for example network usage during daily backups that is millions of times higher than the average. To handle such special cases we have also implemented a seasonal spike detector.\",\"PeriodicalId\":352411,\"journal\":{\"name\":\"2019 22nd International Conference on Control Systems and Computer Science (CSCS)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 22nd International Conference on Control Systems and Computer Science (CSCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSCS.2019.00036\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 22nd International Conference on Control Systems and Computer Science (CSCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSCS.2019.00036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

监控系统使用时间序列度量数据来分析数据中心硬件和软件的性能。这些数据可以很容易地使用线形图显示,因此,操作员可以手动检查异常情况。手动审查有一些主要的缺点,因为它们既耗时又容易出错。在本文中,我们描述了一种自动检测度量数据中异常值的方法,这些异常值具有高季节性和极端季节性峰值等独特性质。我们使用“Prophet”库创建时间序列数据的模型,然后使用我们自己的距离公式自动分析时间序列数据中的异常。我们的指标数据的一部分具有极端周期性峰值的倾向,例如,每日备份期间的网络使用情况比平均值高出数百万倍。为了处理这种特殊情况,我们还实施了季节性峰值检测器。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Anomaly Detection for Time Series Infrastructure Metric Data
Time series metric data is used by monitoring systems to analyze the performance of hardware and software from a data center. This data can be easily displayed using line graphs and, as such, can be manually checked by operators for anomalies. Manual reviews have some major disadvantages, since they are time consuming and error prone. In this paper, we describe our approach for automatically detecting outliers in metric data, which have some unique properties like high seasonality and extreme seasonal spikes. We use the "Prophet" library to create a model of the time series data, which is then automatically analyzed with our own distance formula for detecting anomalies in the time series data. A part of our metrics data have a propensity for extreme recurrent spikes, for example network usage during daily backups that is millions of times higher than the average. To handle such special cases we have also implemented a seasonal spike detector.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信