{"title":"Anomaly Detection for Time Series Infrastructure Metric Data","authors":"Stefan Bucur, F. Moldoveanu","doi":"10.1109/CSCS.2019.00036","DOIUrl":null,"url":null,"abstract":"Time series metric data is used by monitoring systems to analyze the performance of hardware and software from a data center. This data can be easily displayed using line graphs and, as such, can be manually checked by operators for anomalies. Manual reviews have some major disadvantages, since they are time consuming and error prone. In this paper, we describe our approach for automatically detecting outliers in metric data, which have some unique properties like high seasonality and extreme seasonal spikes. We use the \"Prophet\" library to create a model of the time series data, which is then automatically analyzed with our own distance formula for detecting anomalies in the time series data. A part of our metrics data have a propensity for extreme recurrent spikes, for example network usage during daily backups that is millions of times higher than the average. To handle such special cases we have also implemented a seasonal spike detector.","PeriodicalId":352411,"journal":{"name":"2019 22nd International Conference on Control Systems and Computer Science (CSCS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 22nd International Conference on Control Systems and Computer Science (CSCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSCS.2019.00036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Time series metric data is used by monitoring systems to analyze the performance of hardware and software from a data center. This data can be easily displayed using line graphs and, as such, can be manually checked by operators for anomalies. Manual reviews have some major disadvantages, since they are time consuming and error prone. In this paper, we describe our approach for automatically detecting outliers in metric data, which have some unique properties like high seasonality and extreme seasonal spikes. We use the "Prophet" library to create a model of the time series data, which is then automatically analyzed with our own distance formula for detecting anomalies in the time series data. A part of our metrics data have a propensity for extreme recurrent spikes, for example network usage during daily backups that is millions of times higher than the average. To handle such special cases we have also implemented a seasonal spike detector.