{"title":"ANOMALY DETECTION AND ATTRIBUTION USING AUTO FORECAST AND DIRECTED GRAPHS","authors":"V. Sankar, Somendra Tripathi","doi":"10.5121/IJDKP.2016.6205","DOIUrl":null,"url":null,"abstract":"In the business world, decision makers rely heavily on data to back their decisions. With the quantum of data increasing rapidly, traditional methods used to generate insights from reports and dashboards will soon become intractable. This creates a need for efficient systems which can substitute human intelligence and reduce time latency in decision making. This paper describes an approach to process time series data with multiple dimensions such as geographies, verticals, products, efficiently, and to detect anomalies in the data and further, to explain potential reasons for the occurrence of the anomalies. The algorithm implements auto selection of forecast models to make reliable forecasts and detect such anomalies. Depth First Search (DFS) is applied to analyse each of these anomalies and find its root causes. The algorithm filters the redundant causes and reports the insights to the stakeholders. Apart from being a hair-trigger KPI tracking mechanism, this algorithm can also be customized for problems lke A/B testing, campaign tracking and product evaluations.","PeriodicalId":131153,"journal":{"name":"International Journal of Data Mining & Knowledge Management Process","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Data Mining & Knowledge Management Process","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/IJDKP.2016.6205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In the business world, decision makers rely heavily on data to back their decisions. With the quantum of data increasing rapidly, traditional methods used to generate insights from reports and dashboards will soon become intractable. This creates a need for efficient systems which can substitute human intelligence and reduce time latency in decision making. This paper describes an approach to process time series data with multiple dimensions such as geographies, verticals, products, efficiently, and to detect anomalies in the data and further, to explain potential reasons for the occurrence of the anomalies. The algorithm implements auto selection of forecast models to make reliable forecasts and detect such anomalies. Depth First Search (DFS) is applied to analyse each of these anomalies and find its root causes. The algorithm filters the redundant causes and reports the insights to the stakeholders. Apart from being a hair-trigger KPI tracking mechanism, this algorithm can also be customized for problems lke A/B testing, campaign tracking and product evaluations.
在商业世界中,决策者严重依赖数据来支持他们的决策。随着数据量的快速增长,用于从报告和仪表板中生成见解的传统方法将很快变得棘手。这就产生了对高效系统的需求,这些系统可以替代人类智能,减少决策过程中的时间延迟。本文介绍了一种高效处理地理、垂直、产品等多维时间序列数据的方法,并对数据中的异常进行检测,进而解释异常发生的潜在原因。该算法实现了预测模型的自动选择,以进行可靠的预测和检测此类异常。采用深度优先搜索(Depth First Search, DFS)对每一种异常进行分析,找出其根本原因。该算法过滤冗余原因,并向涉众报告见解。除了作为一种一触即发的KPI跟踪机制,该算法还可以针对a /B测试、活动跟踪和产品评估等问题进行定制。