{"title":"OML-AD:用于时间序列数据异常检测的在线机器学习","authors":"Sebastian Wette, Florian Heinrichs","doi":"arxiv-2409.09742","DOIUrl":null,"url":null,"abstract":"Time series are ubiquitous and occur naturally in a variety of applications\n-- from data recorded by sensors in manufacturing processes, over financial\ndata streams to climate data. Different tasks arise, such as regression,\nclassification or segmentation of the time series. However, to reliably solve\nthese challenges, it is important to filter out abnormal observations that\ndeviate from the usual behavior of the time series. While many anomaly\ndetection methods exist for independent data and stationary time series, these\nmethods are not applicable to non-stationary time series. To allow for\nnon-stationarity in the data, while simultaneously detecting anomalies, we\npropose OML-AD, a novel approach for anomaly detection (AD) based on online\nmachine learning (OML). We provide an implementation of OML-AD within the\nPython library River and show that it outperforms state-of-the-art baseline\nmethods in terms of accuracy and computational efficiency.","PeriodicalId":501340,"journal":{"name":"arXiv - STAT - Machine Learning","volume":"31 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"OML-AD: Online Machine Learning for Anomaly Detection in Time Series Data\",\"authors\":\"Sebastian Wette, Florian Heinrichs\",\"doi\":\"arxiv-2409.09742\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Time series are ubiquitous and occur naturally in a variety of applications\\n-- from data recorded by sensors in manufacturing processes, over financial\\ndata streams to climate data. Different tasks arise, such as regression,\\nclassification or segmentation of the time series. However, to reliably solve\\nthese challenges, it is important to filter out abnormal observations that\\ndeviate from the usual behavior of the time series. While many anomaly\\ndetection methods exist for independent data and stationary time series, these\\nmethods are not applicable to non-stationary time series. To allow for\\nnon-stationarity in the data, while simultaneously detecting anomalies, we\\npropose OML-AD, a novel approach for anomaly detection (AD) based on online\\nmachine learning (OML). We provide an implementation of OML-AD within the\\nPython library River and show that it outperforms state-of-the-art baseline\\nmethods in terms of accuracy and computational efficiency.\",\"PeriodicalId\":501340,\"journal\":{\"name\":\"arXiv - STAT - Machine Learning\",\"volume\":\"31 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.09742\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09742","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
时间序列无处不在,自然出现在各种应用中--从生产过程中传感器记录的数据、金融数据流到气候数据。不同的任务随之而来,如时间序列的回归、分类或分割。然而,要可靠地解决这些难题,重要的是要过滤掉与时间序列通常行为不同的异常观测数据。虽然有很多异常检测方法适用于独立数据和静态时间序列,但这些方法并不适用于非静态时间序列。为了在检测异常的同时考虑数据的非平稳性,我们提出了基于在线机器学习(OML)的异常检测(AD)新方法 OML-AD。我们在 Python 库 River 中提供了 OML-AD 的实现,并证明它在准确性和计算效率方面优于最先进的基准方法。
OML-AD: Online Machine Learning for Anomaly Detection in Time Series Data
Time series are ubiquitous and occur naturally in a variety of applications
-- from data recorded by sensors in manufacturing processes, over financial
data streams to climate data. Different tasks arise, such as regression,
classification or segmentation of the time series. However, to reliably solve
these challenges, it is important to filter out abnormal observations that
deviate from the usual behavior of the time series. While many anomaly
detection methods exist for independent data and stationary time series, these
methods are not applicable to non-stationary time series. To allow for
non-stationarity in the data, while simultaneously detecting anomalies, we
propose OML-AD, a novel approach for anomaly detection (AD) based on online
machine learning (OML). We provide an implementation of OML-AD within the
Python library River and show that it outperforms state-of-the-art baseline
methods in terms of accuracy and computational efficiency.