Outlier Detection Based on Low Density Models

2018 IEEE International Conference on Data Mining Workshops (ICDMW) Pub Date : 2018-11-01 DOI:10.1109/ICDMW.2018.00140

Félix Iglesias, T. Zseby, A. Zimek

引用次数: 19

Abstract

Most outlier detection algorithms are based on lazy learning or imply quadratic complexity. Both characteristics make them unsuitable for big data and stream data applications and preclude their applicability in systems that must operate autonomously. In this paper we propose a new algorithm—called SDO (Sparse Data Observers)—to estimate outlierness based on low density models of data. SDO is an eager learner; therefore, computational costs in application phases are severely reduced. We perform tests with a wide variation of synthetic datasets as well as the main datasets published in the literature for anomaly detection testing. Results show that SDO satisfactorily competes with the best ranked outlier detection alternatives. The good detection performance coupled with a low complexity makes SDO highly flexible and adaptable to stand-alone frameworks that must detect outliers fast with accuracy rates equivalent to lazy learning algorithms.

查看原文本刊更多论文

基于低密度模型的离群点检测

大多数离群点检测算法都是基于惰性学习或隐含二次复杂度。这两种特性都使它们不适合大数据和流数据应用，也阻碍了它们在必须自主运行的系统中的适用性。在本文中，我们提出了一种新的算法，称为SDO(稀疏数据观察者)来估计基于低密度数据模型的离群值。SDO是一个渴望学习的人;因此，应用阶段的计算成本大大降低。我们使用各种合成数据集以及发表在异常检测测试文献中的主要数据集进行测试。结果表明，SDO与排名最佳的离群值检测方案具有令人满意的竞争力。良好的检测性能加上较低的复杂性使得SDO非常灵活，适合于必须快速检测异常值的独立框架，其准确率相当于惰性学习算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE International Conference on Data Mining Workshops (ICDMW)

自引率

0.00%

发文量