Robust traffic anomaly detection with principal component pursuit

A. Abdelkefi, Yuming Jiang, Wen Wang, Arne Øslebø, O. Kvittem
{"title":"Robust traffic anomaly detection with principal component pursuit","authors":"A. Abdelkefi, Yuming Jiang, Wen Wang, Arne Øslebø, O. Kvittem","doi":"10.1145/1921206.1921217","DOIUrl":null,"url":null,"abstract":"Principal component analysis (PCA) is a statistical technique that has been used for data analysis and dimensionality reduction. It was introduced as a network traffic anomaly detection technique firstly in [1]. Since then, a lot of research attention has been received, which results in an extensive analysis and several extensions. In [2], the sensitivity of PCA to its tuning parameters, such as the dimension of the low-rank subspace and the detection threshold, on traffic anomaly detection was indicated. However, no explanation on the underlying reasons of the problem was given in [2]. In [3], further investigation on the PCA sensitivity was conducted and it was found that the PCA sensitivity comes from the inability of PCA to detect temporal correlations. Based on this finding, an extension of PCA to Kalman-Loeve expansion (KLE) was proposed in [3]. While KLE shows slight improvement, it still exhibits similar sensitivity issue since a new tuning parameter called temporal correlation range was introduced. Recently, in [4], additional effort was paid to illustrate the PCA-poisoning problem. To underline this problem, an evading strategy called Boiled-Frog was proposed which adds a high fraction of outliers to the traffic. To defend against this, the authors employed a more robust version of PCA called PCA-GRID. While PCA-GRID shows performance improvement regarding the robustness to the outliers, it experiences a high sensitivity to the threshold estimate and the k-dimensional subspace that maximizes the dispersion of the data. The purpose of this work is to consider another technique to address the PCA poisoning problems to provide robust traffic anomaly detection: The Principal Component Pursuit.","PeriodicalId":325024,"journal":{"name":"CoNEXT '10 Student Workshop","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"CoNEXT '10 Student Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1921206.1921217","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 28

Abstract

Principal component analysis (PCA) is a statistical technique that has been used for data analysis and dimensionality reduction. It was introduced as a network traffic anomaly detection technique firstly in [1]. Since then, a lot of research attention has been received, which results in an extensive analysis and several extensions. In [2], the sensitivity of PCA to its tuning parameters, such as the dimension of the low-rank subspace and the detection threshold, on traffic anomaly detection was indicated. However, no explanation on the underlying reasons of the problem was given in [2]. In [3], further investigation on the PCA sensitivity was conducted and it was found that the PCA sensitivity comes from the inability of PCA to detect temporal correlations. Based on this finding, an extension of PCA to Kalman-Loeve expansion (KLE) was proposed in [3]. While KLE shows slight improvement, it still exhibits similar sensitivity issue since a new tuning parameter called temporal correlation range was introduced. Recently, in [4], additional effort was paid to illustrate the PCA-poisoning problem. To underline this problem, an evading strategy called Boiled-Frog was proposed which adds a high fraction of outliers to the traffic. To defend against this, the authors employed a more robust version of PCA called PCA-GRID. While PCA-GRID shows performance improvement regarding the robustness to the outliers, it experiences a high sensitivity to the threshold estimate and the k-dimensional subspace that maximizes the dispersion of the data. The purpose of this work is to consider another technique to address the PCA poisoning problems to provide robust traffic anomaly detection: The Principal Component Pursuit.
基于主成分追踪的鲁棒交通异常检测
主成分分析(PCA)是一种用于数据分析和降维的统计技术。它作为一种网络流量异常检测技术在文献[1]中首次提出。从那时起,大量的研究得到了关注,这导致了广泛的分析和一些扩展。文献[2]表明了主成分分析对其调优参数(如低秩子空间维数和检测阈值)在交通异常检测中的敏感性。然而,文献[2]并没有对问题的根本原因做出解释。[3]对PCA的敏感性进行了进一步的研究,发现PCA的敏感性来自于PCA无法检测时间相关性。基于这一发现,[3]提出了将PCA扩展为Kalman-Loeve展开式(KLE)。虽然KLE略有改善,但由于引入了新的调优参数时间相关范围,仍然存在类似的灵敏度问题。最近,在[4]中,进一步努力说明了pca中毒问题。为了突出这一问题,提出了一种名为“沸腾青蛙”的规避策略,该策略在流量中添加了大量异常值。为了防止这种情况,作者采用了一种更健壮的PCA版本,称为PCA- grid。虽然PCA-GRID在对异常值的鲁棒性方面表现出性能改进,但它对阈值估计和k维子空间的灵敏度很高,从而最大化了数据的离散性。这项工作的目的是考虑另一种技术来解决PCA中毒问题,以提供鲁棒的流量异常检测:主成分追踪。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信