Incremental anomaly-based intrusion detection system using limited labeled data

2017 3th International Conference on Web Research (ICWR) Pub Date : 2017-04-01 DOI:10.1109/ICWR.2017.7959324

Parisa Alaei, Fakhroddin Noorbehbahani

{"title":"Incremental anomaly-based intrusion detection system using limited labeled data","authors":"Parisa Alaei, Fakhroddin Noorbehbahani","doi":"10.1109/ICWR.2017.7959324","DOIUrl":null,"url":null,"abstract":"With the proliferation of the internet and increased global access to online media, cybercrime is also occurring at an increasing rate. Currently, both personal users and companies are vulnerable to cybercrime. A number of tools including firewalls and Intrusion Detection Systems (IDS) can be used as defense mechanisms. A firewall acts as a checkpoint which allows packets to pass through according to predetermined conditions. In extreme cases, it may even disconnect all network traffic. An IDS, on the other hand, automates the monitoring process in computer networks. The streaming nature of data in computer networks poses a significant challenge in building IDS. In this paper, a method is proposed to overcome this problem by performing online classification on datasets. In doing so, an incremental naive Bayesian classifier is employed. Furthermore, active learning enables solving the problem using a small set of labeled data points which are often very expensive to acquire. The proposed method includes two groups of actions i.e. offline and online. The former involves data preprocessing while the latter introduces the NADAL online method. The proposed method is compared to the incremental naive Bayesian classifier using the NSL-KDD standard dataset. There are three advantages with the proposed method: (1) overcoming the streaming data challenge; (2) reducing the high cost associated with instance labeling; and (3) improved accuracy and Kappa compared to the incremental naive Bayesian approach. Thus, the method is well-suited to IDS applications.","PeriodicalId":304897,"journal":{"name":"2017 3th International Conference on Web Research (ICWR)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 3th International Conference on Web Research (ICWR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWR.2017.7959324","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 27

Abstract

With the proliferation of the internet and increased global access to online media, cybercrime is also occurring at an increasing rate. Currently, both personal users and companies are vulnerable to cybercrime. A number of tools including firewalls and Intrusion Detection Systems (IDS) can be used as defense mechanisms. A firewall acts as a checkpoint which allows packets to pass through according to predetermined conditions. In extreme cases, it may even disconnect all network traffic. An IDS, on the other hand, automates the monitoring process in computer networks. The streaming nature of data in computer networks poses a significant challenge in building IDS. In this paper, a method is proposed to overcome this problem by performing online classification on datasets. In doing so, an incremental naive Bayesian classifier is employed. Furthermore, active learning enables solving the problem using a small set of labeled data points which are often very expensive to acquire. The proposed method includes two groups of actions i.e. offline and online. The former involves data preprocessing while the latter introduces the NADAL online method. The proposed method is compared to the incremental naive Bayesian classifier using the NSL-KDD standard dataset. There are three advantages with the proposed method: (1) overcoming the streaming data challenge; (2) reducing the high cost associated with instance labeling; and (3) improved accuracy and Kappa compared to the incremental naive Bayesian approach. Thus, the method is well-suited to IDS applications.

查看原文本刊更多论文

基于有限标记数据的增量异常入侵检测系统

随着互联网的普及和全球网络媒体的普及，网络犯罪也在以越来越高的速度发生。目前，个人用户和公司都很容易受到网络犯罪的攻击。包括防火墙和入侵检测系统(IDS)在内的许多工具都可以用作防御机制。防火墙充当检查点，允许数据包根据预定条件通过。在极端情况下，它甚至可能断开所有网络流量。另一方面，入侵检测系统使计算机网络中的监控过程自动化。计算机网络中数据的流性质对构建入侵检测系统提出了重大挑战。本文提出了一种通过对数据集进行在线分类来克服这一问题的方法。在此过程中，使用了增量朴素贝叶斯分类器。此外，主动学习可以使用一小部分标记数据点来解决问题，而这些数据点通常是非常昂贵的。该方法包括离线和在线两组动作。前者涉及数据预处理，后者引入了NADAL在线方法。将该方法与使用NSL-KDD标准数据集的增量朴素贝叶斯分类器进行了比较。该方法有三个优点:(1)克服了流数据的挑战;(2)降低与实例标记相关的高成本;(3)与增量朴素贝叶斯方法相比，提高了准确率和Kappa。因此，该方法非常适合IDS应用程序。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 3th International Conference on Web Research (ICWR)

自引率

0.00%

发文量