Analysis of data pre-processing influence on intrusion detection using NSL-KDD dataset

2017 Open Conference of Electrical, Electronic and Information Sciences (eStream) Pub Date : 2017-04-01 DOI:10.1109/ESTREAM.2017.7950325

N. Paulauskas, Juozas Auskalnis

引用次数: 56

Abstract

Data pre-processing for machine learning methods is key step for knowledge discovery process. Depending on nature of the data, pre-processing might take the majority time of data analysis. Correctly prepared data for processing guarantees precise and reliable results of data analysis. This paper analyses initial data pre-processing influence to attack detection accuracy by using Decision Trees, Naïve Bayes and Rule-Based classifiers with NSL-KDD dataset. In addition, the results of detected attacks accuracy dependency by selecting different attacks grouping options and using ensembles of various classifiers are presented.

查看原文本刊更多论文

基于NSL-KDD数据集的数据预处理对入侵检测的影响分析

机器学习方法的数据预处理是知识发现过程的关键步骤。根据数据的性质，预处理可能会占用数据分析的大部分时间。正确准备数据进行处理，保证数据分析结果准确可靠。针对NSL-KDD数据集，采用决策树、Naïve贝叶斯和基于规则的分类器分析了初始数据预处理对攻击检测精度的影响。此外，通过选择不同的攻击分组选项和使用各种分类器的集合，给出了检测到的攻击准确度依赖关系的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 Open Conference of Electrical, Electronic and Information Sciences (eStream)

自引率

0.00%

发文量