Network Anomaly Detection by Means of Machine Learning: Random Forest Approach with Apache Spark

Informatica economica Pub Date : 2018-10-01 DOI:10.12948/issn14531305/22.4.2018.08

Hesamaldin Hajialian, Cristian Toma

引用次数: 3

Abstract

Nowadays the network security is a crucial issue and traditional intrusion detection systems are not a sufficient way. Hence the intelligent detection systems should have a major role in network security by taking into consideration to process the network big data and predict the anomalies behavior as fast as possible. In this paper, we implemented a well-known supervised algorithm Random Forest Classifier with Apache Spark on NSL-KDD dataset provided by the University of New Brunswick with the accuracy of 78.69% and 35.2% false negative ratio. Empirical results show this approach is well in order to use for intrusion detection system as well as we seeking the best number of trees to be used on Random Forest Classifier for getting higher accuracy and lower cost for the intrusion detection system.

查看原文本刊更多论文

基于机器学习的网络异常检测:使用Apache Spark的随机森林方法

当前，网络安全已成为一个重要的问题，传统的入侵检测系统已不能满足要求。因此，考虑到如何处理网络大数据并尽快预测异常行为，智能检测系统应该在网络安全中发挥重要作用。本文利用Apache Spark在New Brunswick大学提供的NSL-KDD数据集上实现了一种著名的监督算法Random Forest Classifier，准确率为78.69%，假阴性率为35.2%。实验结果表明，该方法可以很好地用于入侵检测系统，同时我们也在寻找随机森林分类器上使用的最佳树数，以获得更高的准确率和更低的入侵检测系统成本。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Informatica economica

自引率

0.00%

发文量

审稿时长

8 weeks