{"title":"Optimization of Network Intrusion Detection System Using Genetic Algorithm with Improved Feature Selection Technique","authors":"E. Matel, Ariel M. Sison, Ruji P. Medina","doi":"10.1109/HNICEM48295.2019.9073439","DOIUrl":null,"url":null,"abstract":"The on-line environment is growing in unimaginable speed and scale. It offers convenience in our professional and personal life but on the other hand, exposes us to threats and danger in terms of network intrusion that eventually leads to invasion of privacy and other network security issues. This paper has proposed an optimization of the Network Intrusion Detection System (NIDS) using Genetic Algorithm with improved feature selection (GA-IFS) technique. GA-IFS monitors and analyzes network simulated activities using DARPA KDD Cup 99 dataset to efficiently identify normal and anomalous network traffic. GA is a search heuristic that is suitable for problems with large population size, however, it has drawbacks with regards to time taken for convergence which sometimes leads to local optima if lack of population diversity. This became an opportunity to enhance GA by integrating Support Vector Machine (SVM) classifier for selecting feature subset, reducing dataset dimensionality, identifying relevant features and improving the intrusion detection rate. Experimental results show the significant effect of dataset preprocessing procedure of GA-IFS in removing redundant and irrelevant records of 79.07% training data and 80.47% test data. With the implementation of improved feature selection, R2L got the highest improvement of 3.87% detection rate. This is followed by DOS with 3.33%, Normal with 2.76%, Probe with 1.22% and U2R with 0.91% compared to traditional GA.","PeriodicalId":6733,"journal":{"name":"2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management ( HNICEM )","volume":"1 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management ( HNICEM )","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM48295.2019.9073439","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
The on-line environment is growing in unimaginable speed and scale. It offers convenience in our professional and personal life but on the other hand, exposes us to threats and danger in terms of network intrusion that eventually leads to invasion of privacy and other network security issues. This paper has proposed an optimization of the Network Intrusion Detection System (NIDS) using Genetic Algorithm with improved feature selection (GA-IFS) technique. GA-IFS monitors and analyzes network simulated activities using DARPA KDD Cup 99 dataset to efficiently identify normal and anomalous network traffic. GA is a search heuristic that is suitable for problems with large population size, however, it has drawbacks with regards to time taken for convergence which sometimes leads to local optima if lack of population diversity. This became an opportunity to enhance GA by integrating Support Vector Machine (SVM) classifier for selecting feature subset, reducing dataset dimensionality, identifying relevant features and improving the intrusion detection rate. Experimental results show the significant effect of dataset preprocessing procedure of GA-IFS in removing redundant and irrelevant records of 79.07% training data and 80.47% test data. With the implementation of improved feature selection, R2L got the highest improvement of 3.87% detection rate. This is followed by DOS with 3.33%, Normal with 2.76%, Probe with 1.22% and U2R with 0.91% compared to traditional GA.
网络环境正以难以想象的速度和规模发展。它为我们的工作和个人生活提供了便利,但在另一方面,也使我们面临网络入侵的威胁和危险,最终导致侵犯隐私和其他网络安全问题。提出了一种基于遗传算法和改进特征选择技术的网络入侵检测系统优化方法。GA-IFS使用DARPA KDD Cup 99数据集监测和分析网络模拟活动,以有效识别正常和异常网络流量。遗传算法是一种搜索启发式算法,适用于大种群规模的问题,然而,它在收敛时间方面存在缺点,如果缺乏种群多样性,有时会导致局部最优。利用支持向量机(SVM)分类器选择特征子集、降低数据集维数、识别相关特征和提高入侵检测率,从而增强遗传算法。实验结果表明,GA-IFS数据集预处理程序对79.07%的训练数据和80.47%的测试数据的冗余和不相关记录的去除效果显著。改进特征选择后,R2L的检测率提高最高,达到3.87%。与传统遗传相比,DOS为3.33%,Normal为2.76%,Probe为1.22%,U2R为0.91%。