{"title":"A comprehensive survey on intrusion detection algorithms","authors":"Yang Li , Zhengming Li , Mengyao Li","doi":"10.1016/j.compeleceng.2024.109863","DOIUrl":null,"url":null,"abstract":"<div><div>Although there are many reviews on Intrusion Detection Systems (IDS), the basic parts of Intrusion Detection Algorithms (IDA), such as imbalanced datasets, feature engineering, and model design, have not been fully studied. This review thoroughly examines modern IDA, emphasizing recent progress, current challenges, and potential future research paths. First, we explore three different strategies to handle imbalanced datasets: resampling, Synthetic Minority Over-sampling Technique (SMOTE), Generative Adversarial Networks (GAN). Next, we examine a few key feature extraction techniques, including Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Autoencoder (AE), among others. Additionally, we explore filtering, wrapper, and embedded methods for feature selection. Then, we explore model design approaches for IDA, considering both ensemble and non-ensemble learning methods. We provide a thorough assessment of ensemble techniques: bagging, boosting, and stacking. We also evaluate a variety of non-ensemble methods, including Naive Bayes (NB), K-Nearest Neighbors (KNN), Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), among others. Finally, we briefly outline relevant applications, challenges and future research directions. This survey will serve as a valuable resource for researchers and practitioners, and foster the advancement of IDA technology.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"121 ","pages":"Article 109863"},"PeriodicalIF":4.0000,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790624007900","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Although there are many reviews on Intrusion Detection Systems (IDS), the basic parts of Intrusion Detection Algorithms (IDA), such as imbalanced datasets, feature engineering, and model design, have not been fully studied. This review thoroughly examines modern IDA, emphasizing recent progress, current challenges, and potential future research paths. First, we explore three different strategies to handle imbalanced datasets: resampling, Synthetic Minority Over-sampling Technique (SMOTE), Generative Adversarial Networks (GAN). Next, we examine a few key feature extraction techniques, including Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Autoencoder (AE), among others. Additionally, we explore filtering, wrapper, and embedded methods for feature selection. Then, we explore model design approaches for IDA, considering both ensemble and non-ensemble learning methods. We provide a thorough assessment of ensemble techniques: bagging, boosting, and stacking. We also evaluate a variety of non-ensemble methods, including Naive Bayes (NB), K-Nearest Neighbors (KNN), Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), among others. Finally, we briefly outline relevant applications, challenges and future research directions. This survey will serve as a valuable resource for researchers and practitioners, and foster the advancement of IDA technology.
尽管有许多关于入侵检测系统(IDS)的综述,但对入侵检测算法(IDA)的基本部分,如不平衡数据集、特征工程和模型设计,还没有进行充分的研究。本综述深入研究了现代 IDA,强调了最新进展、当前挑战和潜在的未来研究路径。首先,我们探讨了处理不平衡数据集的三种不同策略:重采样、合成少数群体过度采样技术(SMOTE)和生成对抗网络(GAN)。接下来,我们将研究几种关键的特征提取技术,包括主成分分析(PCA)、线性判别分析(LDA)、自动编码器(AE)等。此外,我们还探讨了用于特征选择的过滤、包装和嵌入方法。然后,我们探讨了 IDA 的模型设计方法,同时考虑了集合学习和非集合学习方法。我们对集合技术进行了全面的评估:bagging、boosting 和 stacking。我们还评估了各种非集合方法,包括奈维贝叶(NB)、K-近邻(KNN)、卷积神经网络(CNN)、循环神经网络(RNN)等。最后,我们简要概述了相关应用、挑战和未来研究方向。本调查报告将成为研究人员和从业人员的宝贵资源,并促进 IDA 技术的发展。
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.