Attention-driven multi-model architecture for unbalanced network traffic intrusion detection via extreme gradient boosting

Oluwadamilare Harazeem Abdulganiyu , Taha Ait Tchakoucht , Ahmed El Hilali Alaoui , Yakub Kayode Saheed
{"title":"Attention-driven multi-model architecture for unbalanced network traffic intrusion detection via extreme gradient boosting","authors":"Oluwadamilare Harazeem Abdulganiyu ,&nbsp;Taha Ait Tchakoucht ,&nbsp;Ahmed El Hilali Alaoui ,&nbsp;Yakub Kayode Saheed","doi":"10.1016/j.iswa.2025.200519","DOIUrl":null,"url":null,"abstract":"<div><div>Network Intrusion Detection Systems (NIDS) face significant challenges in identifying rare attack instances due to the inherent class imbalance and diversity in network traffic. This imbalance, often characterized by a dominance of benign network traffic data, reduces the effectiveness of traditional detection methods. To address this, we proposed CWFLAM-VAE, an attention-driven multi-model architecture that combines Class-Wise Focal Loss, Variational Autoencoder, and Extreme Gradient Boosting. CWFLAM-VAE generates synthetic rare-class attack data while preserving the original feature distribution, mitigating imbalance and improving classification performance. The effectiveness of our proposed system was evaluated by employing two datasets, one of which is the NSL-KDD, which exhibits a skewed distribution of network traffic favoring the majority class, and CSE-CIC-IDS2018 dataset, where approximately 83 % of the data consists of benign network traffic. We compared our method with existing sampling techniques (SMOTE, ROS, ADASYN, RUS) and existing classifiers (Logistic Regression, KNN, SVM, Decision Tree, LSTM, CNN). The experimental findings distinctly reveal the efficacy of the CWFLAM-VAE in resolving class imbalance concerns, with Extreme Gradient Boosting surpassing alternative machine learning techniques particularly in the detection of rare instances of attack traffic with an f-score of 97.6 % and 98.1 %, as well as a false positive rate of 0.17 and 0.27 for both data respectively.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"26 ","pages":"Article 200519"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Systems with Applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667305325000456","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Network Intrusion Detection Systems (NIDS) face significant challenges in identifying rare attack instances due to the inherent class imbalance and diversity in network traffic. This imbalance, often characterized by a dominance of benign network traffic data, reduces the effectiveness of traditional detection methods. To address this, we proposed CWFLAM-VAE, an attention-driven multi-model architecture that combines Class-Wise Focal Loss, Variational Autoencoder, and Extreme Gradient Boosting. CWFLAM-VAE generates synthetic rare-class attack data while preserving the original feature distribution, mitigating imbalance and improving classification performance. The effectiveness of our proposed system was evaluated by employing two datasets, one of which is the NSL-KDD, which exhibits a skewed distribution of network traffic favoring the majority class, and CSE-CIC-IDS2018 dataset, where approximately 83 % of the data consists of benign network traffic. We compared our method with existing sampling techniques (SMOTE, ROS, ADASYN, RUS) and existing classifiers (Logistic Regression, KNN, SVM, Decision Tree, LSTM, CNN). The experimental findings distinctly reveal the efficacy of the CWFLAM-VAE in resolving class imbalance concerns, with Extreme Gradient Boosting surpassing alternative machine learning techniques particularly in the detection of rare instances of attack traffic with an f-score of 97.6 % and 98.1 %, as well as a false positive rate of 0.17 and 0.27 for both data respectively.
基于极大梯度增强的注意力驱动多模型非平衡网络流量入侵检测体系结构
由于网络流量固有的类别不平衡和多样性,网络入侵检测系统(NIDS)在识别罕见攻击实例方面面临着巨大挑战。这种不平衡通常表现为良性网络流量数据占主导地位,从而降低了传统检测方法的有效性。为解决这一问题,我们提出了 CWFLAM-VAE,这是一种注意力驱动的多模型架构,结合了类智焦点损失、变异自动编码器和极梯度提升技术。CWFLAM-VAE 在生成合成稀有类攻击数据的同时保留了原始特征分布,从而减轻了不平衡性并提高了分类性能。我们采用了两个数据集来评估我们提出的系统的有效性,其中一个是 NSL-KDD,该数据集的网络流量分布偏向于多数类;另一个是 CSE-CIC-IDS2018 数据集,其中约 83% 的数据由良性网络流量组成。我们将我们的方法与现有的采样技术(SMOTE、ROS、ADASYN、RUS)和现有的分类器(逻辑回归、KNN、SVM、决策树、LSTM、CNN)进行了比较。实验结果明显揭示了 CWFLAM-VAE 在解决类不平衡问题方面的功效,尤其是在检测攻击流量的罕见实例方面,极端梯度提升技术超越了其他机器学习技术,在两种数据中的 f 分数分别为 97.6 % 和 98.1 %,误报率分别为 0.17 和 0.27。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信