Guardians of the Network: An Ensemble Learning Framework With Adversarial Alignment for Evasive Cyber Threat Detection

Impact factor: 2.0 | JCR quartile: Q3 | Category: Computer Science, Interdisciplinary Applications

Engineering reports : open access | Pub Date: 2025-09-30 | DOI: 10.1002/eng2.70419

Khandakar Md Shafin, G. M. Abdullah Al Kafi, Saha Reno

Volume 7, Issue 10. Full text: https://onlinelibrary.wiley.com/doi/10.1002/eng2.70419

Citations: 0

Abstract

Advanced cyber threats such as zero-day exploits and sophisticated evasion techniques challenge Network Intrusion Detection Systems (NIDS). To address this, we propose a robust machine learning framework that integrates multi-source data fusion, protocol-aware preprocessing, and ensemble learning. Our study uses a comprehensive dataset of 12.7 million real-world network flows (10.1M benign, 2.6M malicious) collected from enterprise environments. Our key innovation is a weighted voting ensemble—combining Logistic Regression, Decision Trees, and a 1D-CNN—which achieves 99.8% detection accuracy while reducing false positives by 4.9% compared to individual models. The system also incorporates a lightweight adversarial aligner to counter evasion techniques (e.g., IP fragmentation, MAC spoofing), recovering up to 95% of baseline recall. Notably, under extreme class imbalance (1:99), our framework maintains 80.1% recall with only 8.2 false positives per million packets, outperforming deep learning models like LSTM and 1D-CNN while using 100 times fewer parameters. These results demonstrate the framework's practicality for efficient, high-throughput NIDS deployments in real-world settings.
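The weighted voting idea in the abstract can be illustrated with a minimal sketch. It assumes soft voting over per-model malicious-class probabilities. The abstract does not say whether the authors use hard or soft voting, nor which weights or decision threshold they chose, so the function names, the weights, the threshold of 0.5, and the sample scores below are all hypothetical. The second helper shows how recall and a false-positive rate per million evaluated samples might be computed. The paper reports false positives per million packets, so treating each sample as one unit is also an assumption.

```python
from typing import Sequence


def weighted_soft_vote(
    probas: Sequence[Sequence[float]],
    weights: Sequence[float],
    threshold: float = 0.5,
) -> list[int]:
    """Combine per-model malicious probabilities into 0/1 labels.

    probas[m][i] is model m's estimated probability that flow i is malicious.
    weights[m] is the (hypothetical) voting weight of model m.
    """
    if len(probas) != len(weights):
        raise ValueError("need exactly one weight per model")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    labels = []
    for i in range(len(probas[0])):
        # Weighted average of the models' scores for flow i.
        score = sum(w * p[i] for w, p in zip(weights, probas)) / total
        labels.append(1 if score >= threshold else 0)
    return labels


def recall_and_fp_per_million(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> tuple[float, float]:
    """Return (recall, false positives per million evaluated samples)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    fp_per_million = fp / len(y_true) * 1_000_000
    return recall, fp_per_million


# Illustrative scores for four flows from three models (made-up numbers).
lr_scores = [0.9, 0.2, 0.6, 0.1]   # Logistic Regression
dt_scores = [1.0, 0.0, 0.0, 0.0]   # Decision Tree
cnn_scores = [0.8, 0.4, 0.7, 0.3]  # 1D-CNN

y_pred = weighted_soft_vote([lr_scores, dt_scores, cnn_scores], [0.2, 0.3, 0.5])
y_true = [1, 0, 1, 0]
recall, fp_rate = recall_and_fp_per_million(y_true, y_pred)
```

In this toy input the third flow is missed because the Decision Tree's confident "benign" vote pulls the weighted score to 0.47, just under the threshold. That is the kind of recall versus false-positive trade-off that the choice of weights and threshold controls.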




Source journal