{"title":"Toward Efficient Neural Networks Through Predictor-Assisted NSGA-III for Anomaly Traffic Detection of IoT","authors":"Xinlei Wang;Mingshu He;Jiaxuan Wang;Xiaojuan Wang","doi":"10.1109/TCCN.2024.3355433","DOIUrl":null,"url":null,"abstract":"Edge computing enhances intrusion detection by extending its reach to smaller Internet of Things (IoT) devices and edge nodes, improving real-time detection and data privacy. However, due to the limited processing power and storage of edge nodes, lightweight models with high detection accuracy are urgently needed. The Neural Architecture Search (NAS) technique based on multi-objective genetic algorithms can simultaneously balance model complexity and performance, thus automatically designing models for fast and accurate detection. However, NAS requires training each model from scratch during the optimization process to obtain its detection accuracy as one of the fitness evaluation criteria. To address this limitation, we propose an efficient predictor-assisted NSGA-III algorithm. It uses proxy models to swiftly predict architecture accuracy, eliminating the need for complete training and greatly improving optimization efficiency. Furthermore, we have designed an innovative search space that allows for the reduction of internal channels and output feature map dimensions within the model, resulting in the creation of lightweight models with minimal impact on classification performance. The proposed method is validated by searching a narrower Pareto-optimal model of the competitive F1 score of 95.17% with 38.10 MB FLOPs on the UNSW-NB15 dataset. By adding a predictor, the number of optimizations per iteration increased, leading to faster convergence. Additionally, when comparing the search spaces before and after our design in the most complex structure (with 7 cells), the model’s classification error rate increased from 4.31% to 4.36%, while the FLOPs decreased from 271.13MB to 186.9MB.","PeriodicalId":13069,"journal":{"name":"IEEE Transactions on Cognitive Communications and Networking","volume":"10 3","pages":"982-995"},"PeriodicalIF":7.4000,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cognitive Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10403928/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Edge computing enhances intrusion detection by extending its reach to smaller Internet of Things (IoT) devices and edge nodes, improving real-time detection and data privacy. However, due to the limited processing power and storage of edge nodes, lightweight models with high detection accuracy are urgently needed. The Neural Architecture Search (NAS) technique based on multi-objective genetic algorithms can simultaneously balance model complexity and performance, thus automatically designing models for fast and accurate detection. However, NAS requires training each model from scratch during the optimization process to obtain its detection accuracy as one of the fitness evaluation criteria. To address this limitation, we propose an efficient predictor-assisted NSGA-III algorithm. It uses proxy models to swiftly predict architecture accuracy, eliminating the need for complete training and greatly improving optimization efficiency. Furthermore, we have designed an innovative search space that allows for the reduction of internal channels and output feature map dimensions within the model, resulting in the creation of lightweight models with minimal impact on classification performance. The proposed method is validated by searching a narrower Pareto-optimal model of the competitive F1 score of 95.17% with 38.10 MB FLOPs on the UNSW-NB15 dataset. By adding a predictor, the number of optimizations per iteration increased, leading to faster convergence. Additionally, when comparing the search spaces before and after our design in the most complex structure (with 7 cells), the model’s classification error rate increased from 4.31% to 4.36%, while the FLOPs decreased from 271.13MB to 186.9MB.
期刊介绍:
The IEEE Transactions on Cognitive Communications and Networking (TCCN) aims to publish high-quality manuscripts that push the boundaries of cognitive communications and networking research. Cognitive, in this context, refers to the application of perception, learning, reasoning, memory, and adaptive approaches in communication system design. The transactions welcome submissions that explore various aspects of cognitive communications and networks, focusing on innovative and holistic approaches to complex system design. Key topics covered include architecture, protocols, cross-layer design, and cognition cycle design for cognitive networks. Additionally, research on machine learning, artificial intelligence, end-to-end and distributed intelligence, software-defined networking, cognitive radios, spectrum sharing, and security and privacy issues in cognitive networks are of interest. The publication also encourages papers addressing novel services and applications enabled by these cognitive concepts.