特征熵估计(FEE)用于恶意物联网流量和机器学习检测

Mob. Inf. Syst. Pub Date : 2021-12-28 DOI:10.1155/2021/8091363

Tarun Dhar Diwan, Siddartha Choubey, H. Hota, S. B. Goyal, Sajjad Shaukat Jamal, P. Shukla, B. Tiwari

{"title":"特征熵估计(FEE)用于恶意物联网流量和机器学习检测","authors":"Tarun Dhar Diwan, Siddartha Choubey, H. Hota, S. B. Goyal, Sajjad Shaukat Jamal, P. Shukla, B. Tiwari","doi":"10.1155/2021/8091363","DOIUrl":null,"url":null,"abstract":"Identification of anomaly and malicious traffic in the Internet of things (IoT) network is essential for IoT security. Tracking and blocking unwanted traffic flows in the IoT network is required to design a framework for the identification of attacks more accurately, quickly, and with less complexity. Many machine learning (ML) algorithms proved their efficiency to detect intrusion in IoT networks. But this ML algorithm suffers many misclassification problems due to inappropriate and irrelevant feature size. In this paper, an in-depth study is presented to address such issues. We have presented lightweight low-cost feature selection IoT intrusion detection techniques with low complexity and high accuracy due to their low computational time. A novel feature selection technique was proposed with the integration of rank-based chi-square, Pearson correlation, and score correlation to extract relevant features out of all available features from the dataset. Then, feature entropy estimation was applied to validate the relationship among all extracted features to identify malicious traffic in IoT networks. Finally, an extreme gradient ensemble boosting approach was used to classify the features in relevant attack types. The simulation is performed on three datasets, i.e., NSL-KDD, USNW-NB15, and CCIDS2017, and results are presented on different test sets. It was observed that on the NSL-KDD dataset, accuracy was approx. 97.48%. Similarly, the accuracy of USNW-NB15 and CCIDS2017 was approx. 99.96% and 99.93%, respectively. Along with that, state-of-the-art comparison is also presented with existing techniques.","PeriodicalId":18790,"journal":{"name":"Mob. Inf. Syst.","volume":"54 1","pages":"8091363:1-8091363:13"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Feature Entropy Estimation (FEE) for Malicious IoT Traffic and Detection Using Machine Learning\",\"authors\":\"Tarun Dhar Diwan, Siddartha Choubey, H. Hota, S. B. Goyal, Sajjad Shaukat Jamal, P. Shukla, B. Tiwari\",\"doi\":\"10.1155/2021/8091363\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Identification of anomaly and malicious traffic in the Internet of things (IoT) network is essential for IoT security. Tracking and blocking unwanted traffic flows in the IoT network is required to design a framework for the identification of attacks more accurately, quickly, and with less complexity. Many machine learning (ML) algorithms proved their efficiency to detect intrusion in IoT networks. But this ML algorithm suffers many misclassification problems due to inappropriate and irrelevant feature size. In this paper, an in-depth study is presented to address such issues. We have presented lightweight low-cost feature selection IoT intrusion detection techniques with low complexity and high accuracy due to their low computational time. A novel feature selection technique was proposed with the integration of rank-based chi-square, Pearson correlation, and score correlation to extract relevant features out of all available features from the dataset. Then, feature entropy estimation was applied to validate the relationship among all extracted features to identify malicious traffic in IoT networks. Finally, an extreme gradient ensemble boosting approach was used to classify the features in relevant attack types. The simulation is performed on three datasets, i.e., NSL-KDD, USNW-NB15, and CCIDS2017, and results are presented on different test sets. It was observed that on the NSL-KDD dataset, accuracy was approx. 97.48%. Similarly, the accuracy of USNW-NB15 and CCIDS2017 was approx. 99.96% and 99.93%, respectively. Along with that, state-of-the-art comparison is also presented with existing techniques.\",\"PeriodicalId\":18790,\"journal\":{\"name\":\"Mob. Inf. Syst.\",\"volume\":\"54 1\",\"pages\":\"8091363:1-8091363:13\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mob. Inf. Syst.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1155/2021/8091363\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mob. Inf. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2021/8091363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

摘要

识别物联网(IoT)网络中的异常和恶意流量对于物联网安全至关重要。跟踪和阻止物联网网络中不需要的流量是设计一个框架的必要条件，以便更准确、更快速、更低复杂性地识别攻击。许多机器学习(ML)算法证明了它们在检测物联网网络入侵方面的效率。但是由于特征大小不合适和不相关，这种ML算法存在许多误分类问题。在本文中，提出了一个深入的研究来解决这些问题。我们提出了一种轻量级的低成本特征选择物联网入侵检测技术，由于其计算时间短，具有低复杂度和高精度。提出了一种新的特征选择技术，将基于秩的卡方、Pearson相关和分数相关相结合，从数据集中的所有可用特征中提取出相关特征。然后，应用特征熵估计验证提取的所有特征之间的关系，以识别物联网网络中的恶意流量。最后，采用极端梯度集成增强方法对相关攻击类型的特征进行分类。在NSL-KDD、USNW-NB15和CCIDS2017三个数据集上进行了仿真，并给出了不同测试集上的结果。观察到，在NSL-KDD数据集上，精度约为。97.48%。同样，USNW-NB15和CCIDS2017的精度约为。分别为99.96%和99.93%。与此同时，还与现有技术进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Feature Entropy Estimation (FEE) for Malicious IoT Traffic and Detection Using Machine Learning

Identification of anomaly and malicious traffic in the Internet of things (IoT) network is essential for IoT security. Tracking and blocking unwanted traffic flows in the IoT network is required to design a framework for the identification of attacks more accurately, quickly, and with less complexity. Many machine learning (ML) algorithms proved their efficiency to detect intrusion in IoT networks. But this ML algorithm suffers many misclassification problems due to inappropriate and irrelevant feature size. In this paper, an in-depth study is presented to address such issues. We have presented lightweight low-cost feature selection IoT intrusion detection techniques with low complexity and high accuracy due to their low computational time. A novel feature selection technique was proposed with the integration of rank-based chi-square, Pearson correlation, and score correlation to extract relevant features out of all available features from the dataset. Then, feature entropy estimation was applied to validate the relationship among all extracted features to identify malicious traffic in IoT networks. Finally, an extreme gradient ensemble boosting approach was used to classify the features in relevant attack types. The simulation is performed on three datasets, i.e., NSL-KDD, USNW-NB15, and CCIDS2017, and results are presented on different test sets. It was observed that on the NSL-KDD dataset, accuracy was approx. 97.48%. Similarly, the accuracy of USNW-NB15 and CCIDS2017 was approx. 99.96% and 99.93%, respectively. Along with that, state-of-the-art comparison is also presented with existing techniques.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Mob. Inf. Syst.

自引率

0.00%

发文量