Optimized Random Forest Model for Botnet Detection Based on DNS Queries

2020 32nd International Conference on Microelectronics (ICM) | Pub Date: 2020-12-14 | DOI: 10.1109/ICM50269.2020.9331819

Abdallah Moubayed, M. Injadat, A. Shami


Citations: 10

Abstract

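The Domain Name System (DNS) protocol plays a major role in today's Internet because it translates between website names and their corresponding IP addresses. However, because it lacks mechanisms for data integrity and origin authentication, the DNS protocol has several security vulnerabilities. These often lead to a variety of cyber-attacks, including botnet attacks. One promising way to detect DNS-based botnet attacks is to adopt machine learning (ML) based solutions. To that end, this paper proposes a novel optimized ML-based framework to detect botnets based on their corresponding DNS queries. More specifically, the framework uses information gain as a feature selection method and a genetic algorithm (GA) as a hyper-parameter optimization method to tune the parameters of a random forest (RF) classifier. The proposed framework is evaluated using the state-of-the-art TI-2016 DNS dataset. Experimental results show that the proposed optimized framework reduced the feature set size by up to 60%. Moreover, it achieved higher detection accuracy, precision, recall, and F-score than the default classifier. This highlights the effectiveness and robustness of the proposed framework in detecting botnet attacks.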
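The abstract names the two building blocks (information gain for feature selection, a genetic algorithm for hyper-parameter tuning) but gives no implementation details. The following is only a minimal sketch of how the two steps might look, using the Python standard library. Everything specific in it is an assumption for illustration: the feature names, the toy labels, the `SEARCH_SPACE` ranges, the GA settings, and `toy_fitness`, which stands in for the cross-validated accuracy of a random forest. None of these values come from the paper or from the TI-2016 dataset.

```python
import math
import random
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(feature, labels):
    """IG(Y; X) = H(Y) - H(Y | X) for one discrete feature column."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def select_features(columns, labels, keep):
    """Return the names of the `keep` features with the highest information gain."""
    ranked = sorted(columns, key=lambda name: information_gain(columns[name], labels),
                    reverse=True)
    return ranked[:keep]


# Hypothetical search space for two random-forest hyper-parameters (assumed ranges).
SEARCH_SPACE = {
    "n_estimators": list(range(10, 201, 10)),
    "max_depth": list(range(2, 21)),
}


def genetic_search(fitness, generations=20, population_size=12, mutation_rate=0.2, seed=0):
    """Maximise fitness(params) over SEARCH_SPACE with a basic genetic algorithm."""
    rng = random.Random(seed)
    population = [{k: rng.choice(v) for k, v in SEARCH_SPACE.items()}
                  for _ in range(population_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: population_size // 2]  # truncation selection, parents survive
        children = []
        while len(parents) + len(children) < population_size:
            a, b = rng.sample(parents, 2)
            child = {k: rng.choice((a[k], b[k])) for k in SEARCH_SPACE}  # uniform crossover
            for k in SEARCH_SPACE:
                if rng.random() < mutation_rate:  # random-reset mutation
                    child[k] = rng.choice(SEARCH_SPACE[k])
            children.append(child)
        population = parents + children
    return max(population, key=fitness)


# Toy, made-up data: two discretised DNS-query features and a class label per query.
columns = {
    "query_length_bucket": ["long", "long", "short", "short", "long", "short"],
    "ttl_bucket": ["low", "high", "low", "high", "low", "high"],
}
labels = ["bot", "bot", "benign", "benign", "bot", "benign"]


def toy_fitness(params):
    """Stand-in for cross-validated RF accuracy; a real run would train a classifier here."""
    return -abs(params["n_estimators"] - 100) / 100 - abs(params["max_depth"] - 10) / 10


selected = select_features(columns, labels, keep=1)  # ["query_length_bucket"]
best = genetic_search(toy_fitness)
```

In a real pipeline, `toy_fitness` would be replaced by a function that trains a random forest on the selected features with the candidate hyper-parameters and returns a validation score, so each fitness evaluation is far more expensive than in this sketch.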


