Improving Intrusion Detection System by Estimating Parameters of Random Forest in Boruta

A. N. Iman, T. Ahmad
Published in: 2020 International Conference on Smart Technology and Applications (ICoSTA)
Publication date: 2020-02-01
DOI: 10.1109/ICoSTA48221.2020.1570609975
URL: https://doi.org/10.1109/ICoSTA48221.2020.1570609975
Citations: 14

Abstract

Intrusion Detection Systems (IDS) are developed to address the security problems of computer networks by identifying attacks. Two types of IDS are built, depending on the environment: signature-based and anomaly-based. The anomaly-based type can identify previously unknown attacks. Machine learning is one possible way to develop such an IDS model; it comprises many processes, including feature selection. The Boruta algorithm is a feature selection method that is well suited to machine learning. However, when applied to the NSL-KDD dataset, the algorithm runs into an infinite loop problem. This paper presents an analysis and estimation of the random forest parameters, specifically the depth and the number of trees; additionally, the use of entropy and the Gini index as the z-score in the Boruta algorithm is considered. The experimental results show that the proposed method can prevent the infinite loop, which indirectly improves the performance of the existing algorithm.
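The two impurity measures named in the abstract, entropy and the Gini index, can be sketched as follows. This is a minimal illustration using only the Python standard library. The function names and the toy labels are assumptions made for this example; it is not the authors' implementation, and it does not reproduce how Boruta turns importances into a z-score.

```python
import math
from collections import Counter


def gini_impurity(labels):
    """Gini impurity of a label list: 1 - sum(p_k^2) over class proportions p_k."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def entropy(labels):
    """Shannon entropy of a label list in bits: sum(-p_k * log2(p_k))."""
    n = len(labels)
    if n == 0:
        return 0.0
    return sum(-(count / n) * math.log2(count / n)
               for count in Counter(labels).values())


# Hypothetical toy labels: a perfectly mixed node and a pure node.
mixed = ["normal", "normal", "attack", "attack"]
pure = ["attack", "attack", "attack"]

print(gini_impurity(mixed), entropy(mixed))  # 0.5 1.0
print(gini_impurity(pure), entropy(pure))    # 0.0 0.0
```

Both measures are zero for a pure node and largest for an evenly mixed one, which is why either can serve as a split criterion when growing the trees of a random forest.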