针对无线局域网入侵检测的网络事件记录标注

2006 IEEE International Conference on Information Reuse & Integration Pub Date : 2006-12-04 DOI:10.1109/IRI.2006.252413

T. Khoshgoftaar, Chris Seiffert, Naeem Seliya

{"title":"针对无线局域网入侵检测的网络事件记录标注","authors":"T. Khoshgoftaar, Chris Seiffert, Naeem Seliya","doi":"10.1109/IRI.2006.252413","DOIUrl":null,"url":null,"abstract":"A data mining approach to network intrusion detection requires a training dataset of network event records labeled as either normal or an attack type. Since there are too many events to track, such datasets are typically very large. This is particularly so in a WLAN where number of non-wired devices communicating with the WLAN can be too many and adhoc. This creates a problem for the domain expert in labeling all records in the training dataset which is used to train a machine learner. We present a simple approach by which the number of network records the expert has to examine is a relatively small proportion of the given training dataset. A clustering algorithm is used to form relatively coherent groups which the expert examines as a whole to label records as one of four classes: red (definite intrusion), yellow (possibly intrusion), blue (probably normal), and green (definite normal). An ensemble classifier-based data cleansing approach is then used to detect records that were likely mislabeled by the expert. The proposed approach is investigated with a case study of a real-world WLAN. An ensemble classifier-based intrusion detection model built using the labeled training dataset demonstrates the effectiveness of the labeling approach and the good generalization accuracy","PeriodicalId":402255,"journal":{"name":"2006 IEEE International Conference on Information Reuse & Integration","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Labeling Network Event Records for Intrusion Detection in aWireless LAN\",\"authors\":\"T. Khoshgoftaar, Chris Seiffert, Naeem Seliya\",\"doi\":\"10.1109/IRI.2006.252413\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A data mining approach to network intrusion detection requires a training dataset of network event records labeled as either normal or an attack type. Since there are too many events to track, such datasets are typically very large. This is particularly so in a WLAN where number of non-wired devices communicating with the WLAN can be too many and adhoc. This creates a problem for the domain expert in labeling all records in the training dataset which is used to train a machine learner. We present a simple approach by which the number of network records the expert has to examine is a relatively small proportion of the given training dataset. A clustering algorithm is used to form relatively coherent groups which the expert examines as a whole to label records as one of four classes: red (definite intrusion), yellow (possibly intrusion), blue (probably normal), and green (definite normal). An ensemble classifier-based data cleansing approach is then used to detect records that were likely mislabeled by the expert. The proposed approach is investigated with a case study of a real-world WLAN. An ensemble classifier-based intrusion detection model built using the labeled training dataset demonstrates the effectiveness of the labeling approach and the good generalization accuracy\",\"PeriodicalId\":402255,\"journal\":{\"name\":\"2006 IEEE International Conference on Information Reuse & Integration\",\"volume\":\"57 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 IEEE International Conference on Information Reuse & Integration\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRI.2006.252413\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE International Conference on Information Reuse & Integration","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2006.252413","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

网络入侵检测的数据挖掘方法需要一个标记为正常或攻击类型的网络事件记录的训练数据集。由于有太多的事件需要跟踪，这样的数据集通常非常大。这在WLAN中尤其如此，因为与WLAN通信的非有线设备数量可能太多而且是临时的。这给领域专家在标记用于训练机器学习的训练数据集中的所有记录带来了一个问题。我们提出了一种简单的方法，通过这种方法，专家必须检查的网络记录的数量是给定训练数据集的相对较小的比例。使用聚类算法形成相对连贯的组，专家将这些组作为一个整体进行检查，并将记录标记为四类之一:红色(确定入侵)，黄色(可能入侵)，蓝色(可能正常)和绿色(确定正常)。然后使用基于集成分类器的数据清理方法来检测可能被专家错误标记的记录。本文以实际无线局域网为例对所提出的方法进行了研究。利用标记训练数据集建立了基于集成分类器的入侵检测模型，验证了标记方法的有效性和良好的泛化精度

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Labeling Network Event Records for Intrusion Detection in aWireless LAN

A data mining approach to network intrusion detection requires a training dataset of network event records labeled as either normal or an attack type. Since there are too many events to track, such datasets are typically very large. This is particularly so in a WLAN where number of non-wired devices communicating with the WLAN can be too many and adhoc. This creates a problem for the domain expert in labeling all records in the training dataset which is used to train a machine learner. We present a simple approach by which the number of network records the expert has to examine is a relatively small proportion of the given training dataset. A clustering algorithm is used to form relatively coherent groups which the expert examines as a whole to label records as one of four classes: red (definite intrusion), yellow (possibly intrusion), blue (probably normal), and green (definite normal). An ensemble classifier-based data cleansing approach is then used to detect records that were likely mislabeled by the expert. The proposed approach is investigated with a case study of a real-world WLAN. An ensemble classifier-based intrusion detection model built using the labeled training dataset demonstrates the effectiveness of the labeling approach and the good generalization accuracy

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2006 IEEE International Conference on Information Reuse & Integration

自引率

0.00%

发文量