{"title":"入侵检测的降维与监督学习","authors":"I. Obeidat, Wafa' Eleisah, Kinda Magableh","doi":"10.1109/ICIM56520.2022.00023","DOIUrl":null,"url":null,"abstract":"This paper proposes the application of supervised machine learning approaches for Intrusion detection. Moreover, it examines the effect of applying dimensionality reduction on classification performance. For validation, we use a subset of the NSL-KDD dataset, a widely applied dataset in this domain. Our results indicate that seven classification algorithms perform well on the dataset and based on the accuracy and false positive rate measure. The best reported accuracy results are using Random Forest algorithm with 80.5% accuracy. To enhance the classification performance, we use two dimensionality reduction algorithms: Principal Component analysis (PCA) feature reduction algorithm and BestFirst feature selection algorithm. PCA is effective in enhancing the performance of four algorithms with ranges of improvements from (1.0% – 4.1 %). Moreover, BestFirst algorithm is effective in enhancing the performance of five algorithms with improvements ranging from (0.1 % – 2.0%). In addition, there is saving in the training time after feature selection with slightly better results compared to the original full feature set.","PeriodicalId":391964,"journal":{"name":"2022 8th International Conference on Information Management (ICIM)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dimensionality Reduction and Supervised Learning for Intrusion Detection\",\"authors\":\"I. Obeidat, Wafa' Eleisah, Kinda Magableh\",\"doi\":\"10.1109/ICIM56520.2022.00023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes the application of supervised machine learning approaches for Intrusion detection. Moreover, it examines the effect of applying dimensionality reduction on classification performance. For validation, we use a subset of the NSL-KDD dataset, a widely applied dataset in this domain. Our results indicate that seven classification algorithms perform well on the dataset and based on the accuracy and false positive rate measure. The best reported accuracy results are using Random Forest algorithm with 80.5% accuracy. To enhance the classification performance, we use two dimensionality reduction algorithms: Principal Component analysis (PCA) feature reduction algorithm and BestFirst feature selection algorithm. PCA is effective in enhancing the performance of four algorithms with ranges of improvements from (1.0% – 4.1 %). Moreover, BestFirst algorithm is effective in enhancing the performance of five algorithms with improvements ranging from (0.1 % – 2.0%). In addition, there is saving in the training time after feature selection with slightly better results compared to the original full feature set.\",\"PeriodicalId\":391964,\"journal\":{\"name\":\"2022 8th International Conference on Information Management (ICIM)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 8th International Conference on Information Management (ICIM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIM56520.2022.00023\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Conference on Information Management (ICIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIM56520.2022.00023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dimensionality Reduction and Supervised Learning for Intrusion Detection
This paper proposes the application of supervised machine learning approaches for Intrusion detection. Moreover, it examines the effect of applying dimensionality reduction on classification performance. For validation, we use a subset of the NSL-KDD dataset, a widely applied dataset in this domain. Our results indicate that seven classification algorithms perform well on the dataset and based on the accuracy and false positive rate measure. The best reported accuracy results are using Random Forest algorithm with 80.5% accuracy. To enhance the classification performance, we use two dimensionality reduction algorithms: Principal Component analysis (PCA) feature reduction algorithm and BestFirst feature selection algorithm. PCA is effective in enhancing the performance of four algorithms with ranges of improvements from (1.0% – 4.1 %). Moreover, BestFirst algorithm is effective in enhancing the performance of five algorithms with improvements ranging from (0.1 % – 2.0%). In addition, there is saving in the training time after feature selection with slightly better results compared to the original full feature set.