{"title":"机器学习与深度学习在多云环境中的异常检测和分类","authors":"J. Akoto, Tara Salman","doi":"10.1109/CloudSummit54781.2022.00013","DOIUrl":null,"url":null,"abstract":"Detecting intrusions is a critical issue in cyberse-curity. One way to overcome this issue is to build efficient and robust Network Intrusion Detection Systems (NIDS) using existing Machine Learning (ML) algorithms. Such an approach has been proposed in the literature and has been shown to perform well. However, a comparative analysis of the performance of ML and Deep Learning (DL) based NIDS for both detection and categorization of intrusions is still needed. This paper investigates the performance of ML and DL models for both intrusion detection and categorization. We use the publicly available Canadian Institute of Cybersecurity Intrusion Detection System 2017 (CICIDS-2017) dataset to train and test ML and DL models. We apply three traditional ML models, namely, Logistic Regression (LR), Random Forest (RF), K-Nearest Neighbor (KNN), and three DL models − 1-D Convolutional Neural Network (ConvlD), Recurrent Neural Network (RNN), and a two-staged model that combines an unsupervised Dense Autoencoders (DAE) for pre-training and an Artificial Neural Network (ANN) for classification. Our results demonstrate that RF is the best performing ML model with a detection accuracy of 99.5% and DAE-ANN is the best performing DL model with a detection accuracy of 98.7%. We also show the advantages of using a stepwise multi-classification over a classical single-stage multi-classification. Finally, we observe that RF outperforms DAE-ANN in categorization with detection rates of 91.35 % and 84.66 %, respectively.","PeriodicalId":106553,"journal":{"name":"2022 IEEE Cloud Summit","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Machine Learning vs Deep Learning for Anomaly Detection and Categorization in Multi-cloud Environments\",\"authors\":\"J. Akoto, Tara Salman\",\"doi\":\"10.1109/CloudSummit54781.2022.00013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Detecting intrusions is a critical issue in cyberse-curity. One way to overcome this issue is to build efficient and robust Network Intrusion Detection Systems (NIDS) using existing Machine Learning (ML) algorithms. Such an approach has been proposed in the literature and has been shown to perform well. However, a comparative analysis of the performance of ML and Deep Learning (DL) based NIDS for both detection and categorization of intrusions is still needed. This paper investigates the performance of ML and DL models for both intrusion detection and categorization. We use the publicly available Canadian Institute of Cybersecurity Intrusion Detection System 2017 (CICIDS-2017) dataset to train and test ML and DL models. We apply three traditional ML models, namely, Logistic Regression (LR), Random Forest (RF), K-Nearest Neighbor (KNN), and three DL models − 1-D Convolutional Neural Network (ConvlD), Recurrent Neural Network (RNN), and a two-staged model that combines an unsupervised Dense Autoencoders (DAE) for pre-training and an Artificial Neural Network (ANN) for classification. Our results demonstrate that RF is the best performing ML model with a detection accuracy of 99.5% and DAE-ANN is the best performing DL model with a detection accuracy of 98.7%. We also show the advantages of using a stepwise multi-classification over a classical single-stage multi-classification. Finally, we observe that RF outperforms DAE-ANN in categorization with detection rates of 91.35 % and 84.66 %, respectively.\",\"PeriodicalId\":106553,\"journal\":{\"name\":\"2022 IEEE Cloud Summit\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE Cloud Summit\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CloudSummit54781.2022.00013\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Cloud Summit","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CloudSummit54781.2022.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Machine Learning vs Deep Learning for Anomaly Detection and Categorization in Multi-cloud Environments
Detecting intrusions is a critical issue in cyberse-curity. One way to overcome this issue is to build efficient and robust Network Intrusion Detection Systems (NIDS) using existing Machine Learning (ML) algorithms. Such an approach has been proposed in the literature and has been shown to perform well. However, a comparative analysis of the performance of ML and Deep Learning (DL) based NIDS for both detection and categorization of intrusions is still needed. This paper investigates the performance of ML and DL models for both intrusion detection and categorization. We use the publicly available Canadian Institute of Cybersecurity Intrusion Detection System 2017 (CICIDS-2017) dataset to train and test ML and DL models. We apply three traditional ML models, namely, Logistic Regression (LR), Random Forest (RF), K-Nearest Neighbor (KNN), and three DL models − 1-D Convolutional Neural Network (ConvlD), Recurrent Neural Network (RNN), and a two-staged model that combines an unsupervised Dense Autoencoders (DAE) for pre-training and an Artificial Neural Network (ANN) for classification. Our results demonstrate that RF is the best performing ML model with a detection accuracy of 99.5% and DAE-ANN is the best performing DL model with a detection accuracy of 98.7%. We also show the advantages of using a stepwise multi-classification over a classical single-stage multi-classification. Finally, we observe that RF outperforms DAE-ANN in categorization with detection rates of 91.35 % and 84.66 %, respectively.