Mira Rani Choudhury, M. N, P. Acharjee, Aleena Terese George
{"title":"Network Traffic Classification Using Supervised Learning Algorithms","authors":"Mira Rani Choudhury, M. N, P. Acharjee, Aleena Terese George","doi":"10.1109/ICCECE51049.2023.10084931","DOIUrl":null,"url":null,"abstract":"Network traffic classification is crucial for traffic monitoring and application-based policy enforcement. However, the widespread use of encrypted protocols has greatly challenged conventional traffic classification techniques using packet payload and port numbers. For the network application in this paper, two machine learning algorithms, Decision Tree (DT) and Random Forest (RF) are used. An open-access Kaggle dataset with six different types of applications is used for this study. To achieve the best values for model training, loop iteration is used rather than the hyper-parameter optimization technique. When compared to DT, RF has the highest accuracy (99.72%). In order to improve the classification process and various hidden patterns connected with the statistical features, more statistical features were taken into account in comparison to other related works that had already been done. The outcomes demonstrate the potency of supervised learning algorithms for categorizing network traffic.","PeriodicalId":447131,"journal":{"name":"2023 International Conference on Computer, Electrical & Communication Engineering (ICCECE)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer, Electrical & Communication Engineering (ICCECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCECE51049.2023.10084931","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Network traffic classification is crucial for traffic monitoring and application-based policy enforcement. However, the widespread use of encrypted protocols has greatly challenged conventional traffic classification techniques using packet payload and port numbers. For the network application in this paper, two machine learning algorithms, Decision Tree (DT) and Random Forest (RF) are used. An open-access Kaggle dataset with six different types of applications is used for this study. To achieve the best values for model training, loop iteration is used rather than the hyper-parameter optimization technique. When compared to DT, RF has the highest accuracy (99.72%). In order to improve the classification process and various hidden patterns connected with the statistical features, more statistical features were taken into account in comparison to other related works that had already been done. The outcomes demonstrate the potency of supervised learning algorithms for categorizing network traffic.