Murat Karayaka, Arda Bayer, Semih Balki, E. Anarim, M. Koca
{"title":"Application Based Network Traffic Dataset and SPID Analysis","authors":"Murat Karayaka, Arda Bayer, Semih Balki, E. Anarim, M. Koca","doi":"10.1109/SIU55565.2022.9864929","DOIUrl":null,"url":null,"abstract":"Currently, web-based applications have become a part of every piece of our daily lives. The rapid advancements in these applications which have found its use in variety of sectors has made it necessary for the respective security systems to adapt as fast in order to identify these applications. In this work, some up-to-date and commonly used applications’ web traffic data have been collected for network traffic classification problem and they are presented for the use of researchers. In addition, an analysis of these data with respect to this problem is performed using features based on statistical protocol identification. It has been shown that for the traffic classification, training a Random Forest classifier with these features is more effective than using the mean KL divergence which was used in previous work with these features.","PeriodicalId":115446,"journal":{"name":"2022 30th Signal Processing and Communications Applications Conference (SIU)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 30th Signal Processing and Communications Applications Conference (SIU)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIU55565.2022.9864929","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Currently, web-based applications have become a part of every piece of our daily lives. The rapid advancements in these applications which have found its use in variety of sectors has made it necessary for the respective security systems to adapt as fast in order to identify these applications. In this work, some up-to-date and commonly used applications’ web traffic data have been collected for network traffic classification problem and they are presented for the use of researchers. In addition, an analysis of these data with respect to this problem is performed using features based on statistical protocol identification. It has been shown that for the traffic classification, training a Random Forest classifier with these features is more effective than using the mean KL divergence which was used in previous work with these features.