Maonan Wang, K. Zheng, Xinyi Ning, Yanqing Yang, Xiujuan Wang
{"title":"CENTIME:一种用于加密流量分类的直接综合流量特征提取方法","authors":"Maonan Wang, K. Zheng, Xinyi Ning, Yanqing Yang, Xiujuan Wang","doi":"10.1109/ICCCS52626.2021.9449280","DOIUrl":null,"url":null,"abstract":"With the rapid development of the network, encrypted traffic classification plays a vital role in guaranteeing the quality of network services and ensuring the security of the network. Recent studies show that machine learning approaches based on statistical features and raw traffic sessions are effective for this task. However, the performance of the statistical-based approaches largely depends on the quality of the features. Experts need to design different features for different encrypted traffic classification tasks, which is time-consuming. Meanwhile, the raw traffic-based approach needs to uniformize the traffic size; this will cause the loss of information about the overall structure of the network traffic; for example, we do not know the time from the first packet to the last packet in a session. This paper proposes the CENTIME, which can extract comprehensive information based on ResNet and AutoEncoder to identify encrypted traffic. ResNet is used to extract information from uniformized traffic, and AutoEncoder is used to encode statistical features. The statistical features are used to compensate for the information loss caused by traffic uniformization. They only need to be designed once rather than be designed separately for different tasks. Moreover, the pooling layers are removed, and 1D convolution layers are used to help CENTIME make more effective use of raw traffic information. We evaluate the CENTIME on the public dataset “ISCX VPN-nonVPN”, and the results demonstrate the CENTIME outperforms the state-of-the-art encrypted traffic classification methods. More importantly, comprehensive traffic features generated in the CENTIME can represent different classes of traffic well.","PeriodicalId":376290,"journal":{"name":"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"CENTIME: A Direct Comprehensive Traffic Features Extraction for Encrypted Traffic Classification\",\"authors\":\"Maonan Wang, K. Zheng, Xinyi Ning, Yanqing Yang, Xiujuan Wang\",\"doi\":\"10.1109/ICCCS52626.2021.9449280\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of the network, encrypted traffic classification plays a vital role in guaranteeing the quality of network services and ensuring the security of the network. Recent studies show that machine learning approaches based on statistical features and raw traffic sessions are effective for this task. However, the performance of the statistical-based approaches largely depends on the quality of the features. Experts need to design different features for different encrypted traffic classification tasks, which is time-consuming. Meanwhile, the raw traffic-based approach needs to uniformize the traffic size; this will cause the loss of information about the overall structure of the network traffic; for example, we do not know the time from the first packet to the last packet in a session. This paper proposes the CENTIME, which can extract comprehensive information based on ResNet and AutoEncoder to identify encrypted traffic. ResNet is used to extract information from uniformized traffic, and AutoEncoder is used to encode statistical features. The statistical features are used to compensate for the information loss caused by traffic uniformization. They only need to be designed once rather than be designed separately for different tasks. Moreover, the pooling layers are removed, and 1D convolution layers are used to help CENTIME make more effective use of raw traffic information. We evaluate the CENTIME on the public dataset “ISCX VPN-nonVPN”, and the results demonstrate the CENTIME outperforms the state-of-the-art encrypted traffic classification methods. More importantly, comprehensive traffic features generated in the CENTIME can represent different classes of traffic well.\",\"PeriodicalId\":376290,\"journal\":{\"name\":\"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCS52626.2021.9449280\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCS52626.2021.9449280","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CENTIME: A Direct Comprehensive Traffic Features Extraction for Encrypted Traffic Classification
With the rapid development of the network, encrypted traffic classification plays a vital role in guaranteeing the quality of network services and ensuring the security of the network. Recent studies show that machine learning approaches based on statistical features and raw traffic sessions are effective for this task. However, the performance of the statistical-based approaches largely depends on the quality of the features. Experts need to design different features for different encrypted traffic classification tasks, which is time-consuming. Meanwhile, the raw traffic-based approach needs to uniformize the traffic size; this will cause the loss of information about the overall structure of the network traffic; for example, we do not know the time from the first packet to the last packet in a session. This paper proposes the CENTIME, which can extract comprehensive information based on ResNet and AutoEncoder to identify encrypted traffic. ResNet is used to extract information from uniformized traffic, and AutoEncoder is used to encode statistical features. The statistical features are used to compensate for the information loss caused by traffic uniformization. They only need to be designed once rather than be designed separately for different tasks. Moreover, the pooling layers are removed, and 1D convolution layers are used to help CENTIME make more effective use of raw traffic information. We evaluate the CENTIME on the public dataset “ISCX VPN-nonVPN”, and the results demonstrate the CENTIME outperforms the state-of-the-art encrypted traffic classification methods. More importantly, comprehensive traffic features generated in the CENTIME can represent different classes of traffic well.