{"title":"基于字节样本熵的VPN加密流量识别深度学习","authors":"Yajuan Wang, Gengshen Yu, Wen Shen, Lintan Sun","doi":"10.1109/AEMCSE55572.2022.00066","DOIUrl":null,"url":null,"abstract":"Network traffic identification is important for traffic engineering, resource allocation, network management, attack detection and improving network QoS. However, with the rapid development of computer network technology, various VPN technologies and applications have emerged, which use encryption and decryption technology, tunneling technology and authentication technology to obfuscate and hide traffic characteristics, making VPN traffic difficult to identify. The recent rise of V2Ray makes up for and completes the shortcomings of previous VPN technologies with a more complete protocol, more robust performance and richer functionality, using V2Ray’s customised VMess protocol, and the VMess protocol supports TLS-based implementations, making it a full-featured and powerful application. These undoubtedly pose a huge challenge for network traffic identification and auditing, as well as a huge risk for network security. Therefore, the identification of VPN traffic is of great importance. In this paper, we propose a VPN traffic identification method based on byte sample entropy and session interaction time difference. We use the byte sample entropy and session interaction time difference of some message sequences in network traffic as feature data, and use Random Forest RF (RF) algorithm to identify V2Ray VMess traffic, TLS-based VMess traffic and ISCX VPN-NonVPN public dataset, achieving 95.97%, 90.32% and 91.78% recognition accuracy, respectively. The experimental results show that the method can be used for the detection and identification of V2Ray traffic, and also supports the detection and identification of the rest of VPN traffic.","PeriodicalId":309096,"journal":{"name":"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep learning based on byte sample entropy for VPN encrypted traffic identification\",\"authors\":\"Yajuan Wang, Gengshen Yu, Wen Shen, Lintan Sun\",\"doi\":\"10.1109/AEMCSE55572.2022.00066\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Network traffic identification is important for traffic engineering, resource allocation, network management, attack detection and improving network QoS. However, with the rapid development of computer network technology, various VPN technologies and applications have emerged, which use encryption and decryption technology, tunneling technology and authentication technology to obfuscate and hide traffic characteristics, making VPN traffic difficult to identify. The recent rise of V2Ray makes up for and completes the shortcomings of previous VPN technologies with a more complete protocol, more robust performance and richer functionality, using V2Ray’s customised VMess protocol, and the VMess protocol supports TLS-based implementations, making it a full-featured and powerful application. These undoubtedly pose a huge challenge for network traffic identification and auditing, as well as a huge risk for network security. Therefore, the identification of VPN traffic is of great importance. In this paper, we propose a VPN traffic identification method based on byte sample entropy and session interaction time difference. We use the byte sample entropy and session interaction time difference of some message sequences in network traffic as feature data, and use Random Forest RF (RF) algorithm to identify V2Ray VMess traffic, TLS-based VMess traffic and ISCX VPN-NonVPN public dataset, achieving 95.97%, 90.32% and 91.78% recognition accuracy, respectively. The experimental results show that the method can be used for the detection and identification of V2Ray traffic, and also supports the detection and identification of the rest of VPN traffic.\",\"PeriodicalId\":309096,\"journal\":{\"name\":\"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AEMCSE55572.2022.00066\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AEMCSE55572.2022.00066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep learning based on byte sample entropy for VPN encrypted traffic identification
Network traffic identification is important for traffic engineering, resource allocation, network management, attack detection and improving network QoS. However, with the rapid development of computer network technology, various VPN technologies and applications have emerged, which use encryption and decryption technology, tunneling technology and authentication technology to obfuscate and hide traffic characteristics, making VPN traffic difficult to identify. The recent rise of V2Ray makes up for and completes the shortcomings of previous VPN technologies with a more complete protocol, more robust performance and richer functionality, using V2Ray’s customised VMess protocol, and the VMess protocol supports TLS-based implementations, making it a full-featured and powerful application. These undoubtedly pose a huge challenge for network traffic identification and auditing, as well as a huge risk for network security. Therefore, the identification of VPN traffic is of great importance. In this paper, we propose a VPN traffic identification method based on byte sample entropy and session interaction time difference. We use the byte sample entropy and session interaction time difference of some message sequences in network traffic as feature data, and use Random Forest RF (RF) algorithm to identify V2Ray VMess traffic, TLS-based VMess traffic and ISCX VPN-NonVPN public dataset, achieving 95.97%, 90.32% and 91.78% recognition accuracy, respectively. The experimental results show that the method can be used for the detection and identification of V2Ray traffic, and also supports the detection and identification of the rest of VPN traffic.