{"title":"探讨基于流量统计的网络流分类中最优特征的选择","authors":"Ming Xu, Wenbo Zhu, Jian Xu, Ning Zheng","doi":"10.1109/APNOMS.2015.7275371","DOIUrl":null,"url":null,"abstract":"The network traffic classification is one of the most fundamental work in the network measurement and management, and this problem is more and more impact as the network scale grows. Many methods are proposed by researchers, but methods based on flow statistics seem more popular than the others. In this paper, we proposed a novel method based on refined flow statistical features. The new statistics, skewness and kurtosis, and new flow statistical features, payload length, were introduced into raw feature set firstly. Then, with the consideration of efficiency in the classification stage, the feature selection was used on the raw feature set to get an optimal feature set and the feature selection are mainly based on the K-means clustering algorithm. The comparison experiment results show that the proposed optimal feature set reaches the same precision level with half time consuming and internal cluster distance when compared with the raw set.","PeriodicalId":269263,"journal":{"name":"2015 17th Asia-Pacific Network Operations and Management Symposium (APNOMS)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Towards selecting optimal features for flow statistical based network traffic classification\",\"authors\":\"Ming Xu, Wenbo Zhu, Jian Xu, Ning Zheng\",\"doi\":\"10.1109/APNOMS.2015.7275371\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The network traffic classification is one of the most fundamental work in the network measurement and management, and this problem is more and more impact as the network scale grows. Many methods are proposed by researchers, but methods based on flow statistics seem more popular than the others. In this paper, we proposed a novel method based on refined flow statistical features. The new statistics, skewness and kurtosis, and new flow statistical features, payload length, were introduced into raw feature set firstly. Then, with the consideration of efficiency in the classification stage, the feature selection was used on the raw feature set to get an optimal feature set and the feature selection are mainly based on the K-means clustering algorithm. The comparison experiment results show that the proposed optimal feature set reaches the same precision level with half time consuming and internal cluster distance when compared with the raw set.\",\"PeriodicalId\":269263,\"journal\":{\"name\":\"2015 17th Asia-Pacific Network Operations and Management Symposium (APNOMS)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 17th Asia-Pacific Network Operations and Management Symposium (APNOMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APNOMS.2015.7275371\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 17th Asia-Pacific Network Operations and Management Symposium (APNOMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APNOMS.2015.7275371","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Towards selecting optimal features for flow statistical based network traffic classification
The network traffic classification is one of the most fundamental work in the network measurement and management, and this problem is more and more impact as the network scale grows. Many methods are proposed by researchers, but methods based on flow statistics seem more popular than the others. In this paper, we proposed a novel method based on refined flow statistical features. The new statistics, skewness and kurtosis, and new flow statistical features, payload length, were introduced into raw feature set firstly. Then, with the consideration of efficiency in the classification stage, the feature selection was used on the raw feature set to get an optimal feature set and the feature selection are mainly based on the K-means clustering algorithm. The comparison experiment results show that the proposed optimal feature set reaches the same precision level with half time consuming and internal cluster distance when compared with the raw set.