{"title":"基于在线密度空间聚类算法的交通识别方法","authors":"Jian Zhang, Zongjue Qian, Guochu Shou, Yihong Hu","doi":"10.1109/ICNIDC.2010.5657786","DOIUrl":null,"url":null,"abstract":"Recently traffic identification based on Machine Learning (ML) techniques has attracted a great deal of interest. Two challenging issues for these methods are how to deal with encrypted flows and cope with the rapid growing number of new application types correctly and early. We propose a hybrid traffic identification method and a novel unsupervised clustering algorithm, On-Line Density Based Spatial Clustering (OLDBSC) algorithm, in which flows are automatically clustered based on sub-flow statistical features instead of full flows. We select Best-first features algorithm to find an optimal feature-sets, and then map the clusters to application types based on maximum probabilities applications in the clusters. The experiment results demonstrate that the proposed hybrid traffic identification method and OLDBSC algorithm is capable of identifying encrypted flows and potential new application types.","PeriodicalId":348778,"journal":{"name":"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Traffic identification method based on on-line density based spatial clustering algorithm\",\"authors\":\"Jian Zhang, Zongjue Qian, Guochu Shou, Yihong Hu\",\"doi\":\"10.1109/ICNIDC.2010.5657786\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently traffic identification based on Machine Learning (ML) techniques has attracted a great deal of interest. Two challenging issues for these methods are how to deal with encrypted flows and cope with the rapid growing number of new application types correctly and early. We propose a hybrid traffic identification method and a novel unsupervised clustering algorithm, On-Line Density Based Spatial Clustering (OLDBSC) algorithm, in which flows are automatically clustered based on sub-flow statistical features instead of full flows. We select Best-first features algorithm to find an optimal feature-sets, and then map the clusters to application types based on maximum probabilities applications in the clusters. The experiment results demonstrate that the proposed hybrid traffic identification method and OLDBSC algorithm is capable of identifying encrypted flows and potential new application types.\",\"PeriodicalId\":348778,\"journal\":{\"name\":\"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNIDC.2010.5657786\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNIDC.2010.5657786","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Traffic identification method based on on-line density based spatial clustering algorithm
Recently traffic identification based on Machine Learning (ML) techniques has attracted a great deal of interest. Two challenging issues for these methods are how to deal with encrypted flows and cope with the rapid growing number of new application types correctly and early. We propose a hybrid traffic identification method and a novel unsupervised clustering algorithm, On-Line Density Based Spatial Clustering (OLDBSC) algorithm, in which flows are automatically clustered based on sub-flow statistical features instead of full flows. We select Best-first features algorithm to find an optimal feature-sets, and then map the clusters to application types based on maximum probabilities applications in the clusters. The experiment results demonstrate that the proposed hybrid traffic identification method and OLDBSC algorithm is capable of identifying encrypted flows and potential new application types.