{"title":"Evaluating Application-Layer Classification Using a Machine Learning Technique over Different High Speed Networks","authors":"S. Ubik, P. Zejdl","doi":"10.1109/ICSNC.2010.66","DOIUrl":null,"url":null,"abstract":"Application–layer classification is needed in many monitoring applications. Classification based on machine learning offers an alternative method to methods based on port or payload based techniques. It is based on statistical features computed from network flows. Several works investigated the efficiency of machine learning techniques and found algorithms suitable for network classification. A classifier based on machine learning is built by learning from a training data set that consists of data from known application traces. In this paper, we evaluate the efficiency of application-layer classification based on C4.5~machine learning algorithm used for classification network flows from different high speed networks, such as 100~Mbit, 1~Gbit and 10~Gbit networks. We find a significant decrease in the classification efficiency when classifier built for one network is used to classify other network. We recommend to build classifier from data collected from all available networks for best results. However, if different networks are not available, good results can be obtained from data traces to the commodity Internet.","PeriodicalId":152012,"journal":{"name":"2010 Fifth International Conference on Systems and Networks Communications","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Fifth International Conference on Systems and Networks Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSNC.2010.66","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 23
Abstract
Application–layer classification is needed in many monitoring applications. Classification based on machine learning offers an alternative method to methods based on port or payload based techniques. It is based on statistical features computed from network flows. Several works investigated the efficiency of machine learning techniques and found algorithms suitable for network classification. A classifier based on machine learning is built by learning from a training data set that consists of data from known application traces. In this paper, we evaluate the efficiency of application-layer classification based on C4.5~machine learning algorithm used for classification network flows from different high speed networks, such as 100~Mbit, 1~Gbit and 10~Gbit networks. We find a significant decrease in the classification efficiency when classifier built for one network is used to classify other network. We recommend to build classifier from data collected from all available networks for best results. However, if different networks are not available, good results can be obtained from data traces to the commodity Internet.