Yang Liu, Shuzhuang Zhang, Bo Ding, Xiaoqing Li, Yipeng Wang
{"title":"A cascade forest approach to application classification of mobile traces","authors":"Yang Liu, Shuzhuang Zhang, Bo Ding, Xiaoqing Li, Yipeng Wang","doi":"10.1109/WCNC.2018.8377311","DOIUrl":null,"url":null,"abstract":"With the rapid development of mobile networks, mobile traffic classification, a mapping of mobile traffic to mobile applications, becomes more and more important for variant networking and security issues, such as network management, monitoring and the detection of malware activities. In this paper, we propose CFMTC (Cascade Forest for Mobile Traces Classification), a mobile network trace-based traffic classification system, which exploits flow statistical features extracted from mobile traces. Compared to other classification approaches, our system is based upon the key insight that deep learning techniques and the statistical features of bidirectional flows of mobile traces can be combined together for accurate mobile application classification. In CFMTC, we first filter UDP and TCP flows from mobile traces according to the flow attributes (Source IP, Destination IP, Source port, Destination port, Protocol), and then train Cascade Forest to classify raw mobile traces. We use a feature selection method to find the optimal feature set and determine the influence of different features. Our approach involves the following key features: 1) suitable for mobile traces classification; 2) adapted Cascade Forest algorithm for mobile traffic classification; 3) applicable to both connection-oriented protocols and connection-less protocols; 4) effective for both encrypted and non-encrypted flows. We implement CFMTC and conduct extensive evaluations on mobile network traces containing text, audio and video flows generated by Kuwo Music, WeChat, PPTV Live traces. Our experimental results show that CFMTC has the ability to accurately classify the mobile traces of the target mobile applications with an average accuracy of about 88.71%. Our experimental results prove that CFMTC is a robust system, and meanwhile displays competitive performance in practice.","PeriodicalId":360054,"journal":{"name":"2018 IEEE Wireless Communications and Networking Conference (WCNC)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Wireless Communications and Networking Conference (WCNC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCNC.2018.8377311","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
With the rapid development of mobile networks, mobile traffic classification, a mapping of mobile traffic to mobile applications, becomes more and more important for variant networking and security issues, such as network management, monitoring and the detection of malware activities. In this paper, we propose CFMTC (Cascade Forest for Mobile Traces Classification), a mobile network trace-based traffic classification system, which exploits flow statistical features extracted from mobile traces. Compared to other classification approaches, our system is based upon the key insight that deep learning techniques and the statistical features of bidirectional flows of mobile traces can be combined together for accurate mobile application classification. In CFMTC, we first filter UDP and TCP flows from mobile traces according to the flow attributes (Source IP, Destination IP, Source port, Destination port, Protocol), and then train Cascade Forest to classify raw mobile traces. We use a feature selection method to find the optimal feature set and determine the influence of different features. Our approach involves the following key features: 1) suitable for mobile traces classification; 2) adapted Cascade Forest algorithm for mobile traffic classification; 3) applicable to both connection-oriented protocols and connection-less protocols; 4) effective for both encrypted and non-encrypted flows. We implement CFMTC and conduct extensive evaluations on mobile network traces containing text, audio and video flows generated by Kuwo Music, WeChat, PPTV Live traces. Our experimental results show that CFMTC has the ability to accurately classify the mobile traces of the target mobile applications with an average accuracy of about 88.71%. Our experimental results prove that CFMTC is a robust system, and meanwhile displays competitive performance in practice.
随着移动网络的快速发展,移动流量分类作为一种将移动流量映射到移动应用的方法,在网络管理、监控和恶意软件活动检测等变型网络和安全问题中变得越来越重要。本文提出了一种基于移动网络轨迹的流量分类系统CFMTC (Cascade Forest for Mobile Traces Classification),该系统利用了从移动轨迹中提取的流量统计特征。与其他分类方法相比,我们的系统基于深度学习技术和移动痕迹双向流的统计特征可以结合在一起进行准确的移动应用分类的关键见解。在CFMTC中,我们首先根据流属性(源IP,目的IP,源端口,目的端口,协议)从移动跟踪中过滤UDP和TCP流,然后训练Cascade Forest对原始移动跟踪进行分类。我们使用特征选择方法来找到最优特征集,并确定不同特征的影响。我们的方法有以下几个主要特点:1)适合移动轨迹的分类;2)采用Cascade Forest算法进行移动流量分类;3)面向连接协议和无连接协议均适用;4)对加密和非加密流量都有效。我们实施CFMTC,并对包括酷我音乐、微信、PPTV Live痕迹生成的文本、音频和视频流在内的移动网络痕迹进行广泛的评估。实验结果表明,CFMTC能够准确地对目标移动应用的移动痕迹进行分类,平均准确率约为88.71%。实验结果表明,CFMTC系统具有良好的鲁棒性,同时在实际应用中也具有一定的竞争力。