{"title":"On using eXtreme Gradient Boosting (XGBoost) Machine Learning algorithm for Home Network Traffic Classification","authors":"Iyad Lahsen Cherif, A. Kortebi","doi":"10.1109/WD.2019.8734193","DOIUrl":null,"url":null,"abstract":"Traffic classification (TC) is a fundamental task of network management and monitoring operations. Previous works relying on selected packet header fields (e.g. port numbers) or application layer protocol decoding techniques are becoming increasingly difficult and inefficient when facing encrypted traffic and peer-to-peer flows. In this paper, we address the problem of flow based TC using machine learning (ML) algorithms. Our work considers a supervised approach, namely eXtreme Gradient Boosting (XGBoost) algorithm, which has never been investigated for TC. Performance evaluation results show that we obtain 99.5% accuracy on a dataset containing real flows. Additionally, compared to other ML algorithms, XGBoost is the most accurate one.","PeriodicalId":432101,"journal":{"name":"2019 Wireless Days (WD)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"43","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Wireless Days (WD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WD.2019.8734193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 43
Abstract
Traffic classification (TC) is a fundamental task of network management and monitoring operations. Previous works relying on selected packet header fields (e.g. port numbers) or application layer protocol decoding techniques are becoming increasingly difficult and inefficient when facing encrypted traffic and peer-to-peer flows. In this paper, we address the problem of flow based TC using machine learning (ML) algorithms. Our work considers a supervised approach, namely eXtreme Gradient Boosting (XGBoost) algorithm, which has never been investigated for TC. Performance evaluation results show that we obtain 99.5% accuracy on a dataset containing real flows. Additionally, compared to other ML algorithms, XGBoost is the most accurate one.