D. Sanvito, G. Siracusano, Roberto Gonzalez, R. Bifulco
{"title":"Prediphant: Short Term Heavy User Prediction","authors":"D. Sanvito, G. Siracusano, Roberto Gonzalez, R. Bifulco","doi":"10.1109/CSNDSP54353.2022.9907909","DOIUrl":null,"url":null,"abstract":"Traffic prediction is of paramount importance for the correct management of network infrastructures. Most research efforts try to forecast the aggregated traffic over the network and over large time windows. In this work, we tackle the problem the other way around. That is, we predict the behaviour of individual users over short time windows. First, we investigate the contribution of the most data eager users to the global network traffic. We do it by analyzing network traces coming from several thousand real users. Then, we design a technique, based on a combination of Natural Language Processing (NLP) and Long Short-Term Memory (LSTM) machine learning models, that leverages past navigation patterns to predict sudden changes in the amount of resources consumed by each user. Finally, we evaluate our method using real data finding it is able to predict about 80% of the users that will rump up their network needs in most realistic scenarios.","PeriodicalId":288069,"journal":{"name":"2022 13th International Symposium on Communication Systems, Networks and Digital Signal Processing (CSNDSP)","volume":"381 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 13th International Symposium on Communication Systems, Networks and Digital Signal Processing (CSNDSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSNDSP54353.2022.9907909","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Traffic prediction is of paramount importance for the correct management of network infrastructures. Most research efforts try to forecast the aggregated traffic over the network and over large time windows. In this work, we tackle the problem the other way around. That is, we predict the behaviour of individual users over short time windows. First, we investigate the contribution of the most data eager users to the global network traffic. We do it by analyzing network traces coming from several thousand real users. Then, we design a technique, based on a combination of Natural Language Processing (NLP) and Long Short-Term Memory (LSTM) machine learning models, that leverages past navigation patterns to predict sudden changes in the amount of resources consumed by each user. Finally, we evaluate our method using real data finding it is able to predict about 80% of the users that will rump up their network needs in most realistic scenarios.