{"title":"A Statistical Framework to Forecast Duration and Volume of Internet Usage Based on Pervasive Monitoring of NetFlow Logs","authors":"S. Sarmadi, Mingyang Li, S. Chellappan","doi":"10.1109/AINA.2018.00077","DOIUrl":null,"url":null,"abstract":"In this paper, we address an important and practical problem - namely to forecast the duration and volume of Internet usage of a subject based on pervasive and unobtrusive of past history. Unfortunately though, profiling users can have privacy ramifications. In this paper, we present a statistical framework to forecast duration and volume of Internet usage of subjects via processing NetFlow logs from routers. Briefly, NetFlow logs are network level information of IP packets as they traverse a router, but they do not contain the packet payload. In our experimental study, Internet traffic logs of octets and durations of 66 subjects in a college campus were collected (via privacy-preserving NetFlow records) in a pervasive and unobtrusive manner for a month. By applying times series forecasting techniques, we demonstrate that predictions on duration and volume of usage at future times can be made based on past usage, with very good precision. Furthermore, our results also show that with more historical data, prediction accuracy improves further. We believe that our problem in this paper has not been addressed in the literature. We also believe that our contributions in this paper have important consequences in enabling privacy preserving techniques to manage network resources for administrators, cyber security via behavioral based authentication, and smarter advertising.","PeriodicalId":239730,"journal":{"name":"2018 IEEE 32nd International Conference on Advanced Information Networking and Applications (AINA)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 32nd International Conference on Advanced Information Networking and Applications (AINA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AINA.2018.00077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this paper, we address an important and practical problem - namely to forecast the duration and volume of Internet usage of a subject based on pervasive and unobtrusive of past history. Unfortunately though, profiling users can have privacy ramifications. In this paper, we present a statistical framework to forecast duration and volume of Internet usage of subjects via processing NetFlow logs from routers. Briefly, NetFlow logs are network level information of IP packets as they traverse a router, but they do not contain the packet payload. In our experimental study, Internet traffic logs of octets and durations of 66 subjects in a college campus were collected (via privacy-preserving NetFlow records) in a pervasive and unobtrusive manner for a month. By applying times series forecasting techniques, we demonstrate that predictions on duration and volume of usage at future times can be made based on past usage, with very good precision. Furthermore, our results also show that with more historical data, prediction accuracy improves further. We believe that our problem in this paper has not been addressed in the literature. We also believe that our contributions in this paper have important consequences in enabling privacy preserving techniques to manage network resources for administrators, cyber security via behavioral based authentication, and smarter advertising.