{"title":"Learning from large-scale commercial networks: challenges and knowledge extraction towards machine learning use cases","authors":"Roman Zhohov, Alexandros Palaios, Philipp Geuer","doi":"10.1145/3472771.3472773","DOIUrl":null,"url":null,"abstract":"Machine Learning (ML) algorithms are proposed to replace conventional algorithms in the area of wireless networking. Many of the suggested algorithms are often based on simulators or smallscale test-beds. We provide a study based on a dataset collected over a large commercial network, and highlight some of the real network dynamics that learning agents need to cope with. Our dataset includes not only measurements from the User Equipment (UE) but also integrates information from the network. Based on the collected data, we highlight some of the aspects that are important for the design of learning agents and discuss potential dataset characteristics that might hinder the learning process. Then we discuss what dataset characteristics can facilitate the deployment of ML algorithms in the real networks. Finally, we showcase how throughput prediction can be implemented by using ML techniques and provide some examples and insights on feature engineering and the training process.","PeriodicalId":270618,"journal":{"name":"Proceedings of the 1st Workshop on 5G Measurements, Modeling, and Use Cases","volume":"108 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st Workshop on 5G Measurements, Modeling, and Use Cases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3472771.3472773","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Machine Learning (ML) algorithms are proposed to replace conventional algorithms in the area of wireless networking. Many of the suggested algorithms are often based on simulators or smallscale test-beds. We provide a study based on a dataset collected over a large commercial network, and highlight some of the real network dynamics that learning agents need to cope with. Our dataset includes not only measurements from the User Equipment (UE) but also integrates information from the network. Based on the collected data, we highlight some of the aspects that are important for the design of learning agents and discuss potential dataset characteristics that might hinder the learning process. Then we discuss what dataset characteristics can facilitate the deployment of ML algorithms in the real networks. Finally, we showcase how throughput prediction can be implemented by using ML techniques and provide some examples and insights on feature engineering and the training process.