A. Murtic, M. Maljic, S. Gruicic, D. Pintar, M. Vranić
{"title":"SNA-based artificial call detail records generator","authors":"A. Murtic, M. Maljic, S. Gruicic, D. Pintar, M. Vranić","doi":"10.23919/MIPRO.2018.8400222","DOIUrl":null,"url":null,"abstract":"Research involving Big Data often has to deal with the problem of data availability. Real-life data involving people and their activities is usually tied with various issues of privacy, security and secrecy, which results in difficult barriers which need to be overcome before the research can even start. In this paper we suggest an approach which can reliably provide researchers with an arbitrary amount of synthetic Call Detail Records (CDR) data which would exhibit a high level of similarity with a corresponding real-life dataset. We base our approach on a simulator whose functionality is derived on results of an exploratory analysis performed on a real-life dataset which represents a social network of users with records of their activities. In this paper we concentrate on generating CDR data used in telecommunications industry, although the approach is applicable in the other domains too.","PeriodicalId":431110,"journal":{"name":"2018 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/MIPRO.2018.8400222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Research involving Big Data often has to deal with the problem of data availability. Real-life data involving people and their activities is usually tied with various issues of privacy, security and secrecy, which results in difficult barriers which need to be overcome before the research can even start. In this paper we suggest an approach which can reliably provide researchers with an arbitrary amount of synthetic Call Detail Records (CDR) data which would exhibit a high level of similarity with a corresponding real-life dataset. We base our approach on a simulator whose functionality is derived on results of an exploratory analysis performed on a real-life dataset which represents a social network of users with records of their activities. In this paper we concentrate on generating CDR data used in telecommunications industry, although the approach is applicable in the other domains too.