I. Soric, Davor Dinjar, Marko Stajcer, Dražen Oreščanin
{"title":"Efficient social network analysis in big data architectures","authors":"I. Soric, Davor Dinjar, Marko Stajcer, Dražen Oreščanin","doi":"10.23919/MIPRO.2017.7973640","DOIUrl":null,"url":null,"abstract":"Social network analysis (SNA) is the application of graph theory to understand, categorize and quantify relationships in a social network. It can be a great tool to improve analytic capabilities in any field, for example marketing analytics, churn prediction, health care, etc. In terms of SNA, network structure is defined by nodes, edges and metrics which quantify the importance or influence of certain nodes in the network or relationship strength between nodes. Algorithms for network metrics calculation are complex and that makes SNA difficult to implement in big data environments on large datasets with many nodes and edges. In this paper we will elaborate how to efficiently and performance wise perform SNA and visualize results of the analysis on large datasets using increasingly popular GraphX and JavaScript libraries.","PeriodicalId":203046,"journal":{"name":"2017 40th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)","volume":"112 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 40th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/MIPRO.2017.7973640","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Social network analysis (SNA) is the application of graph theory to understand, categorize and quantify relationships in a social network. It can be a great tool to improve analytic capabilities in any field, for example marketing analytics, churn prediction, health care, etc. In terms of SNA, network structure is defined by nodes, edges and metrics which quantify the importance or influence of certain nodes in the network or relationship strength between nodes. Algorithms for network metrics calculation are complex and that makes SNA difficult to implement in big data environments on large datasets with many nodes and edges. In this paper we will elaborate how to efficiently and performance wise perform SNA and visualize results of the analysis on large datasets using increasingly popular GraphX and JavaScript libraries.