{"title":"Semi-clustering That Scales: An Empirical Evaluation of GraphX","authors":"J. S. Andersen, O. Zukunft","doi":"10.1109/BigDataCongress.2016.51","DOIUrl":null,"url":null,"abstract":"GraphX is a distributed graph processing framework build on top of Spark Core. This work investigates the two questions, whether GraphX is an appropriate environment for the implementation of graph algorithms and how the computation of graph algorithms based on GraphX scales. This paper examines a graph algorithm for semi-clustering as used in social network analysis. We describe the implementation process of this algorithm beginning with a graph-oriented modeling tailored for GraphX up to an executable program. Based on our implementation, we have performed empirical evaluations regarding the scalability of our implementation and the GraphX platform. The experiments evidence that different kind of graph algorithms are supported by GraphX and that the execution of our algorithm can scale almost linearly when properly designed.","PeriodicalId":407471,"journal":{"name":"2016 IEEE International Congress on Big Data (BigData Congress)","volume":"316 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Congress on Big Data (BigData Congress)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BigDataCongress.2016.51","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
GraphX is a distributed graph processing framework build on top of Spark Core. This work investigates the two questions, whether GraphX is an appropriate environment for the implementation of graph algorithms and how the computation of graph algorithms based on GraphX scales. This paper examines a graph algorithm for semi-clustering as used in social network analysis. We describe the implementation process of this algorithm beginning with a graph-oriented modeling tailored for GraphX up to an executable program. Based on our implementation, we have performed empirical evaluations regarding the scalability of our implementation and the GraphX platform. The experiments evidence that different kind of graph algorithms are supported by GraphX and that the execution of our algorithm can scale almost linearly when properly designed.