Gossip-based partitioning and replication for Online Social Networks

2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014) Pub Date : 2014-08-17 DOI:10.1109/ASONAM.2014.6921557

Muhammad Anis Uddin Nasir, Fatemeh Rahimian, Sarunas Girdzijauskas

{"title":"Gossip-based partitioning and replication for Online Social Networks","authors":"Muhammad Anis Uddin Nasir, Fatemeh Rahimian, Sarunas Girdzijauskas","doi":"10.1109/ASONAM.2014.6921557","DOIUrl":null,"url":null,"abstract":"Online Social Networks (OSNs) have been gaining tremendous growth and popularity in the last decade, as they have been attracting billions of users from all over the world. Such networks generate petabytes of data from the social interactions among their users and create many management and scalability challenges. OSN users share common interests and exhibit strong community structures, which create complex dependability patterns within OSN data, thus, make it difficult to partition and distribute in a data center environment. Existing solutions, such as, distributed databases, key-value stores and auto scaling services use random partitioning to distribute the data across a cluster, which breaks existing dependencies of the OSN data and may generate huge inter-server traffic. Therefore, there is a need for intelligent data allocation strategy that can reduce the network cost for various OSN operations. In this paper, we present a gossip-based partitioning and replication scheme that efficiently splits OSN data and distributes the data across a cluster. We achieve fault tolerance and data locality, for one-hop neighbors, through replication. Our main contribution is a social graph placement strategy that divides the social graph into predefined size partitions and periodically updates the partitions to place socially connected users together. To evaluate our algorithm, we compare it with random partitioning and a state-of-the-art solution SPAR. Results show that our algorithm generates up to four times less replication overhead compared to random partitioning and half the replication overhead compared to SPAR.","PeriodicalId":143584,"journal":{"name":"2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASONAM.2014.6921557","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Online Social Networks (OSNs) have been gaining tremendous growth and popularity in the last decade, as they have been attracting billions of users from all over the world. Such networks generate petabytes of data from the social interactions among their users and create many management and scalability challenges. OSN users share common interests and exhibit strong community structures, which create complex dependability patterns within OSN data, thus, make it difficult to partition and distribute in a data center environment. Existing solutions, such as, distributed databases, key-value stores and auto scaling services use random partitioning to distribute the data across a cluster, which breaks existing dependencies of the OSN data and may generate huge inter-server traffic. Therefore, there is a need for intelligent data allocation strategy that can reduce the network cost for various OSN operations. In this paper, we present a gossip-based partitioning and replication scheme that efficiently splits OSN data and distributes the data across a cluster. We achieve fault tolerance and data locality, for one-hop neighbors, through replication. Our main contribution is a social graph placement strategy that divides the social graph into predefined size partitions and periodically updates the partitions to place socially connected users together. To evaluate our algorithm, we compare it with random partitioning and a state-of-the-art solution SPAR. Results show that our algorithm generates up to four times less replication overhead compared to random partitioning and half the replication overhead compared to SPAR.

查看原文本刊更多论文

基于八卦的在线社交网络分区和复制

在线社交网络(OSNs)在过去十年中获得了巨大的增长和普及，因为它们吸引了来自世界各地的数十亿用户。这样的网络从用户之间的社交交互中生成pb级的数据，并带来许多管理和可伸缩性方面的挑战。OSN用户具有共同的兴趣爱好，且具有较强的社区结构，这使得OSN数据内部的可靠性模式较为复杂，在数据中心环境中难以进行分区和分布。现有的解决方案，如分布式数据库、键值存储、自动伸缩服务等，都是采用随机分区的方式将数据分布在集群中，这会破坏OSN数据之间的依赖关系，可能会产生巨大的服务器间流量。因此，需要一种智能的数据分配策略，降低各种OSN操作的网络成本。在本文中，我们提出了一种基于八卦的分区和复制方案，该方案可以有效地分割OSN数据并将数据分布在集群中。我们通过复制实现了单跳邻居的容错和数据局部性。我们的主要贡献是社交图谱放置策略，该策略将社交图谱划分为预定义大小的分区，并定期更新分区，将社交连接的用户放在一起。为了评估我们的算法，我们将其与随机分区和最先进的解决方案SPAR进行比较。结果表明，与随机分区相比，我们的算法产生的复制开销减少了四倍，与SPAR相比，复制开销减少了一半。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014)

自引率

0.00%

发文量