Massively Parallel Algorithms for Finding Well-Connected Components in Sparse Graphs

Sepehr Assadi, Xiaorui Sun, Omri Weinstein
{"title":"Massively Parallel Algorithms for Finding Well-Connected Components in Sparse Graphs","authors":"Sepehr Assadi, Xiaorui Sun, Omri Weinstein","doi":"10.1145/3293611.3331596","DOIUrl":null,"url":null,"abstract":"Massively parallel computation (MPC) algorithms for graph problems have witnessed a resurgence of interest in recent years. Despite major progress for numerous graph problems however, the complexity of the sparse graph connectivity problem in this model has remained elusive: While classical logarithmic-round PRAM algorithms for finding connected components in any n-vertex graph have been known for more than three decades (and imply the same bounds for MPC model), no o(log n)-round MPC algorithms are known for this task with truly sublinear in n memory per machine (which is the only interesting regime for sparse graphs with O(n) edges). It is conjectured that an o(log n)-round algorithm for connectivity on general sparse graphs with n1-Ω (1) per-machine memory may not exist, a conjecture that also forms the basis for multiple conditional hardness results on the round complexity of other problems in the MPC model. We take an opportunistic approach towards the sparse graph connectivity problem by designing an algorithm with improved performance in terms of the connectivity structure of the input graph. Formally, we design an MPC algorithm that finds all connected components with spectral gap at least λ in a graph in O(log log n + log(1/λ)) MPC rounds and nδ memory per machine for any constant δ ∈ (0,1). While this algorithm still requires Θ(log n) rounds in the worst-case, it achieves an exponential round reduction on \"well-connected'' components with λ ≥ 1/polylog(n) using only nδ memory per machine and ł(n) total memory, and still operates in o(log n)l rounds even when λ = 1/no(1). En-route to our main result, we design a new distributed data structure for performing independent random walks from all vertices simultaneously, as well as a new leader-election algorithm with exponentially faster round complexity on random graphs.","PeriodicalId":153766,"journal":{"name":"Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"54","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3293611.3331596","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 54

Abstract

Massively parallel computation (MPC) algorithms for graph problems have witnessed a resurgence of interest in recent years. Despite major progress for numerous graph problems however, the complexity of the sparse graph connectivity problem in this model has remained elusive: While classical logarithmic-round PRAM algorithms for finding connected components in any n-vertex graph have been known for more than three decades (and imply the same bounds for MPC model), no o(log n)-round MPC algorithms are known for this task with truly sublinear in n memory per machine (which is the only interesting regime for sparse graphs with O(n) edges). It is conjectured that an o(log n)-round algorithm for connectivity on general sparse graphs with n1-Ω (1) per-machine memory may not exist, a conjecture that also forms the basis for multiple conditional hardness results on the round complexity of other problems in the MPC model. We take an opportunistic approach towards the sparse graph connectivity problem by designing an algorithm with improved performance in terms of the connectivity structure of the input graph. Formally, we design an MPC algorithm that finds all connected components with spectral gap at least λ in a graph in O(log log n + log(1/λ)) MPC rounds and nδ memory per machine for any constant δ ∈ (0,1). While this algorithm still requires Θ(log n) rounds in the worst-case, it achieves an exponential round reduction on "well-connected'' components with λ ≥ 1/polylog(n) using only nδ memory per machine and ł(n) total memory, and still operates in o(log n)l rounds even when λ = 1/no(1). En-route to our main result, we design a new distributed data structure for performing independent random walks from all vertices simultaneously, as well as a new leader-election algorithm with exponentially faster round complexity on random graphs.
稀疏图中良连通分量的大规模并行算法
近年来,图问题的大规模并行计算(MPC)算法重新引起了人们的兴趣。然而,尽管在许多图问题上取得了重大进展,但该模型中稀疏图连通性问题的复杂性仍然难以捉摸:虽然用于在任何n顶点图中查找连接组件的经典对数轮PRAM算法已经被发现了三十多年(并且意味着MPC模型的边界相同),但是对于每台机器n个内存的真正次线性的任务,没有已知的o(log n)轮MPC算法(这是具有o(n)条边的稀疏图的唯一有趣的制度)。据推测,对于每台机器内存为n1-Ω(1)的一般稀疏图,可能不存在o(log n)轮的连通性算法,这一猜想也构成了MPC模型中其他问题的轮复杂度的多个条件硬度结果的基础。我们通过设计一种在输入图的连通性结构方面具有改进性能的算法,采用机会主义方法解决稀疏图连通性问题。形式上,我们设计了一种MPC算法,对于任意常数δ∈(0,1),该算法在O(log log n + log(1/λ)) MPC轮数和每台机器nδ内存中找到图中频谱间隙至少λ的所有连接分量。虽然该算法在最坏情况下仍然需要Θ(log n)轮,但它在λ≥1/polylog(n)的“良好连接”组件上实现指数轮缩减,每台机器只使用nδ内存和z (n)总内存,并且即使λ = 1/no(1),仍然在o(log n)l轮中运行。在我们的主要结果中,我们设计了一种新的分布式数据结构,用于同时从所有顶点执行独立随机行走,以及一种新的leader选举算法,该算法在随机图上具有指数级快的轮复杂度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信