A Caching Approach to Reduce Communication in Graph Search Algorithms

2014 International Workshop on Data Intensive Scalable Computing Systems Pub Date : 2014-11-16 DOI:10.1109/DISCS.2014.8

Pietro Cicotti, L. Carrington

引用次数: 3

Abstract

In many scientific and computational domains, graphs are used to represent and analyze data. Such graphs often exhibit the characteristics of small-world networks: few high-degree vertexes connect many low-degree vertexes. Despite the randomness in a graph search, it is possible to capitalize on this characteristic and cache relevant information in high-degree vertexes. We applied this idea by caching remote vertex ids in a parallel breadth-first search implementation, and demonstrated 1.6x to 2.4x speedup over the reference implementation on 64 to 1024 cores. We proposed a system design in which resources are dedicated exclusively to caching, and shared among a set of nodes. Our evaluation demonstrates that this design has the potential to reduce communication and improve performance over large scale systems. Finally, we used a memcached system as the cache server finding that a generic protocol that does not match the usage semantics may hinder the potential performance improvements.

查看原文本刊更多论文

图搜索算法中减少通信的缓存方法

在许多科学和计算领域，图形被用来表示和分析数据。这样的图通常表现出小世界网络的特征:很少有高度顶点连接许多低度顶点。尽管图搜索具有随机性，但可以利用这一特性并将相关信息缓存在高阶顶点中。我们通过在并行宽度优先搜索实现中缓存远程顶点id来应用这个想法，并演示了在64到1024核上比参考实现提高1.6到2.4倍的速度。我们提出了一种系统设计，其中资源专门用于缓存，并在一组节点之间共享。我们的评估表明，这种设计具有减少通信和提高大型系统性能的潜力。最后，我们使用memcached系统作为缓存服务器，发现与使用语义不匹配的通用协议可能会阻碍潜在的性能改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 International Workshop on Data Intensive Scalable Computing Systems

自引率

0.00%

发文量