A Topology-Adaptive Strategy for Graph Traversing

2017 International Conference on High Performance Computing & Simulation (HPCS). Pub Date: 2017-07-01. DOI: 10.1109/HPCS.2017.60

Jia Meng, Liang Cao, Huashan Yu


Citations: 0

Abstract


A Topology-Adaptive Strategy for Graph Traversing

Graphs are a key form of Big Data. Although graph computing has been studied extensively in recent years, processing large-scale graphs efficiently remains a grand challenge. Computation on a graph systematically propagates and updates vertex values, and both its complexity and its parallelism are affected mainly by the algorithm's value-propagating pattern. Efficient graph computing therefore depends on techniques compatible with that pattern. Graph traversing is a value-propagating pattern used by representative graph applications. This paper presents an efficient value-propagating framework for large-scale graph traversing applications. By partitioning the input graph according to its topology, the framework allows values for different source vertices to be propagated together, which reduces value-propagating overhead. To improve the parallel efficiency of graph traversals, a novel task scheduling mechanism has been devised that lets the framework improve load balance without loss of locality. A prototype of the framework has been implemented and evaluated on a set of typical real-world and synthetic graphs. Compared with the owner-computing rule, the prototype achieves an overall speedup of 1.23 to 3.97; compared with Ligra, the speedup ranges from 4.7 to 20.7.
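
The idea of propagating values for several source vertices together can be illustrated with a small sketch. The Python below is not the paper's framework, whose design the abstract does not detail. It is a plain level-synchronous multi-source breadth-first search in which each source owns one bit of a per-vertex bitmask, so one pass over an edge can carry the traversals of many sources at once. The function name `multi_source_bfs`, the adjacency-dict graph representation, and the bitmask encoding are all assumptions made for illustration; topology-based partitioning and task scheduling are not modeled.

```python
def multi_source_bfs(adj, sources):
    """Hypothetical sketch: BFS from several sources in one shared sweep.

    adj:     dict mapping each vertex to a list of its neighbours.
    sources: list of source vertices; sources[i] owns bit i of every mask.
    Returns: dict v -> list d, where d[i] is the hop distance from
             sources[i] to v, or None if v is unreachable from sources[i].
    """
    k = len(sources)
    seen = {v: 0 for v in adj}            # bitmask of sources that reached v
    dist = {v: [None] * k for v in adj}
    frontier = {}                          # vertex -> bits that arrived last level

    for i, s in enumerate(sources):
        seen[s] |= 1 << i
        frontier[s] = frontier.get(s, 0) | (1 << i)
        dist[s][i] = 0

    level = 0
    while frontier:
        level += 1
        next_frontier = {}
        for u, mask in frontier.items():
            for v in adj[u]:
                # Bits carried by u that have not yet reached v.
                new = mask & ~seen[v]
                if not new:
                    continue
                seen[v] |= new
                next_frontier[v] = next_frontier.get(v, 0) | new
                for i in range(k):
                    if (new >> i) & 1:
                        dist[v][i] = level
        frontier = next_frontier
    return dist
```

In this sketch, each edge is scanned once per level for all sources whose frontiers meet at the same vertex, rather than once per source, which is one simple way that shared propagation can reduce overhead.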
