GraphSD: A State and Dependency aware Out-of-Core Graph Processing System

Proceedings of the 51st International Conference on Parallel Processing Pub Date : 2022-08-29 DOI:10.1145/3545008.3545039

Xianghao Xu, Hong Jiang, Fang Wang, Yongli Cheng, Peng Fang

{"title":"GraphSD: A State and Dependency aware Out-of-Core Graph Processing System","authors":"Xianghao Xu, Hong Jiang, Fang Wang, Yongli Cheng, Peng Fang","doi":"10.1145/3545008.3545039","DOIUrl":null,"url":null,"abstract":"In recent years, system researchers have proposed many out-of-core graph processing systems to efficiently handle graphs that exceed the memory capacity of a single machine. Through disk-friendly graph data organizations and well-designed execution engines, existing out-of-core graph processing systems can maintain sequential locality on disk access and greatly reduce disk I/Os during processing. However, they have not fully explored the characteristics of graph data and algorithm execution to further reduce disk I/Os, leaving significant room for performance improvement. In this paper, we present a novel out-of-core graph processing system called GraphSD, which optimizes the I/O traffic by simultaneously capturing the state and dependency of graph data during computation. At the heart of GraphSD is a state- and dependency-aware update strategy that includes two adaptive update models, selective cross-iteration update (SCIU) and full cross-iteration update (FCIU). These two update models are dynamically triggered at runtime to enable active-vertex aware processing and cross-iteration vertex value computation, which avoid loading inactive edges and reduce disk I/Os in the future iterations. Moreover, an efficient sub-block based buffering scheme is proposed to further minimize I/O overheads. Our evaluation results show that GraphSD outperforms two state-of-the-art out-of-core graph processing systems HUS-Graph and Lumos by up to 2.7 × and 3.9 × respectively.","PeriodicalId":360504,"journal":{"name":"Proceedings of the 51st International Conference on Parallel Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 51st International Conference on Parallel Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3545008.3545039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

In recent years, system researchers have proposed many out-of-core graph processing systems to efficiently handle graphs that exceed the memory capacity of a single machine. Through disk-friendly graph data organizations and well-designed execution engines, existing out-of-core graph processing systems can maintain sequential locality on disk access and greatly reduce disk I/Os during processing. However, they have not fully explored the characteristics of graph data and algorithm execution to further reduce disk I/Os, leaving significant room for performance improvement. In this paper, we present a novel out-of-core graph processing system called GraphSD, which optimizes the I/O traffic by simultaneously capturing the state and dependency of graph data during computation. At the heart of GraphSD is a state- and dependency-aware update strategy that includes two adaptive update models, selective cross-iteration update (SCIU) and full cross-iteration update (FCIU). These two update models are dynamically triggered at runtime to enable active-vertex aware processing and cross-iteration vertex value computation, which avoid loading inactive edges and reduce disk I/Os in the future iterations. Moreover, an efficient sub-block based buffering scheme is proposed to further minimize I/O overheads. Our evaluation results show that GraphSD outperforms two state-of-the-art out-of-core graph processing systems HUS-Graph and Lumos by up to 2.7 × and 3.9 × respectively.

查看原文本刊更多论文

一个状态和依赖感知的核外图处理系统

近年来，系统研究人员提出了许多外核图形处理系统，以有效地处理超出单个机器内存容量的图形。通过对磁盘友好的图形数据组织和设计良好的执行引擎，现有的核外图形处理系统可以保持磁盘访问的顺序局部性，并在处理过程中大大减少磁盘I/ o。然而，他们并没有充分探索图数据和算法执行的特点，以进一步减少磁盘I/ o，这给性能提升留下了很大的空间。本文提出了一种新型的核外图形处理系统GraphSD，该系统通过在计算过程中同时捕获图形数据的状态和依赖关系来优化I/O流量。GraphSD的核心是一个状态感知和依赖关系感知的更新策略，它包括两个自适应更新模型，选择性交叉迭代更新(SCIU)和完全交叉迭代更新(FCIU)。这两个更新模型在运行时动态触发，以支持活动顶点感知处理和交叉迭代顶点值计算，从而避免加载非活动边缘并减少未来迭代中的磁盘I/ o。此外，还提出了一种有效的基于子块的缓冲方案，以进一步减少I/O开销。我们的评估结果表明，GraphSD比两个最先进的核心外图形处理系统HUS-Graph和Lumos分别高出2.7倍和3.9倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 51st International Conference on Parallel Processing

自引率

0.00%

发文量