并行图处理实现的性能和推荐系统:正在进行中

Companion of the 2019 ACM/SPEC International Conference on Performance Engineering Pub Date : 2019-03-27 DOI:10.1145/3302541.3313097

Samuel D. Pollard, Sudharshan Srinivasan, B. Norris

{"title":"并行图处理实现的性能和推荐系统:正在进行中","authors":"Samuel D. Pollard, Sudharshan Srinivasan, B. Norris","doi":"10.1145/3302541.3313097","DOIUrl":null,"url":null,"abstract":"There are nearly one hundred parallel and distributed graph processing packages. Selecting the best package for a given problem is difficult; some packages require GPUs, some are optimized for distributed or shared memory, and some require proprietary compilers or perform better on different hardware. Furthermore, performance may vary wildly depending on the graph itself. This complexity makes selecting the optimal implementation manually infeasible. We develop an approach to predict the performance of parallel graph processing using both regression models and binary classification by labeling configurations as either well-performing or not. We demonstrate our approach on six graph processing packages: GraphMat, the Graph500, the Graph Algorithm Platform Benchmark Suite, GraphBIG, Galois, and PowerGraph and on four algorithms: PageRank, single-source shortest paths, triangle counting, and breadth first search. Given a graph, our method can estimate execution time or suggest an implementation and thread count expected to perform well. Our method correctly identifies well-performing configurations in 97% of test cases.","PeriodicalId":231712,"journal":{"name":"Companion of the 2019 ACM/SPEC International Conference on Performance Engineering","volume":"83 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Performance and Recommendation System for Parallel Graph Processing Implementations: Work-In-Progress\",\"authors\":\"Samuel D. Pollard, Sudharshan Srinivasan, B. Norris\",\"doi\":\"10.1145/3302541.3313097\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There are nearly one hundred parallel and distributed graph processing packages. Selecting the best package for a given problem is difficult; some packages require GPUs, some are optimized for distributed or shared memory, and some require proprietary compilers or perform better on different hardware. Furthermore, performance may vary wildly depending on the graph itself. This complexity makes selecting the optimal implementation manually infeasible. We develop an approach to predict the performance of parallel graph processing using both regression models and binary classification by labeling configurations as either well-performing or not. We demonstrate our approach on six graph processing packages: GraphMat, the Graph500, the Graph Algorithm Platform Benchmark Suite, GraphBIG, Galois, and PowerGraph and on four algorithms: PageRank, single-source shortest paths, triangle counting, and breadth first search. Given a graph, our method can estimate execution time or suggest an implementation and thread count expected to perform well. Our method correctly identifies well-performing configurations in 97% of test cases.\",\"PeriodicalId\":231712,\"journal\":{\"name\":\"Companion of the 2019 ACM/SPEC International Conference on Performance Engineering\",\"volume\":\"83 2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-03-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Companion of the 2019 ACM/SPEC International Conference on Performance Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3302541.3313097\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion of the 2019 ACM/SPEC International Conference on Performance Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3302541.3313097","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

有近百个并行和分布式图形处理包。为给定问题选择最佳软件包是困难的;有些软件包需要gpu，有些软件包针对分布式或共享内存进行了优化，有些软件包需要专有编译器，或者在不同的硬件上表现更好。此外，性能可能因图本身的不同而有很大差异。这种复杂性使得手动选择最佳实现变得不可行。我们开发了一种方法来预测并行图处理的性能，使用回归模型和二元分类，通过标记配置是否表现良好。我们在六个图形处理包上展示了我们的方法:GraphMat、Graph500、图形算法平台基准套件、GraphBIG、Galois和PowerGraph，以及四种算法:PageRank、单源最短路径、三角形计数和广度优先搜索。给定一个图，我们的方法可以估计执行时间或建议执行良好的实现和线程数。我们的方法在97%的测试用例中正确地识别出性能良好的配置。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Performance and Recommendation System for Parallel Graph Processing Implementations: Work-In-Progress

There are nearly one hundred parallel and distributed graph processing packages. Selecting the best package for a given problem is difficult; some packages require GPUs, some are optimized for distributed or shared memory, and some require proprietary compilers or perform better on different hardware. Furthermore, performance may vary wildly depending on the graph itself. This complexity makes selecting the optimal implementation manually infeasible. We develop an approach to predict the performance of parallel graph processing using both regression models and binary classification by labeling configurations as either well-performing or not. We demonstrate our approach on six graph processing packages: GraphMat, the Graph500, the Graph Algorithm Platform Benchmark Suite, GraphBIG, Galois, and PowerGraph and on four algorithms: PageRank, single-source shortest paths, triangle counting, and breadth first search. Given a graph, our method can estimate execution time or suggest an implementation and thread count expected to perform well. Our method correctly identifies well-performing configurations in 97% of test cases.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Companion of the 2019 ACM/SPEC International Conference on Performance Engineering

自引率

0.00%

发文量