A Task Parallel Algorithm for Computing the Costs of All-Pairs Shortest Paths on the CUDA-Compatible GPU

2008 IEEE International Symposium on Parallel and Distributed Processing with Applications Pub Date : 2008-12-10 DOI:10.1109/ISPA.2008.40

T. Okuyama, Fumihiko Ino, K. Hagihara

引用次数: 24

Abstract

This paper proposes a fast method for computing the costs of all-pairs shortest paths (APSPs) on the graphics processing unit (GPU). The proposed method is implemented using compute unified device architecture (CUDA), which offers us a development environment for performing general-purpose computation on the GPU. Our method is based on Harish's iterative algorithm that computes the cost of the single-source shortest path (SSSP) for every source vertex. We present that exploiting task parallelism in the APSP problem allows us to efficiently use on-chip memory in the GPU, reducing the amount of data being transferred from relatively slower off-chip memory. Furthermore, our task parallel scheme is useful to exploit a higher parallelism, increasing the efficiency with highly threaded code. As a result, our method is 3.4--15 times faster than the prior method. Using on-chip memory, our method eliminates approximately 20% of data loads from off-chip memory.

查看原文本刊更多论文

在cuda兼容GPU上计算全对最短路径代价的任务并行算法

提出了一种在图形处理单元(GPU)上快速计算全对最短路径(apsp)代价的方法。该方法采用计算统一设备架构(CUDA)实现，为在GPU上执行通用计算提供了一个开发环境。我们的方法基于Harish的迭代算法，该算法计算每个源顶点的单源最短路径(SSSP)的代价。我们提出在APSP问题中利用任务并行性使我们能够有效地使用GPU中的片上内存，减少从相对较慢的片外内存传输的数据量。此外，我们的任务并行方案有助于利用更高的并行性，提高高线程代码的效率。因此，我们的方法比之前的方法快3.4- 15倍。使用片上内存，我们的方法消除了来自片外内存的大约20%的数据负载。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 IEEE International Symposium on Parallel and Distributed Processing with Applications

自引率

0.00%

发文量