Toward Auto-tuned Krylov Basis Computation for Different Sparse Matrix Formats and Interconnects on GPU Clusters

2015 IEEE International Conference on Cluster Computing Pub Date : 2015-09-08 DOI:10.1109/CLUSTER.2015.153

Langshi Chen, Serge G. Petition

引用次数: 0

Abstract

Krylov subspace methods (KSMs) are widely used in solving large-scale sparse linear problems. The orthogonalization process in methods like GMRES would consume a majority of the time. Since modern manycore architecture based accelerators have provided great horsepowers for computations,communication overheads remain a bottleneck, especially in clusters with a great number of nodes. The HA-PACS/TCA of Tsukuba University is a CPU-GPU hybrid cluster equipped with different interconnects for communications among GPUs. We testa group of Krylov basis computation methods with different sparse matrices and interconnects on HA-PACS/TCA. Results show that an auto-tuning scheme is required to deal with various types of matrices.

查看原文本刊更多论文

GPU集群上不同稀疏矩阵格式和互连的自调Krylov基计算

Krylov子空间方法在求解大规模稀疏线性问题中得到了广泛的应用。在GMRES等方法中，正交化过程将消耗大部分时间。由于基于现代多核架构的加速器为计算提供了巨大的动力，通信开销仍然是一个瓶颈，特别是在具有大量节点的集群中。筑波大学的HA-PACS/TCA是一个CPU-GPU混合集群，配备了不同的互连方式，用于gpu之间的通信。在HA-PACS/TCA上测试了一组具有不同稀疏矩阵和互连方式的Krylov基计算方法。结果表明，需要一种自动调优方案来处理各种类型的矩阵。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE International Conference on Cluster Computing

自引率

0.00%

发文量