Improving the locality of the sparse matrix-vector product on shared memory multiprocessors

12th Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2004. Proceedings. Pub Date : 2004-03-08 DOI:10.1109/EMPDP.2004.1271429

J. C. Pichel, D. Heras, J. C. Cabaleiro, F. F. Rivera

引用次数: 34

Abstract

We extend a model of locality and the subsequent process of locality improvement previously developed for the case of sparse algebra codes in monoprocessors to the case of NUMA shared memory multiprocessors (SMPs). In particular the product of a sparse matrix by a dense vector (SpM/spl times/V) is studied. In the model, locality is established at run-time considering parameters that describe the structure of the sparse matrix involved in the computations. The problem of increasing the locality is formulated as a graph problem, whose solution indicates some appropriate reordering of rows and columns of the sparse matrix. The reordering algorithms were tested for a broad set of matrices. We have also performed a comparison with other reordering algorithms. The results lead to general conclusions about improving SMP performance for other sparse algebra codes.

查看原文本刊更多论文

改进共享内存多处理器上稀疏矩阵向量积的局部性

我们将先前针对单处理器稀疏代数码的局部性模型和随后的局部性改进过程扩展到NUMA共享内存多处理器(SMPs)的情况。特别研究了稀疏矩阵与密集向量(SpM/spl乘以/V)的乘积。在该模型中，考虑描述计算中涉及的稀疏矩阵结构的参数，在运行时建立局部性。将增加局部性问题表述为一个图问题，其解表明对稀疏矩阵的行和列进行适当的重新排序。在广泛的矩阵集上测试了重新排序算法。我们还与其他排序算法进行了比较。研究结果对提高其他稀疏代数码的SMP性能有一定的指导意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

12th Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2004. Proceedings.

自引率

0.00%

发文量