Large-grain pipelining on hypercube multiprocessors

Conference on Hypercube Concurrent Computers and Applications Pub Date : 1989-01-03 DOI:10.1145/63047.63119

C. King, L.M. Ni

引用次数: 8

Abstract

A new paradigm, called large-grain pipelining, for developing efficient parallel algorithms on distributed-memory multiprocessors, e.g., hypercube machines, is introduced. Large-grain pipelining attempts to maximize the degree of overlapping and minimize the effect of communication overhead in a multiprocessor system through macro-pipelining between the nodes. Algorithms developed through large-grain pipelining to perform matrix multiplication are presented. To model the pipelined computations, an analytic model is introduced, which takes into account both underlying architecture and algorithm behavior. Through the analytic model, important design parameters, such as data partition sizes, can be determined. Experiments were conducted on a 64-node NCUBE multiprocessor. The measured results match closely with the analyzed results, which establishes the analytic model as an integral part of algorithm design. Comparison with an algorithm which does not use large-grain pipelining also shows that large-grain pipelining is an efficient scheme for achieving a greater parallelism.

查看原文本刊更多论文

超立方体多处理器上的大粒度流水线

介绍了一种新的范式，称为大粒度流水线，用于在分布式内存多处理器(如超立方体机器)上开发高效的并行算法。在多处理器系统中，大粒度管道试图通过节点之间的宏管道来最大化重叠程度和最小化通信开销的影响。提出了通过大粒度流水线实现矩阵乘法的算法。为了对流水线计算建模，引入了一个考虑底层架构和算法行为的分析模型。通过分析模型，可以确定重要的设计参数，如数据分区大小。实验在64节点的NCUBE多处理器上进行。实测结果与分析结果吻合较好，建立了作为算法设计组成部分的解析模型。与不使用大粒度流水线的算法进行比较也表明，大粒度流水线是实现更高并行度的有效方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Conference on Hypercube Concurrent Computers and Applications

自引率

0.00%

发文量