Parallel and Pipelined Parallel Consecutive Sums on a Hypercube with Application to Ray Casting

Int. J. High Speed Comput. Pub Date : 1995-03-01 DOI:10.1142/S0129053395000099

Jianjian Song, R. Shu

引用次数: 0

Abstract

Communication penalty for parallel computation is related to message startup time and speed of data transmission between the host and processing elements (PEs). We propose two algorithms in this paper to show that the first factor can be alleviated by reducing the number of messages and the second by making the host-PE communication concurrent with computation on the PE array. The algorithms perform 2n consecutive sums of 2n numbers each on a hypercube of degree n. The first algorithm leaves one sum on each processor. It takes n steps to complete the sums and reduces the number of messages generated by a PE from 2n to n. The second algorithm sends all the sums back to the host as the sums are generated one by one. It takes 2n+n−1 steps to complete the sums in a pipeline so that one sum is completed every step after the initial (n−1) steps. We apply our second algorithm to the front-to-back composition for ray casting. For large number of rays, the efficiency and speedup of our algorithm are close to theoretically optimal values.

查看原文本刊更多论文

超立方体上的并行和管道并行连续和及其在光线投射中的应用

并行计算的通信损失与消息启动时间和主机与处理元素(pe)之间的数据传输速度有关。本文提出了两种算法，通过减少消息数来缓解第一个因素，通过使主机-PE通信与PE阵列上的计算并行来缓解第二个因素。这些算法在n次的超立方体上执行2n个连续的和，每个和2n个数字。第一种算法在每个处理器上留下一个和。它需要n步来完成求和，并将PE生成的消息数量从2n个减少到n个。第二种算法将所有的和逐个生成，并将所有的和发送回主机。在管道中完成求和需要2n+n−1个步骤，因此在初始(n−1)个步骤之后，每一步都完成一个求和。我们将第二种算法应用于光线投射的前后构图。对于大量的射线，我们的算法的效率和加速接近理论最优值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Int. J. High Speed Comput.

自引率

0.00%

发文量