A case for distributed work-stealing in regular applications

Proceedings of the 6th ACM SIGPLAN Workshop on X10 Pub Date : 2016-06-02 DOI:10.1145/2931028.2931035

Brendan Sheridan, Jeremy T. Fineman

引用次数: 2

Abstract

This paper presents a dynamically heterogeneous architecture use-case that is both realistic and favorable for distributed work-stealing in regular parallel applications. Using a straightforward implementation of distributed dense matrix multiplication in X10's Global Load Balancing (GLB) library, we show that moderate differences in node processing power allow work-stealing to significantly outperform a standard static schedule such as SUMMA. It also scales comparably on up to 128 cores.

查看原文本刊更多论文

一个在常规应用程序中分布式窃取工作的案例

本文提出了一个动态异构架构用例，该用例既现实又有利于常规并行应用中的分布式工作窃取。使用X10的全局负载平衡(GLB)库中的分布式密集矩阵乘法的直接实现，我们展示了节点处理能力的适度差异允许偷取工作显著优于SUMMA等标准静态调度。它也可以扩展到128核。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 6th ACM SIGPLAN Workshop on X10

自引率

0.00%

发文量