RUMR: robust scheduling for divisible workloads

Proceedings of the 12th IEEE International Symposium on High Performance Distributed Computing, 2003. Pub Date: 2003-06-22. DOI: 10.1109/HPDC.2003.1210021

Yang Yang, H. Casanova

Citations: 46

Abstract

Divisible workload applications arise in many fields of science and engineering. They can be parallelized in master-worker fashion, and relevant scheduling strategies have been proposed to reduce application makespan. Our goal is to develop a practical divisible workload scheduling strategy. This requires that previous work be revisited, as several usual assumptions about the computing platform do not hold in practice. We have partially addressed this concern in a previous paper via an algorithm that achieves high performance with realistic resource latency models. In this paper we extend our approach to account for performance prediction errors, which are expected for most real-world platforms and applications. In essence, we combine ideas from multiround divisible workload scheduling, for performance, and from factoring-based scheduling, for robustness. We present simulation results to quantify the benefits of our approach compared to our original algorithm and to other previously proposed algorithms.
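The abstract does not give the algorithm's details, so the following is only a rough illustration of the general idea of pairing a multiround phase with a factoring phase, not the authors' RUMR algorithm. It assumes identical workers, ignores latencies and performance prediction, and uses hypothetical names and parameters (`two_phase_chunks`, `phase1_fraction`, `growth`, `min_chunk`). In this sketch the first phase dispatches chunks that grow from round to round, and the second phase repeatedly hands out half of the remaining work in equal chunks, so chunk sizes shrink toward the end of the run.

```python
def two_phase_chunks(total, workers, phase1_fraction=0.8,
                     rounds=3, growth=2.0, min_chunk=1.0):
    """Return a list of rounds; each round lists one chunk size per worker.

    Illustrative only: all parameters are assumptions, not values from the paper.
    """
    plan = []

    # Phase 1 (multiround style): chunk sizes grow geometrically per round.
    phase1_work = total * phase1_fraction
    weights = [growth ** r for r in range(rounds)]
    scale = phase1_work / sum(weights)
    for w in weights:
        plan.append([scale * w / workers] * workers)

    # Phase 2 (factoring style): each batch splits half of the remaining
    # work equally among the workers, so chunks shrink batch after batch.
    remaining = total - phase1_work
    while remaining > 0:
        chunk = remaining / (2 * workers)
        if chunk < min_chunk:
            # Chunks would become too small: dispatch the rest in one batch.
            plan.append([remaining / workers] * workers)
            break
        plan.append([chunk] * workers)
        remaining -= chunk * workers

    return plan


if __name__ == "__main__":
    for i, batch in enumerate(two_phase_chunks(1000.0, 4)):
        print(f"round {i}: {batch[0]:.3f} units per worker")
```

The intuition behind such a design is that large early chunks keep dispatch overhead low, while small late chunks limit the damage if a worker turns out slower than predicted.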


