On checkpointing strategies in unreliable computing environments

Proceedings of the 6th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems Pub Date : 2011-11-10 DOI:10.1109/IDAACS.2011.6072739

P. Fiorini

引用次数: 0

Abstract

In this paper, we analyze performance implications of checkpointing strategies in unreliable computing environments. We show that if the appropriate checkpointing strategy is not chosen, the time to complete a job is heavy-tailed distributed. This can lead to highly-variable and long completion times. We generate asymptotics for job completion times when there is no checkpointing, a fixed number of random checkpoints, and when checkpoints occur at fixed intervals for various task time distributions. Our asymptotic results are derived using large deviation theory.

查看原文本刊更多论文

不可靠计算环境下的检查点策略

在本文中，我们分析了检查点策略在不可靠计算环境中的性能影响。我们表明，如果没有选择适当的检查点策略，完成作业的时间将是重尾分布的。这可能导致高度可变和较长的完成时间。当没有检查点、固定数量的随机检查点以及检查点以固定间隔出现在各种任务时间分布中时，我们生成作业完成时间的渐近值。我们的渐近结果是用大偏差理论推导出来的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 6th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems

自引率

0.00%

发文量