Simultaneous scheduling of replication and computation for bioinformatics applications on the grid

CLADE 2005. Proceedings Challenges of Large Applications in Distributed Environments, 2005. Pub Date: 2005-11-10. DOI: 10.1109/CLADE.2005.1520903

F. Desprez, Antoine Vernois

Citations: 16

Abstract

One of the first motivations for using grids comes from applications that manage large data sets in fields such as high-energy physics or the life sciences. To improve the global throughput of software environments, replicas are usually placed at carefully selected sites. Moreover, computation requests have to be scheduled among the available resources. To get the best performance, scheduling and data replication have to be tightly coupled. However, few approaches provide this coupling. This paper presents an algorithm that combines data management and scheduling using a steady-state approach. Our theoretical results are validated using simulation and logs from a large life science application (ACI GRID GriPPS).
