Simultaneous scheduling of replication and computation for bioinformatics applications on the grid

F. Desprez, Antoine Vernois
{"title":"Simultaneous scheduling of replication and computation for bioinformatics applications on the grid","authors":"F. Desprez, Antoine Vernois","doi":"10.1109/CLADE.2005.1520903","DOIUrl":null,"url":null,"abstract":"One of the first motivations of using grids comes from applications managing large data sets infield such as high energy physics or life sciences. To improve the global throughput of software environments, replicas are usually put at wisely selected sites. Moreover, computation requests have to be scheduled among the available resources. To get the best performance, scheduling and data replication have to be tightly coupled. However, there are few approaches that provide this coupling. This paper presents an algorithm that combines data management and scheduling using a steady-state approach. Our theoretical results are validated using simulation and logs from a large life science application (ACI GRID GriPPS).","PeriodicalId":330715,"journal":{"name":"CLADE 2005. Proceedings Challenges of Large Applications in Distributed Environments, 2005.","volume":"71 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"CLADE 2005. Proceedings Challenges of Large Applications in Distributed Environments, 2005.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLADE.2005.1520903","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16

Abstract

One of the first motivations of using grids comes from applications managing large data sets infield such as high energy physics or life sciences. To improve the global throughput of software environments, replicas are usually put at wisely selected sites. Moreover, computation requests have to be scheduled among the available resources. To get the best performance, scheduling and data replication have to be tightly coupled. However, there are few approaches that provide this coupling. This paper presents an algorithm that combines data management and scheduling using a steady-state approach. Our theoretical results are validated using simulation and logs from a large life science application (ACI GRID GriPPS).
网格上生物信息学应用复制和计算的同步调度
使用网格的最初动机之一来自于管理大型数据集的应用程序,如高能物理或生命科学。为了提高软件环境的全局吞吐量,副本通常被放置在明智选择的站点上。此外,计算请求必须在可用资源之间进行调度。为了获得最佳性能,调度和数据复制必须紧密耦合。然而,很少有方法可以提供这种耦合。本文提出了一种采用稳态方法将数据管理和调度相结合的算法。我们的理论结果通过模拟和来自大型生命科学应用(ACI GRID GriPPS)的日志进行了验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信