科学应用工作流的持久局部性管理

2010 13th IEEE International Conference on Computational Science and Engineering Pub Date : 2010-12-11 DOI:10.1109/CSE.2010.60

Lamine M. Aouad, Mohand Tahar Kechadi, S. Petiton

{"title":"科学应用工作流的持久局部性管理","authors":"Lamine M. Aouad, Mohand Tahar Kechadi, S. Petiton","doi":"10.1109/CSE.2010.60","DOIUrl":null,"url":null,"abstract":"The huge data requirements of large nowadays applications in science and engineering make optimised and scalable data placement mechanisms an essential need. For this purpose, we propose a scheduling scheme based on an efficient data locality management for data-intensive workflows. Transfer and placement decisions are made based on constructions in the workflow, representing inter-relationships between inputs and outputs at its different levels. When running large applications, most of the input data would not be shipped, keeping the data close to the jobs, and resulting on mush less communication and transfer overheads. We have implemented these techniques for the YML workflow system. This paper presents results showing a substantial improvement in the performance of many interdependent multi-level workflows through these data placement optimisations.","PeriodicalId":342688,"journal":{"name":"2010 13th IEEE International Conference on Computational Science and Engineering","volume":"101 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Persistent Locality Management of Scientific Application Workflows\",\"authors\":\"Lamine M. Aouad, Mohand Tahar Kechadi, S. Petiton\",\"doi\":\"10.1109/CSE.2010.60\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The huge data requirements of large nowadays applications in science and engineering make optimised and scalable data placement mechanisms an essential need. For this purpose, we propose a scheduling scheme based on an efficient data locality management for data-intensive workflows. Transfer and placement decisions are made based on constructions in the workflow, representing inter-relationships between inputs and outputs at its different levels. When running large applications, most of the input data would not be shipped, keeping the data close to the jobs, and resulting on mush less communication and transfer overheads. We have implemented these techniques for the YML workflow system. This paper presents results showing a substantial improvement in the performance of many interdependent multi-level workflows through these data placement optimisations.\",\"PeriodicalId\":342688,\"journal\":{\"name\":\"2010 13th IEEE International Conference on Computational Science and Engineering\",\"volume\":\"101 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 13th IEEE International Conference on Computational Science and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSE.2010.60\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 13th IEEE International Conference on Computational Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSE.2010.60","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

当今科学和工程领域的大量应用对数据的需求使得优化和可扩展的数据放置机制成为一种基本需求。为此，我们提出了一种基于高效数据局部性管理的数据密集型工作流调度方案。转移和安置决策是基于工作流中的结构做出的，代表了不同层次的输入和输出之间的相互关系。在运行大型应用程序时，大多数输入数据不会被传送，从而使数据保持在作业附近，从而大大减少了通信和传输开销。我们已经在YML工作流系统中实现了这些技术。本文提出的结果显示，通过这些数据放置优化，许多相互依赖的多层次工作流的性能有了实质性的改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Persistent Locality Management of Scientific Application Workflows

The huge data requirements of large nowadays applications in science and engineering make optimised and scalable data placement mechanisms an essential need. For this purpose, we propose a scheduling scheme based on an efficient data locality management for data-intensive workflows. Transfer and placement decisions are made based on constructions in the workflow, representing inter-relationships between inputs and outputs at its different levels. When running large applications, most of the input data would not be shipped, keeping the data close to the jobs, and resulting on mush less communication and transfer overheads. We have implemented these techniques for the YML workflow system. This paper presents results showing a substantial improvement in the performance of many interdependent multi-level workflows through these data placement optimisations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2010 13th IEEE International Conference on Computational Science and Engineering

自引率

0.00%

发文量