Persistent Locality Management of Scientific Application Workflows
Lamine M. Aouad, Mohand Tahar Kechadi, S. Petiton
2010 13th IEEE International Conference on Computational Science and Engineering, 2010-12-11
DOI: 10.1109/CSE.2010.60 (https://doi.org/10.1109/CSE.2010.60)
Citation count: 1
Abstract
The huge data requirements of today's large applications in science and engineering make optimised and scalable data placement mechanisms essential. To this end, we propose a scheduling scheme based on efficient data locality management for data-intensive workflows. Transfer and placement decisions are based on constructs in the workflow that represent the inter-relationships between inputs and outputs at its different levels. When running large applications, most of the input data is not shipped; the data stays close to the jobs, resulting in much lower communication and transfer overheads. We have implemented these techniques for the YML workflow system. This paper presents results showing a substantial performance improvement for many interdependent multi-level workflows through these data placement optimisations.
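As a rough illustration of the locality idea described in the abstract, the sketch below places each workflow task on the node that already holds the largest share of its input data and leaves outputs where they are produced. It is not the YML implementation, and the abstract does not specify the actual algorithm; the greedy rule and the names `schedule`, `tasks`, `initial_location`, `sizes`, and `nodes` are assumptions made for this example only.

```python
from collections import defaultdict


def schedule(tasks, initial_location, sizes, nodes):
    """Greedy locality-aware placement (hypothetical sketch, not YML's scheduler).

    tasks: list of (name, inputs, outputs) tuples, already in dependency order.
    initial_location: dict mapping each initial data item to the node holding it.
    sizes: dict mapping each data item used as an input to its size in bytes.
    nodes: list of node names; earlier nodes win ties.
    Returns (placement, bytes_moved).
    """
    location = dict(initial_location)  # where the primary copy of each item lives
    placement = {}
    bytes_moved = 0

    for name, inputs, outputs in tasks:
        # Sum the input bytes already resident on each node.
        local_bytes = defaultdict(int)
        for item in inputs:
            local_bytes[location[item]] += sizes[item]

        # Run the task where most of its input data already is.
        best = max(nodes, key=lambda node: local_bytes[node])
        placement[name] = best

        # Only inputs that live elsewhere have to be shipped.
        for item in inputs:
            if location[item] != best:
                bytes_moved += sizes[item]

        # Outputs stay on the node that produced them.
        for item in outputs:
            location[item] = best

    return placement, bytes_moved


if __name__ == "__main__":
    nodes = ["n1", "n2"]
    sizes = {"a": 100, "b": 10, "c": 50}
    initial_location = {"a": "n1", "b": "n2"}
    tasks = [("t1", ["a", "b"], ["c"]), ("t2", ["c"], ["d"])]
    print(schedule(tasks, initial_location, sizes, nodes))
```

In this toy workflow, `t1` runs on `n1` because the larger input `a` is there, so only `b` is transferred; `t2` then runs on `n1` as well because its input `c` was left in place by `t1`. A real scheduler would also have to weigh node load and storage capacity, which this sketch ignores.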