分布式环境下科学工作流高效执行的运行时系统。

IF 0.9 4区 计算机科学 Q3 COMPUTER SCIENCE, THEORY & METHODS
George Teodoro, Tulio Tavares, Renato Ferreira, Tahsin Kurc, Wagner Meira, Dorgival Guedes, Tony Pan, Joel Saltz
{"title":"分布式环境下科学工作流高效执行的运行时系统。","authors":"George Teodoro,&nbsp;Tulio Tavares,&nbsp;Renato Ferreira,&nbsp;Tahsin Kurc,&nbsp;Wagner Meira,&nbsp;Dorgival Guedes,&nbsp;Tony Pan,&nbsp;Joel Saltz","doi":"10.1007/s10766-007-0068-8","DOIUrl":null,"url":null,"abstract":"<p><p>Scientific workflow systems have been introduced in response to the demand of researchers from several domains of science who need to process and analyze increasingly larger datasets. The design of these systems is largely based on the observation that data analysis applications can be composed as pipelines or networks of computations on data. In this work, we present a runtime support system that is designed to facilitate this type of computation in distributed computing environments. Our system is optimized for data-intensive workflows, in which efficient management and retrieval of data, coordination of data processing and data movement, and check-pointing of intermediate results are critical and challenging issues. Experimental evaluation of our system shows that linear speedups can be achieved for sophisticated applications, which are implemented as a network of multiple data processing components.</p>","PeriodicalId":14313,"journal":{"name":"International Journal of Parallel Programming","volume":"36 2","pages":"250-266"},"PeriodicalIF":0.9000,"publicationDate":"2008-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10766-007-0068-8","citationCount":"5","resultStr":"{\"title\":\"A Run-time System for Efficient Execution of Scientific Workflows on Distributed Environments.\",\"authors\":\"George Teodoro,&nbsp;Tulio Tavares,&nbsp;Renato Ferreira,&nbsp;Tahsin Kurc,&nbsp;Wagner Meira,&nbsp;Dorgival Guedes,&nbsp;Tony Pan,&nbsp;Joel Saltz\",\"doi\":\"10.1007/s10766-007-0068-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Scientific workflow systems have been introduced in response to the demand of researchers from several domains of science who need to process and analyze increasingly larger datasets. The design of these systems is largely based on the observation that data analysis applications can be composed as pipelines or networks of computations on data. In this work, we present a runtime support system that is designed to facilitate this type of computation in distributed computing environments. Our system is optimized for data-intensive workflows, in which efficient management and retrieval of data, coordination of data processing and data movement, and check-pointing of intermediate results are critical and challenging issues. Experimental evaluation of our system shows that linear speedups can be achieved for sophisticated applications, which are implemented as a network of multiple data processing components.</p>\",\"PeriodicalId\":14313,\"journal\":{\"name\":\"International Journal of Parallel Programming\",\"volume\":\"36 2\",\"pages\":\"250-266\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2008-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1007/s10766-007-0068-8\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Parallel Programming\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s10766-007-0068-8\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Parallel Programming","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10766-007-0068-8","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 5

摘要

科学工作流系统的引入是为了响应来自多个科学领域的研究人员的需求,他们需要处理和分析越来越大的数据集。这些系统的设计很大程度上是基于这样的观察:数据分析应用程序可以组成数据计算的管道或网络。在这项工作中,我们提出了一个运行时支持系统,旨在促进分布式计算环境中的这种类型的计算。我们的系统针对数据密集型工作流程进行了优化,其中数据的有效管理和检索,数据处理和数据移动的协调以及中间结果的检查点是关键和具有挑战性的问题。我们的系统的实验评估表明,线性加速可以实现复杂的应用,这是一个由多个数据处理组件组成的网络。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Run-time System for Efficient Execution of Scientific Workflows on Distributed Environments.

Scientific workflow systems have been introduced in response to the demand of researchers from several domains of science who need to process and analyze increasingly larger datasets. The design of these systems is largely based on the observation that data analysis applications can be composed as pipelines or networks of computations on data. In this work, we present a runtime support system that is designed to facilitate this type of computation in distributed computing environments. Our system is optimized for data-intensive workflows, in which efficient management and retrieval of data, coordination of data processing and data movement, and check-pointing of intermediate results are critical and challenging issues. Experimental evaluation of our system shows that linear speedups can be achieved for sophisticated applications, which are implemented as a network of multiple data processing components.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Parallel Programming
International Journal of Parallel Programming 工程技术-计算机:理论方法
CiteScore
4.40
自引率
0.00%
发文量
15
审稿时长
>12 weeks
期刊介绍: International Journal of Parallel Programming is a forum for the publication of peer-reviewed, high-quality original papers in the computer and information sciences, focusing specifically on programming aspects of parallel computing systems. Such systems are characterized by the coexistence over time of multiple coordinated activities. The journal publishes both original research and survey papers. Fields of interest include: linguistic foundations, conceptual frameworks, high-level languages, evaluation methods, implementation techniques, programming support systems, pragmatic considerations, architectural characteristics, software engineering aspects, advances in parallel algorithms, performance studies, and application studies.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信