A Web Service-Enabled Distributed Workflow System for Scientific Data Processing

Rajesh Kalyanam, Lan Zhao, Taezoon Park, S. Goasguen
Published in: 11th IEEE International Workshop on Future Trends of Distributed Computing Systems (FTDCS'07)
Publication date: 2007-03-21
DOI: 10.1109/FTDCS.2007.9 (https://doi.org/10.1109/FTDCS.2007.9)
Citations: 7

Abstract

This paper presents the design and implementation of a distributed, data-driven workflow system built on top of the TeraGrid infrastructure. The workflow system is based on a data management architecture that provides easy access to scientific data collections via the TeraGrid network. It allows researchers to construct scientific workflows for data discovery, access, transformation, and analysis. The system leverages JOpera, an open-source workflow engine and visual composer, together with a set of Web service-based data and computation modules. To demonstrate its effectiveness, we create an end-to-end climate simulation data analysis workflow that connects the data management architecture to TeraGrid computation resources. We also develop a workflow monitoring service to keep track of distributed workflow execution.
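
As a rough illustration of the data-driven pattern the abstract describes (discovery, access, transformation, and analysis stages chained together, with a monitor tracking execution), the following minimal Python sketch may help. It is not the paper's implementation and does not use JOpera or any TeraGrid service: every name in it (`Monitor`, `run_workflow`, the four stage functions, and the in-memory catalog and data values) is a hypothetical stand-in chosen for this example, and in the real system each stage would presumably be a Web service call rather than a local function.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, List, Tuple


@dataclass
class Monitor:
    """Hypothetical monitor: collects (stage, status) events during a run."""
    events: List[Tuple[str, str]] = field(default_factory=list)

    def record(self, stage: str, status: str) -> None:
        self.events.append((stage, status))


def run_workflow(stages: List[Tuple[str, Callable[[Any], Any]]],
                 initial: Any,
                 monitor: Monitor) -> Any:
    """Run stages in order; each stage's output is the next stage's input."""
    data = initial
    for name, func in stages:
        monitor.record(name, "started")
        try:
            data = func(data)
        except Exception:
            monitor.record(name, "failed")
            raise
        monitor.record(name, "finished")
    return data


# The four functions below are local stand-ins (assumptions) for what
# would be remote data and computation modules in a real deployment.

def discover(query: str) -> List[str]:
    """Data discovery: map a query to matching dataset identifiers."""
    catalog = {"surface_temp": ["run1", "run2"]}  # made-up catalog
    return catalog.get(query, [])


def access(dataset_ids: List[str]) -> List[List[float]]:
    """Data access: fetch the values for each dataset identifier."""
    store = {"run1": [14.0, 15.0, 16.0], "run2": [15.0, 16.0, 17.0]}  # made-up data
    return [store[d] for d in dataset_ids]


def transform(series_list: List[List[float]]) -> List[List[float]]:
    """Transformation: convert degrees Celsius to kelvin."""
    return [[v + 273.15 for v in series] for series in series_list]


def analyze(series_list: List[List[float]]) -> float:
    """Analysis: mean over all values in all series."""
    values = [v for series in series_list for v in series]
    return sum(values) / len(values)


STAGES = [
    ("discover", discover),
    ("access", access),
    ("transform", transform),
    ("analyze", analyze),
]

if __name__ == "__main__":
    monitor = Monitor()
    mean_kelvin = run_workflow(STAGES, "surface_temp", monitor)
    print(mean_kelvin)
    for stage, status in monitor.events:
        print(stage, status)
```

The design choice worth noting is that the runner knows nothing about the stages beyond their names and call signatures, so a stage can be replaced (for example, by a function that invokes a remote service) without touching the runner or the monitor.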