On optimization of scientific workflows to support streaming applications in distributed network environments
Qi-Shi Wu, Yi Gu, Xukang Lu, Mengxia Zhu, P. Brown, Wuyin Lin, Yangang Liu. The 5th Workshop on Workflows in Support of Large-Scale Science (WORKS 2010), 17 December 2010. DOI: https://doi.org/10.1109/WORKS.2010.5671851

Abstract: Large-scale data-intensive streaming applications in various science fields feature complex DAG-structured workflows comprised of distributed computing modules with intricate inter-module dependencies. Supporting such workflows in high-performance network environments and optimizing their throughput are crucial to collaborative scientific exploration and discovery. We formulate workflow mapping as a frame rate optimization problem and propose an efficient heuristic solution, which is integrated into the Condor-based Scientific Workflow Automation and Management Platform (SWAMP) in place of Condor's default mapping scheme. The SWAMP system is also augmented with several new components to improve the workflow management process. The performance superiority of the proposed solution is verified using both simulations and a real-life scientific workflow for climate modeling deployed in a distributed heterogeneous network environment.
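The frame-rate objective mentioned in the abstract can be illustrated with a toy model: in a pipelined workflow mapping, the steady-state frame rate is bounded by the slowest stage. The numbers and the simple max-based bottleneck model below are illustrative assumptions, not the paper's actual heuristic.

```python
# Toy illustration of the frame-rate objective for a pipelined workflow
# mapping. Stage times are invented; this is not the paper's heuristic.

def frame_rate(stages):
    """Steady-state frame rate of a pipeline is limited by its slowest
    stage: rate = 1 / max(compute_time, transfer_time) over all stages."""
    bottleneck = max(max(s["compute"], s["transfer"]) for s in stages)
    return 1.0 / bottleneck

# Each stage: module compute time on its assigned node, plus the time to
# ship one frame of data over the link to the next node (seconds).
mapping = [
    {"compute": 0.8, "transfer": 0.2},
    {"compute": 1.5, "transfer": 0.5},   # bottleneck stage
    {"compute": 0.6, "transfer": 0.1},
]
print(frame_rate(mapping))  # bounded by the 1.5 s stage: 1/1.5 frames/s
```

A mapping heuristic in this setting would search over module-to-node assignments to raise the bottleneck stage's rate, rather than minimizing total latency.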
A history-tracing XML-based provenance framework for workflows
M. Gerhards, A. Belloum, F. Berretz, V. Sander, S. Skorupa. WORKS 2010. DOI: https://doi.org/10.1109/WORKS.2010.5671873

Abstract: The ability to validate and reproduce the outcome of computational processes is fundamental to many application domains. Assuring the provenance of workflows will likely become even more important as emerging standards such as WS-HumanTask incorporate human tasks into standard workflows. This paper addresses this trend with an actor-based workflow approach that actively supports provenance. It proposes a framework, applicable to various workflow management systems, that tracks and stores provenance information automatically. In particular, the framework supports the documentation of workflows in a legally binding way, using the concept of layered XML documents, i.e. history-tracing XML. It also enables the executors (actors) of a particular workflow task to attest their operations and the associated results by integrating digital XML signatures.
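The layering idea can be sketched very simply: each workflow step wraps the previous provenance document inside a new outer element and attests it. The element names below are hypothetical, and a SHA-256 digest stands in for a real XML digital signature; this is a structural illustration, not the paper's schema.

```python
# Minimal sketch of "history-tracing XML": each step nests the previous
# provenance document inside a new, attested layer. Element names are
# invented, and a hash stands in for a proper XML signature.
import hashlib
import xml.etree.ElementTree as ET

def add_layer(prev_xml: str, actor: str, operation: str) -> str:
    digest = hashlib.sha256(prev_xml.encode()).hexdigest()
    layer = ET.Element("layer", {"actor": actor, "operation": operation,
                                 "prev_digest": digest})
    layer.append(ET.fromstring(prev_xml))  # history is nested, never replaced
    return ET.tostring(layer, encoding="unicode")

doc = "<workflow name='run-42'/>"
doc = add_layer(doc, "alice", "preprocess")
doc = add_layer(doc, "bob", "analyze")
# The outermost layer transitively covers the whole history: altering any
# inner layer invalidates every digest attested above it.
```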
Taming complex bioinformatics workflows with Weaver, Makeflow, and Starch
A. Thrasher, Rory Carmichael, Peter Bui, Li Yu, D. Thain, S. Emrich. WORKS 2010. DOI: https://doi.org/10.1109/WORKS.2010.5671858

Abstract: In this paper we discuss challenges of common bioinformatics applications when deployed outside their initial development environments. We propose a three-tiered approach to mitigate some of these issues by leveraging an encapsulation tool, a high-level workflow language, and a portable intermediary. As a case study, we apply this approach to refactor a custom EST analysis pipeline. The Starch tool encapsulates program dependencies to simplify task specification and deployment. The Weaver language provides abstractions for distributed computing and naturally encourages code modularity. The Makeflow workflow engine provides a batch-system-agnostic engine to execute compiled Weaver code. To illustrate the benefits of our framework, we compare implementations, show their performance, and discuss benefits derived from our new workflow approach relative to traditional bioinformatics development.
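Makeflow uses a Make-like rule syntax: each rule names its output files, its input files, and the command that produces the former from the latter. The fragment below is a hypothetical two-stage split/analyze/merge pipeline with invented file and program names, just to show the shape of such a file.

```make
# Hypothetical Makeflow fragment (file and program names invented):
# rule form is  outputs : inputs  followed by a tab-indented command.
part.1.fa part.2.fa: ests.fa
	./split_fasta ests.fa 2

hits.1.txt: part.1.fa
	./analyze part.1.fa > hits.1.txt

hits.2.txt: part.2.fa
	./analyze part.2.fa > hits.2.txt

report.txt: hits.1.txt hits.2.txt
	cat hits.1.txt hits.2.txt > report.txt
```

Because the two `analyze` rules share no dependencies, the engine can dispatch them concurrently to whichever batch system is configured, without any change to the workflow file — the "batch-system-agnostic" property the abstract refers to.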
Network resource selection for data transfer processes in scientific workflows
Zhiming Zhao, P. Grosso, R. Koning, J. van der Ham, C. de Laat. WORKS 2010. DOI: https://doi.org/10.1109/WORKS.2010.5671840

Abstract: Quality of service (QoS) plays an important role throughout the life-cycle of scientific workflows for composing and executing applications. However, the quality of network services has so far rarely been considered in composing and executing scientific workflows. Currently, scientific applications tune execution quality by selecting only optimal software services and computing resources, neglecting network resources. One reason is that IP-based networks give workflow systems few possibilities to manage service quality, limiting or preventing bandwidth reservation and network path selection. We nonetheless see a strong need from scientific applications, and from network operators, to include network quality management in workflow systems. In this paper, we discuss our ongoing research on this issue and present a semantic-based solution for searching network resources with awareness of QoS requirements. The solution aims to complement existing workflow systems in selecting network resources in the context of workflow composition, scheduling, and execution when advanced network services are available. Our research is conducted in the context of the CineGrid project.
BReW: Blackbox resource selection for e-Science workflows
Yogesh L. Simmhan, Emad Soroush, C. Ingen, Deb Agarwal, L. Ramakrishnan. WORKS 2010. DOI: https://doi.org/10.1109/WORKS.2010.5671857

Abstract: Workflows are commonly used to model data-intensive scientific analysis. As computational resource needs increase for eScience, emerging platforms like clouds present additional resource choices for scientists and policy makers. We introduce BReW, a tool that enables users to make rapid, high-level platform selections for their workflows using limited workflow knowledge. This helps them make informed decisions on whether to port a workflow to a new platform. Our analysis of synthetic and real eScience workflows shows that, using just total runtime length, maximum task fanout, and total data used and produced by the workflow, BReW can provide platform predictions comparable to whitebox models with detailed workflow knowledge.
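A blackbox comparison using only those three coarse features might be shaped like the toy model below. The cost formula, the platform parameters, and all numbers are invented for illustration; BReW's actual model is not described here.

```python
# Toy "blackbox" platform comparison using only the three coarse features
# named in the abstract. All parameters are made up for illustration and
# are not BReW's actual model.

def estimate_makespan(total_cpu_hours, max_fanout, data_gb, platform):
    slots = min(max_fanout, platform["cores"])      # usable parallelism
    compute = total_cpu_hours / slots               # ideal parallel compute
    staging = data_gb / platform["io_gb_per_hour"]  # move data in and out
    return compute + staging                        # hours

cluster = {"cores": 32,  "io_gb_per_hour": 200}  # fast local I/O, few cores
cloud   = {"cores": 256, "io_gb_per_hour": 50}   # many cores, slow WAN I/O

for name, p in [("cluster", cluster), ("cloud", cloud)]:
    print(name, estimate_makespan(100, 64, 500, p))
```

Even this crude model captures the key trade-off such a tool must weigh: a data-heavy workflow can finish sooner on a smaller cluster with fast I/O than on a larger cloud behind a slow transfer path.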
The 5th Workshop on Workflows in Support of Large-Scale Science in conjunction with SC 10
E. Deelman, I. Taylor. WORKS 2010. DOI: https://doi.org/10.1109/WORKS.2010.5671879

Abstract: Scientific workflows are a key technology that enables large-scale computations and service management on distributed resources. Workflows enable scientists to design complex analyses composed of individual application components or services, and often such components and services are designed, developed, and tested collaboratively.
Streaming satellite data to cloud workflows for on-demand computing of environmental data products
Daniel Zinn, Q. Hart, Bertram Ludäscher, Yogesh L. Simmhan. WORKS 2010. DOI: https://doi.org/10.1109/WORKS.2010.5671841

Abstract: Environmental data arriving constantly from satellites and weather stations are used to compute weather coefficients that are essential for agriculture and viticulture. For example, the reference evapotranspiration (ET0) coefficient, overlaid on regional maps, is provided each day by the California Department of Water Resources to local farmers and turf managers to plan daily water use. Scaling out single-processor compute/data-intensive applications operating on realtime data to support more users and higher-resolution data poses data engineering challenges. Cloud computing helps data providers expand resource capacity to meet growing needs, besides supporting scientific needs like reprocessing historic data using new models. In this article, we examine the migration of a legacy script used for daily ET0 computation by CIMIS to a workflow model that eases deployment to, and scaling on, the Windows Azure Cloud. Our architecture incorporates a direct streaming model into Cloud virtual machines (VMs) that improves performance for our workflow by 130% to 160% over the common approach of staging data through Cloud storage. The streaming workflows achieve runtimes comparable to desktop execution for single VMs and a linear speed-up when using multiple VMs, thus allowing computation of environmental coefficients at a much larger resolution than done presently.
Linking multiple workflow provenance traces for interoperable collaborative science
P. Missier, Bertram Ludäscher, S. Bowers, Saumen C. Dey, A. Sarkar, B. Shrestha, I. Altintas, M. Anand, C. Goble. WORKS 2010. DOI: https://doi.org/10.1109/WORKS.2010.5671861

Abstract: Scientific collaboration increasingly involves data sharing between separate groups. We consider a scenario where data products of scientific workflows are published and then used by other researchers as inputs to their workflows. For proper interpretation, shared data must be complemented by descriptive metadata. We focus on provenance traces, a prime example of such metadata, which describe the genesis and processing history of data products in terms of the computational workflow steps. Through the reuse of published data, virtual, implicitly collaborative experiments emerge, making it desirable to compose the independently generated traces into global ones that describe the combined executions as single, seamless experiments. We present a model for provenance sharing that realizes this holistic view by overcoming the various interoperability problems that emerge from the heterogeneity of workflow systems, data formats, and provenance models. At the heart lie (i) an abstract workflow and provenance model in which (ii) data sharing becomes itself part of the combined workflow. We then describe an implementation of our model that we developed in the context of the Data Observation Network for Earth (DataONE) project and that can "stitch together" traces from different Kepler and Taverna workflow runs. It provides a prototypical framework for seamless cross-system, collaborative provenance management and can be easily extended to include other systems. Our approach also opens the door to new ways of workflow interoperability, not only through often elusive workflow standards but through shared provenance information from public repositories.
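The "stitching" idea can be sketched as joining two derivation graphs on the identifier of a shared published data product. The trace contents and identifiers below are invented examples, and the flat edge-list representation is a deliberate simplification of real provenance models.

```python
# Sketch of stitching two independently produced provenance traces into one
# graph by joining on the id of the shared (published) data product.
# Trace contents and identifiers are invented examples.

def stitch(*traces):
    """Each trace is a list of (source, step, target) derivations; the
    combined trace is their union, linked wherever a data id appears as a
    target in one trace and a source in another."""
    return [edge for trace in traces for edge in trace]

def lineage(combined, data_id):
    """Walk derivations backwards from data_id across trace boundaries."""
    history = []
    for src, step, tgt in combined:
        if tgt == data_id:
            history.append((src, step, tgt))
            history.extend(lineage(combined, src))
    return history

kepler_trace  = [("raw.csv", "clean", "doi:10.x/clean.csv")]
taverna_trace = [("doi:10.x/clean.csv", "align", "result.nex")]

combined = stitch(kepler_trace, taverna_trace)
# The lineage query crosses the system boundary through the shared DOI:
print(lineage(combined, "result.nex"))
```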
Workflow-based comparison of two Distributed Computing Infrastructures
J. Montagnat, T. Glatard, Damien Reimert, K. Maheshwari, E. Caron, F. Desprez. WORKS 2010. DOI: https://doi.org/10.1109/WORKS.2010.5671856

Abstract: Porting applications to Distributed Computing Infrastructures (DCIs) is eased by the use of workflow abstractions. Yet, estimating the impact of the execution DCI on application performance is difficult due to the heterogeneity of the available resources, middleware, and operation models. This paper describes a workflow-based experimental method to acquire objective performance comparison criteria when dealing with completely different DCIs. Experiments were conducted on the European EGI and the French Grid'5000 infrastructures to highlight raw performance variations and identify their causes. The results also show that it is possible to conduct experiments on a production infrastructure with reproducibility similar to that of an experimental platform.
Using SchedFlow for performance evaluation of workflow applications
Gustavo Martínez, E. Heymann, Miguel Angel Senar, E. Luque, B. Miller. WORKS 2010. DOI: https://doi.org/10.1109/WORKS.2010.5671864

Abstract: Computational science increasingly relies on the execution of workflows in distributed networks to solve complex applications. However, the heterogeneity of resources in these environments complicates resource management and the scheduling of such applications. Sophisticated scheduling policies are being developed for workflows, but they have had little impact in practice because their integration into existing workflow engines is complex and time-consuming, as each policy has to be individually ported to a particular workflow engine. In addition, choosing a particular scheduling policy is difficult, as factors like machine availability, workload, and communication volume between tasks are hard to predict. In this paper, we describe SchedFlow, a tool that integrates scheduling policies into workflow engines such as Taverna, DAGMan, or Karajan. We show how SchedFlow was used to take advantage of different scheduling policies at different times, depending on the dynamic workload of the workflows. Our experiments included two real workflow applications and four different scheduling policies. We show that no single scheduling policy is best for all scenarios, so tools like SchedFlow can improve performance by providing flexibility when scheduling workflows.