Why workflows break — Understanding and combating decay in Taverna workflows

Jun Zhao, José Manuél Gómez-Pérez, Khalid Belhajjame, G. Klyne, Esteban García-Cuesta, Aleix Garrido, K. Hettne, M. Roos, D. D. Roure, C. Goble
{"title":"Why workflows break — Understanding and combating decay in Taverna workflows","authors":"Jun Zhao, José Manuél Gómez-Pérez, Khalid Belhajjame, G. Klyne, Esteban García-Cuesta, Aleix Garrido, K. Hettne, M. Roos, D. D. Roure, C. Goble","doi":"10.1109/ESCIENCE.2012.6404482","DOIUrl":null,"url":null,"abstract":"Workflows provide a popular means for preserving scientific methods by explicitly encoding their process. However, some of them are subject to a decay in their ability to be re-executed or reproduce the same results over time, largely due to the volatility of the resources required for workflow executions. This paper provides an analysis of the root causes of workflow decay based on an empirical study of a collection of Taverna workflows from the myExperiment repository. Although our analysis was based on a specific type of workflow, the outcomes and methodology should be applicable to workflows from other systems, at least those whose executions also rely largely on accessing third-party resources. Based on our understanding about decay we recommend a minimal set of auxiliary resources to be preserved together with the workflows as an aggregation object and provide a software tool for end-users to create such aggregations and to assess their completeness.","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"114","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 8th International Conference on E-Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ESCIENCE.2012.6404482","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 114

Abstract

Workflows provide a popular means for preserving scientific methods by explicitly encoding their process. However, some of them are subject to a decay in their ability to be re-executed or reproduce the same results over time, largely due to the volatility of the resources required for workflow executions. This paper provides an analysis of the root causes of workflow decay based on an empirical study of a collection of Taverna workflows from the myExperiment repository. Although our analysis was based on a specific type of workflow, the outcomes and methodology should be applicable to workflows from other systems, at least those whose executions also rely largely on accessing third-party resources. Based on our understanding about decay we recommend a minimal set of auxiliary resources to be preserved together with the workflows as an aggregation object and provide a software tool for end-users to create such aggregations and to assess their completeness.
为什么工作流中断——理解和对抗Taverna工作流中的衰减
工作流通过显式地对其过程进行编码,为保存科学方法提供了一种流行的方法。然而,随着时间的推移,它们中的一些在重新执行或重现相同结果的能力上受到衰减的影响,这主要是由于工作流执行所需资源的不稳定性。本文基于对myExperiment存储库中的Taverna工作流集合的实证研究,对工作流衰减的根本原因进行了分析。尽管我们的分析是基于特定类型的工作流,但结果和方法应该适用于来自其他系统的工作流,至少是那些执行也主要依赖于访问第三方资源的工作流。基于我们对衰减的理解,我们建议将一组最小的辅助资源与工作流一起保存为一个聚合对象,并为最终用户提供一个软件工具来创建这样的聚合并评估它们的完整性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信