Detecting common scientific workflow fragments using templates and execution provenance

D. Garijo, Óscar Corcho, Y. Gil
{"title":"Detecting common scientific workflow fragments using templates and execution provenance","authors":"D. Garijo, Óscar Corcho, Y. Gil","doi":"10.1145/2479832.2479848","DOIUrl":null,"url":null,"abstract":"Provenance plays a major role when understanding and reusing the methods applied in a scientific experiment, as it provides a record of inputs, the processes carried out and the use and generation of intermediate and final results. In the specific case of in-silico scientific experiments, a large variety of scientific workflow systems (e.g., Wings, Taverna, Galaxy, Vistrails) have been created to support scientists. All of these systems produce some sort of provenance about the executions of the workflows that encode scientific experiments. However, provenance is normally recorded at a very low level of detail, which complicates the understanding of what happened during execution. In this paper we propose an approach to automatically obtain abstractions from low-level provenance data by finding common workflow fragments on workflow execution provenance and relating them to templates. We have tested our approach with a dataset of workflows published by the Wings workflow system. Our results show that by using these kinds of abstractions we can highlight the most common abstract methods used in the executions of a repository, relating different runs and workflow templates with each other.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the seventh international conference on Knowledge capture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2479832.2479848","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 31

Abstract

Provenance plays a major role when understanding and reusing the methods applied in a scientific experiment, as it provides a record of inputs, the processes carried out and the use and generation of intermediate and final results. In the specific case of in-silico scientific experiments, a large variety of scientific workflow systems (e.g., Wings, Taverna, Galaxy, Vistrails) have been created to support scientists. All of these systems produce some sort of provenance about the executions of the workflows that encode scientific experiments. However, provenance is normally recorded at a very low level of detail, which complicates the understanding of what happened during execution. In this paper we propose an approach to automatically obtain abstractions from low-level provenance data by finding common workflow fragments on workflow execution provenance and relating them to templates. We have tested our approach with a dataset of workflows published by the Wings workflow system. Our results show that by using these kinds of abstractions we can highlight the most common abstract methods used in the executions of a repository, relating different runs and workflow templates with each other.
使用模板和执行来源检测常见的科学工作流片段
在理解和重用科学实验中应用的方法时,来源起着重要作用,因为它提供了输入、进行的过程以及中间和最终结果的使用和生成的记录。在硅科学实验的具体情况下,已经创建了各种各样的科学工作流程系统(例如,Wings, Taverna, Galaxy, Vistrails)来支持科学家。所有这些系统都产生了某种关于编码科学实验的工作流执行的来源。然而,来源通常记录在非常低的细节水平上,这使得对执行期间发生的事情的理解变得复杂。本文提出了一种从低级来源数据中自动获取抽象的方法,该方法通过查找工作流执行来源上的常见工作流片段并将其与模板关联来实现。我们已经用Wings工作流系统发布的工作流数据集测试了我们的方法。我们的结果表明,通过使用这些类型的抽象,我们可以突出显示存储库执行中使用的最常见的抽象方法,将不同的运行和工作流模板彼此关联起来。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信