使用频繁模式的工作流组合工具推荐

Rupika Wijesinghe, Ruwan Weerasinghe
{"title":"使用频繁模式的工作流组合工具推荐","authors":"Rupika Wijesinghe, Ruwan Weerasinghe","doi":"10.1145/3569192.3569204","DOIUrl":null,"url":null,"abstract":"Workflows or pipelines provide a means for executing complex data analysis seamlessly. Composing tools into a workflow is essential in bioinformatics experiments. There are scientific workflow systems such as Taverna and Galaxy that facilitate automatic workflow composition. However, designing workflows using workflow systems becomes more complex with the availability of vast numbers of complex, heterogeneous tools. Connecting such heterogeneous tools to a workflow is error-prone and time-consuming. The objective of the study is to develop a suggestive system for interactive workflow composition using frequent patterns in workflows. The approach basically consists of three main phases: pattern mining, component suggestion, and updating the workflow. Frequent patterns of workflows are identified using frequent subgraph mining techniques and N-gram modeling. The suggested components allow reusing best practice workflows while reducing the time required in composing the workflows. Frequent usage patterns identified can also be used in searching similar workflows in workflow repositories. An interactive workflow composition approach is useful for novice as well as experienced scientists in composing workflows with state-of-the-art tools. The approach enhances the reuse of best practice workflows developed by other users. Such systems would succeed more in future with the availability of more and more workflows in the light of open science","PeriodicalId":249004,"journal":{"name":"Proceedings of the 9th International Conference on Bioinformatics Research and Applications","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tool recommendation for workflow composition using frequent patterns\",\"authors\":\"Rupika Wijesinghe, Ruwan Weerasinghe\",\"doi\":\"10.1145/3569192.3569204\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Workflows or pipelines provide a means for executing complex data analysis seamlessly. Composing tools into a workflow is essential in bioinformatics experiments. There are scientific workflow systems such as Taverna and Galaxy that facilitate automatic workflow composition. However, designing workflows using workflow systems becomes more complex with the availability of vast numbers of complex, heterogeneous tools. Connecting such heterogeneous tools to a workflow is error-prone and time-consuming. The objective of the study is to develop a suggestive system for interactive workflow composition using frequent patterns in workflows. The approach basically consists of three main phases: pattern mining, component suggestion, and updating the workflow. Frequent patterns of workflows are identified using frequent subgraph mining techniques and N-gram modeling. The suggested components allow reusing best practice workflows while reducing the time required in composing the workflows. Frequent usage patterns identified can also be used in searching similar workflows in workflow repositories. An interactive workflow composition approach is useful for novice as well as experienced scientists in composing workflows with state-of-the-art tools. The approach enhances the reuse of best practice workflows developed by other users. Such systems would succeed more in future with the availability of more and more workflows in the light of open science\",\"PeriodicalId\":249004,\"journal\":{\"name\":\"Proceedings of the 9th International Conference on Bioinformatics Research and Applications\",\"volume\":\"71 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 9th International Conference on Bioinformatics Research and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3569192.3569204\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th International Conference on Bioinformatics Research and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3569192.3569204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

工作流或管道提供了一种无缝执行复杂数据分析的方法。在生物信息学实验中,将工具组合成工作流是必不可少的。有科学的工作流系统,如Taverna和Galaxy,可以促进自动工作流组合。然而,随着大量复杂、异构工具的出现,使用工作流系统设计工作流变得更加复杂。将这种异构工具连接到工作流是容易出错且耗时的。本研究的目的是开发一个使用工作流中频繁模式的交互式工作流组合的提示系统。该方法基本上由三个主要阶段组成:模式挖掘、组件建议和更新工作流。使用频繁子图挖掘技术和n图建模来识别工作流的频繁模式。建议的组件允许重用最佳实践工作流,同时减少组合工作流所需的时间。识别出的频繁使用模式也可以用于在工作流存储库中搜索类似的工作流。交互式工作流组合方法对新手和有经验的科学家使用最先进的工具组合工作流都很有用。该方法增强了对其他用户开发的最佳实践工作流的重用。在开放科学的背景下,随着越来越多的工作流程的可用性,这样的系统将在未来取得更大的成功
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Tool recommendation for workflow composition using frequent patterns
Workflows or pipelines provide a means for executing complex data analysis seamlessly. Composing tools into a workflow is essential in bioinformatics experiments. There are scientific workflow systems such as Taverna and Galaxy that facilitate automatic workflow composition. However, designing workflows using workflow systems becomes more complex with the availability of vast numbers of complex, heterogeneous tools. Connecting such heterogeneous tools to a workflow is error-prone and time-consuming. The objective of the study is to develop a suggestive system for interactive workflow composition using frequent patterns in workflows. The approach basically consists of three main phases: pattern mining, component suggestion, and updating the workflow. Frequent patterns of workflows are identified using frequent subgraph mining techniques and N-gram modeling. The suggested components allow reusing best practice workflows while reducing the time required in composing the workflows. Frequent usage patterns identified can also be used in searching similar workflows in workflow repositories. An interactive workflow composition approach is useful for novice as well as experienced scientists in composing workflows with state-of-the-art tools. The approach enhances the reuse of best practice workflows developed by other users. Such systems would succeed more in future with the availability of more and more workflows in the light of open science
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信