{"title":"基于出处的科学工作流搜索","authors":"A. A. Jabal, E. Bertino, Geeth de Mel","doi":"10.1109/eScience.2017.24","DOIUrl":null,"url":null,"abstract":"Due to data intensive and sophisticated tasks in scientific experiments, workflows have been widely used to enable repetitive task automation and data reproducibility. This yields to the need for effective and efficient search mechanisms for scientific workflows discovery as workflow retrieval systems require a model which fulfills several requirements: unification, accuracy, and rich representations. Motivated by the recent uptake in provenance based models for scientific workflow discovery, in this paper, we propose a provenance-based architecture for retrieving workflows. Specifically, the paper presents an architecture which transforms data provenance into workflows and then organizes data into a set of indexes to support efficient querying mechanisms. The architecture enables composite queries supporting three types of search criteria: keywords of workflow tasks, workflow structure patterns, and metadata about workflows–e.g., how often a workflow was used.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"688 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Provenance-Based Scientific Workflow Search\",\"authors\":\"A. A. Jabal, E. Bertino, Geeth de Mel\",\"doi\":\"10.1109/eScience.2017.24\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to data intensive and sophisticated tasks in scientific experiments, workflows have been widely used to enable repetitive task automation and data reproducibility. 
This yields to the need for effective and efficient search mechanisms for scientific workflows discovery as workflow retrieval systems require a model which fulfills several requirements: unification, accuracy, and rich representations. Motivated by the recent uptake in provenance based models for scientific workflow discovery, in this paper, we propose a provenance-based architecture for retrieving workflows. Specifically, the paper presents an architecture which transforms data provenance into workflows and then organizes data into a set of indexes to support efficient querying mechanisms. The architecture enables composite queries supporting three types of search criteria: keywords of workflow tasks, workflow structure patterns, and metadata about workflows–e.g., how often a workflow was used.\",\"PeriodicalId\":137652,\"journal\":{\"name\":\"2017 IEEE 13th International Conference on e-Science (e-Science)\",\"volume\":\"688 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 13th International Conference on e-Science (e-Science)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/eScience.2017.24\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 13th International Conference on e-Science 
(e-Science)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/eScience.2017.24","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Because scientific experiments involve data-intensive and sophisticated tasks, workflows are widely used to automate repetitive tasks and support data reproducibility. This creates a need for effective and efficient search mechanisms for discovering scientific workflows. Workflow retrieval systems require a model that fulfills several requirements: unification, accuracy, and rich representations. Motivated by the recent uptake of provenance-based models for scientific workflow discovery, we propose a provenance-based architecture for retrieving workflows. Specifically, the architecture transforms data provenance into workflows and then organizes the data into a set of indexes to support efficient querying. It enables composite queries that combine three types of search criteria: keywords of workflow tasks, workflow structure patterns, and metadata about workflows (e.g., how often a workflow was used).
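To make the idea of a composite query concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes workflows have already been reconstructed from provenance, and every name in it (`Workflow`, `WorkflowIndex`, `search`, `usage_count`) is hypothetical. Structure patterns are simplified here to a set of labeled task-to-task edges that must all be present; the paper's actual pattern matching and index design may differ.

```python
from __future__ import annotations

from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Workflow:
    """A workflow reconstructed from provenance (hypothetical representation)."""
    wf_id: str
    tasks: dict[str, str]          # task id -> human-readable task label
    edges: set[tuple[str, str]]    # (upstream task id, downstream task id)
    usage_count: int = 0           # metadata: how often the workflow was used


class WorkflowIndex:
    """Illustrative indexes: one for task keywords, one for labeled edges."""

    def __init__(self) -> None:
        self.workflows: dict[str, Workflow] = {}
        self.keyword_index: dict[str, set[str]] = defaultdict(set)
        self.edge_index: dict[tuple[str, str], set[str]] = defaultdict(set)

    def add(self, wf: Workflow) -> None:
        self.workflows[wf.wf_id] = wf
        # Index every word of every task label.
        for label in wf.tasks.values():
            for word in label.lower().split():
                self.keyword_index[word].add(wf.wf_id)
        # Index each edge by the labels of its endpoints.
        for src, dst in wf.edges:
            key = (wf.tasks[src].lower(), wf.tasks[dst].lower())
            self.edge_index[key].add(wf.wf_id)

    def search(self, keywords=(), pattern=(), min_usage=0) -> list[str]:
        """Return ids of workflows that satisfy all three criteria."""
        candidates = set(self.workflows)
        for kw in keywords:                      # keyword criterion
            candidates &= self.keyword_index.get(kw.lower(), set())
        for src_label, dst_label in pattern:     # structure criterion
            key = (src_label.lower(), dst_label.lower())
            candidates &= self.edge_index.get(key, set())
        # Metadata criterion: keep workflows used at least min_usage times.
        return sorted(
            w for w in candidates
            if self.workflows[w].usage_count >= min_usage
        )


index = WorkflowIndex()
index.add(Workflow("wf1", {"t1": "load data", "t2": "align sequences"},
                   {("t1", "t2")}, usage_count=5))
index.add(Workflow("wf2", {"t1": "load data", "t2": "plot results"},
                   {("t1", "t2")}, usage_count=1))

result = index.search(
    keywords=["load"],
    pattern=[("load data", "align sequences")],
    min_usage=2,
)
print(result)
```

In this sketch each criterion narrows the candidate set by set intersection, so the query only touches the index entries it names rather than scanning every workflow. Only the final metadata filter reads the workflow records themselves.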