使用SchedFlow对工作流应用程序进行性能评估

The 5th Workshop on Workflows in Support of Large-Scale Science Pub Date : 2010-12-17 DOI:10.1109/WORKS.2010.5671864

Gustavo Martínez, Gustavo Martínez, E. Heymann, Miguel Angel Senar, E. Luque, B. Miller

{"title":"使用SchedFlow对工作流应用程序进行性能评估","authors":"Gustavo Martínez, Gustavo Martínez, E. Heymann, Miguel Angel Senar, E. Luque, B. Miller","doi":"10.1109/WORKS.2010.5671864","DOIUrl":null,"url":null,"abstract":"Computational science increasingly relies on the execution of workflows in distributed networks to solve complex applications. However, the heterogeneity of resources in these environments complicates resource management and the scheduling of such applications. Sophisticated scheduling policies are being developed for workflows, but they have had little impact in practice because their integration into existing workflow engines is complex and time consuming as each policy has to be individually ported to a particular workflow engine. In addition, choosing a particular scheduling policy is difficult, as factors like machine availability, workload, and communication volume between tasks are difficult to predict. In this paper, we describe SchedFlow, a tool that integrates scheduling policies into workflow engines such as Taverna, DAGMan or Karajan. We show how SchedFlow was used to take advantage of different scheduling policies at different times, depending on the dynamic workload of the workflows. Our experiments included two real workflow applications and four different scheduling policies. We show that no single scheduling policy is the best for all scenarios, so tools like SchedFlow can improve performance by providing flexibility when scheduling workflows.","PeriodicalId":400999,"journal":{"name":"The 5th Workshop on Workflows in Support of Large-Scale Science","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Using SchedFlow for performance evaluation of workflow applications\",\"authors\":\"Gustavo Martínez, Gustavo Martínez, E. Heymann, Miguel Angel Senar, E. Luque, B. Miller\",\"doi\":\"10.1109/WORKS.2010.5671864\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computational science increasingly relies on the execution of workflows in distributed networks to solve complex applications. However, the heterogeneity of resources in these environments complicates resource management and the scheduling of such applications. Sophisticated scheduling policies are being developed for workflows, but they have had little impact in practice because their integration into existing workflow engines is complex and time consuming as each policy has to be individually ported to a particular workflow engine. In addition, choosing a particular scheduling policy is difficult, as factors like machine availability, workload, and communication volume between tasks are difficult to predict. In this paper, we describe SchedFlow, a tool that integrates scheduling policies into workflow engines such as Taverna, DAGMan or Karajan. We show how SchedFlow was used to take advantage of different scheduling policies at different times, depending on the dynamic workload of the workflows. Our experiments included two real workflow applications and four different scheduling policies. We show that no single scheduling policy is the best for all scenarios, so tools like SchedFlow can improve performance by providing flexibility when scheduling workflows.\",\"PeriodicalId\":400999,\"journal\":{\"name\":\"The 5th Workshop on Workflows in Support of Large-Scale Science\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The 5th Workshop on Workflows in Support of Large-Scale Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WORKS.2010.5671864\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 5th Workshop on Workflows in Support of Large-Scale Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WORKS.2010.5671864","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

计算科学越来越依赖于在分布式网络中执行工作流来解决复杂的应用。然而，这些环境中资源的异构性使资源管理和此类应用程序的调度变得复杂。人们正在为工作流开发复杂的调度策略，但它们在实践中几乎没有什么影响，因为它们与现有工作流引擎的集成非常复杂且耗时，因为每个策略都必须单独移植到特定的工作流引擎中。此外，选择特定的调度策略也很困难，因为诸如机器可用性、工作负载和任务之间的通信量等因素很难预测。在本文中，我们描述了SchedFlow，一个将调度策略集成到工作流引擎(如Taverna、DAGMan或Karajan)中的工具。我们将展示如何使用SchedFlow在不同时间利用不同的调度策略，这取决于工作流的动态工作负载。我们的实验包括两个真实的工作流应用程序和四种不同的调度策略。我们表明，没有一种调度策略适合所有场景，因此SchedFlow等工具可以通过在调度工作流时提供灵活性来提高性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Using SchedFlow for performance evaluation of workflow applications

Computational science increasingly relies on the execution of workflows in distributed networks to solve complex applications. However, the heterogeneity of resources in these environments complicates resource management and the scheduling of such applications. Sophisticated scheduling policies are being developed for workflows, but they have had little impact in practice because their integration into existing workflow engines is complex and time consuming as each policy has to be individually ported to a particular workflow engine. In addition, choosing a particular scheduling policy is difficult, as factors like machine availability, workload, and communication volume between tasks are difficult to predict. In this paper, we describe SchedFlow, a tool that integrates scheduling policies into workflow engines such as Taverna, DAGMan or Karajan. We show how SchedFlow was used to take advantage of different scheduling policies at different times, depending on the dynamic workload of the workflows. Our experiments included two real workflow applications and four different scheduling policies. We show that no single scheduling policy is the best for all scenarios, so tools like SchedFlow can improve performance by providing flexibility when scheduling workflows.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

The 5th Workshop on Workflows in Support of Large-Scale Science

自引率

0.00%

发文量