{"title":"大规模分布式计算环境下任务调度软状态的鲁棒性研究","authors":"H. Tada, M. Imase, M. Murata","doi":"10.1109/IMCSIT.2008.4747285","DOIUrl":null,"url":null,"abstract":"In this paper, we consider task scheduling in distributed computing. In distributed computing, it is possible that tasks fail, and it is difficult to get accurate information about hosts and tasks. WQR (workqueue with replication), which was proposed by Cirne et al., is a good algorithm because it achieves a short job-completion time without requiring any information about hosts and tasks. However, in order to use WQR for distributed computing, we need to resolve some issues on task failure detection and task cancellation. For this purpose, we examine two approaches-the conventional task timeout method and the soft state method. Simulation results showed that the soft state method is more robust than the task timeout method.","PeriodicalId":267715,"journal":{"name":"2008 International Multiconference on Computer Science and Information Technology","volume":"23 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"On the robustness of the soft state for task scheduling in large-scale distributed computing environment\",\"authors\":\"H. Tada, M. Imase, M. Murata\",\"doi\":\"10.1109/IMCSIT.2008.4747285\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we consider task scheduling in distributed computing. In distributed computing, it is possible that tasks fail, and it is difficult to get accurate information about hosts and tasks. WQR (workqueue with replication), which was proposed by Cirne et al., is a good algorithm because it achieves a short job-completion time without requiring any information about hosts and tasks. However, in order to use WQR for distributed computing, we need to resolve some issues on task failure detection and task cancellation. For this purpose, we examine two approaches-the conventional task timeout method and the soft state method. Simulation results showed that the soft state method is more robust than the task timeout method.\",\"PeriodicalId\":267715,\"journal\":{\"name\":\"2008 International Multiconference on Computer Science and Information Technology\",\"volume\":\"23 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 International Multiconference on Computer Science and Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IMCSIT.2008.4747285\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Multiconference on Computer Science and Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCSIT.2008.4747285","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
摘要
本文主要研究分布式计算中的任务调度问题。在分布式计算中,任务可能会失败,并且很难获得关于主机和任务的准确信息。Cirne等人提出的WQR (workqueue with replication)算法是一种很好的算法,它在不需要任何主机和任务信息的情况下实现了较短的作业完成时间。然而,为了将WQR用于分布式计算,我们需要解决任务失败检测和任务取消的一些问题。为此,我们研究了两种方法——传统的任务超时方法和软状态方法。仿真结果表明,软状态法比任务超时法具有更强的鲁棒性。
On the robustness of the soft state for task scheduling in large-scale distributed computing environment
In this paper, we consider task scheduling in distributed computing. In distributed computing, it is possible that tasks fail, and it is difficult to get accurate information about hosts and tasks. WQR (workqueue with replication), which was proposed by Cirne et al., is a good algorithm because it achieves a short job-completion time without requiring any information about hosts and tasks. However, in order to use WQR for distributed computing, we need to resolve some issues on task failure detection and task cancellation. For this purpose, we examine two approaches-the conventional task timeout method and the soft state method. Simulation results showed that the soft state method is more robust than the task timeout method.