在Hadoop系统中实现优先调度服务

2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud) Pub Date : 2018-08-01 DOI:10.1109/FiCloud.2018.00015

Tsozen Yeh, Hsinyi Huang

{"title":"在Hadoop系统中实现优先调度服务","authors":"Tsozen Yeh, Hsinyi Huang","doi":"10.1109/FiCloud.2018.00015","DOIUrl":null,"url":null,"abstract":"Cloud computing has been widely used in many areas nowadays. It is common that large cloud systems could simultaneously service tens of thousands of users and host an excessive number of jobs running at the same time. Under such circumstances, the completion of urgent or time-critical tasks can be significantly delayed if the underlying cloud system does not offer schemes to speed up the execution of those tasks. Among the platforms adopted in cloud computing, Hadoop is one of the most widely used in the community of cloud computing. Unfortunately, Hadoop does not provide users efficient ways to expedite the course of execution for high-priority jobs which users would hope for their fast completion. We designed and implemented a new scheduling scheme enabling Hadoop to support fully prioritized scheduling. With our scheduling scheme, users can dynamically assign high priority to individual jobs so their execution could be accelerated accordingly. We evaluated our design and implementation by executing the same programs with ordinary priority versus high priority in Hadoop environments under different configurations. Experimental results show that programs can shorten their execution time by up to 82.70% if they are executed with high priority.","PeriodicalId":174838,"journal":{"name":"2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Realizing Prioritized Scheduling Service in the Hadoop System\",\"authors\":\"Tsozen Yeh, Hsinyi Huang\",\"doi\":\"10.1109/FiCloud.2018.00015\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cloud computing has been widely used in many areas nowadays. It is common that large cloud systems could simultaneously service tens of thousands of users and host an excessive number of jobs running at the same time. Under such circumstances, the completion of urgent or time-critical tasks can be significantly delayed if the underlying cloud system does not offer schemes to speed up the execution of those tasks. Among the platforms adopted in cloud computing, Hadoop is one of the most widely used in the community of cloud computing. Unfortunately, Hadoop does not provide users efficient ways to expedite the course of execution for high-priority jobs which users would hope for their fast completion. We designed and implemented a new scheduling scheme enabling Hadoop to support fully prioritized scheduling. With our scheduling scheme, users can dynamically assign high priority to individual jobs so their execution could be accelerated accordingly. We evaluated our design and implementation by executing the same programs with ordinary priority versus high priority in Hadoop environments under different configurations. Experimental results show that programs can shorten their execution time by up to 82.70% if they are executed with high priority.\",\"PeriodicalId\":174838,\"journal\":{\"name\":\"2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FiCloud.2018.00015\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FiCloud.2018.00015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

如今，云计算在许多领域得到了广泛的应用。大型云系统可以同时为成千上万的用户提供服务，并同时托管大量运行的作业，这是很常见的。在这种情况下，如果底层云系统不提供加速执行这些任务的方案，则紧急或时间紧迫任务的完成可能会严重延迟。在云计算采用的平台中，Hadoop是云计算社区中使用最广泛的平台之一。不幸的是，Hadoop并没有为用户提供有效的方法来加快高优先级任务的执行过程，而用户希望这些任务能够快速完成。我们设计并实现了一个新的调度方案，使Hadoop能够支持完全优先级调度。使用我们的调度方案，用户可以动态地为单个作业分配高优先级，从而相应地加快它们的执行速度。我们通过在不同配置的Hadoop环境中以普通优先级和高优先级执行相同的程序来评估我们的设计和实现。实验结果表明，高优先级执行的程序最多可缩短82.70%的执行时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Realizing Prioritized Scheduling Service in the Hadoop System

Cloud computing has been widely used in many areas nowadays. It is common that large cloud systems could simultaneously service tens of thousands of users and host an excessive number of jobs running at the same time. Under such circumstances, the completion of urgent or time-critical tasks can be significantly delayed if the underlying cloud system does not offer schemes to speed up the execution of those tasks. Among the platforms adopted in cloud computing, Hadoop is one of the most widely used in the community of cloud computing. Unfortunately, Hadoop does not provide users efficient ways to expedite the course of execution for high-priority jobs which users would hope for their fast completion. We designed and implemented a new scheduling scheme enabling Hadoop to support fully prioritized scheduling. With our scheduling scheme, users can dynamically assign high priority to individual jobs so their execution could be accelerated accordingly. We evaluated our design and implementation by executing the same programs with ordinary priority versus high priority in Hadoop environments under different configurations. Experimental results show that programs can shorten their execution time by up to 82.70% if they are executed with high priority.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud)

自引率

0.00%

发文量