{"title":"在Hadoop系统中实现优先调度服务","authors":"Tsozen Yeh, Hsinyi Huang","doi":"10.1109/FiCloud.2018.00015","DOIUrl":null,"url":null,"abstract":"Cloud computing has been widely used in many areas nowadays. It is common that large cloud systems could simultaneously service tens of thousands of users and host an excessive number of jobs running at the same time. Under such circumstances, the completion of urgent or time-critical tasks can be significantly delayed if the underlying cloud system does not offer schemes to speed up the execution of those tasks. Among the platforms adopted in cloud computing, Hadoop is one of the most widely used in the community of cloud computing. Unfortunately, Hadoop does not provide users efficient ways to expedite the course of execution for high-priority jobs which users would hope for their fast completion. We designed and implemented a new scheduling scheme enabling Hadoop to support fully prioritized scheduling. With our scheduling scheme, users can dynamically assign high priority to individual jobs so their execution could be accelerated accordingly. We evaluated our design and implementation by executing the same programs with ordinary priority versus high priority in Hadoop environments under different configurations. Experimental results show that programs can shorten their execution time by up to 82.70% if they are executed with high priority.","PeriodicalId":174838,"journal":{"name":"2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Realizing Prioritized Scheduling Service in the Hadoop System\",\"authors\":\"Tsozen Yeh, Hsinyi Huang\",\"doi\":\"10.1109/FiCloud.2018.00015\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cloud computing has been widely used in many areas nowadays. It is common that large cloud systems could simultaneously service tens of thousands of users and host an excessive number of jobs running at the same time. Under such circumstances, the completion of urgent or time-critical tasks can be significantly delayed if the underlying cloud system does not offer schemes to speed up the execution of those tasks. Among the platforms adopted in cloud computing, Hadoop is one of the most widely used in the community of cloud computing. Unfortunately, Hadoop does not provide users efficient ways to expedite the course of execution for high-priority jobs which users would hope for their fast completion. We designed and implemented a new scheduling scheme enabling Hadoop to support fully prioritized scheduling. With our scheduling scheme, users can dynamically assign high priority to individual jobs so their execution could be accelerated accordingly. We evaluated our design and implementation by executing the same programs with ordinary priority versus high priority in Hadoop environments under different configurations. Experimental results show that programs can shorten their execution time by up to 82.70% if they are executed with high priority.\",\"PeriodicalId\":174838,\"journal\":{\"name\":\"2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FiCloud.2018.00015\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FiCloud.2018.00015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Realizing Prioritized Scheduling Service in the Hadoop System
Cloud computing has been widely used in many areas nowadays. It is common that large cloud systems could simultaneously service tens of thousands of users and host an excessive number of jobs running at the same time. Under such circumstances, the completion of urgent or time-critical tasks can be significantly delayed if the underlying cloud system does not offer schemes to speed up the execution of those tasks. Among the platforms adopted in cloud computing, Hadoop is one of the most widely used in the community of cloud computing. Unfortunately, Hadoop does not provide users efficient ways to expedite the course of execution for high-priority jobs which users would hope for their fast completion. We designed and implemented a new scheduling scheme enabling Hadoop to support fully prioritized scheduling. With our scheduling scheme, users can dynamically assign high priority to individual jobs so their execution could be accelerated accordingly. We evaluated our design and implementation by executing the same programs with ordinary priority versus high priority in Hadoop environments under different configurations. Experimental results show that programs can shorten their execution time by up to 82.70% if they are executed with high priority.