{"title":"MapReduce计算模型中基于节点工作负载的动态插槽任务调度","authors":"Hsin-Yu Shih, Jhih-Jia Huang, Jenq-Shiou Leu","doi":"10.1109/ICASID.2012.6325318","DOIUrl":null,"url":null,"abstract":"MapReduce is becoming a leading large-scale data processing model providing a logical framework for cloud computing. Hadoop, an open-source implementation of MapReduce framework, is widely used for realize such kind of parallel computing model. Nodes in the current Hadoop environment are normally homogeneous. Efficient resource management in clouds is crucial for improving the performance of MapReduce applications and the utilization of resources. However, the original scheduling scheme in Hadoop assign tasks to each node based on the fixed and static number of slots, without considering the physical workload on each node, such as the CPU utilization. This paper aims at proposing a dynamic slot-based task scheduling scheme by considering the physical workload on each node so as to prevent resource underutilization. The evaluation results show the proposed scheme can raise the overall computation efficiency among the heterogeneous nodes in cloud.","PeriodicalId":408223,"journal":{"name":"Anti-counterfeiting, Security, and Identification","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Dynamic slot-based task scheduling based on node workload in a MapReduce computation model\",\"authors\":\"Hsin-Yu Shih, Jhih-Jia Huang, Jenq-Shiou Leu\",\"doi\":\"10.1109/ICASID.2012.6325318\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"MapReduce is becoming a leading large-scale data processing model providing a logical framework for cloud computing. Hadoop, an open-source implementation of MapReduce framework, is widely used for realize such kind of parallel computing model. Nodes in the current Hadoop environment are normally homogeneous. Efficient resource management in clouds is crucial for improving the performance of MapReduce applications and the utilization of resources. However, the original scheduling scheme in Hadoop assign tasks to each node based on the fixed and static number of slots, without considering the physical workload on each node, such as the CPU utilization. This paper aims at proposing a dynamic slot-based task scheduling scheme by considering the physical workload on each node so as to prevent resource underutilization. The evaluation results show the proposed scheme can raise the overall computation efficiency among the heterogeneous nodes in cloud.\",\"PeriodicalId\":408223,\"journal\":{\"name\":\"Anti-counterfeiting, Security, and Identification\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Anti-counterfeiting, Security, and Identification\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASID.2012.6325318\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Anti-counterfeiting, Security, and Identification","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASID.2012.6325318","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dynamic slot-based task scheduling based on node workload in a MapReduce computation model
MapReduce is becoming a leading large-scale data processing model providing a logical framework for cloud computing. Hadoop, an open-source implementation of MapReduce framework, is widely used for realize such kind of parallel computing model. Nodes in the current Hadoop environment are normally homogeneous. Efficient resource management in clouds is crucial for improving the performance of MapReduce applications and the utilization of resources. However, the original scheduling scheme in Hadoop assign tasks to each node based on the fixed and static number of slots, without considering the physical workload on each node, such as the CPU utilization. This paper aims at proposing a dynamic slot-based task scheduling scheme by considering the physical workload on each node so as to prevent resource underutilization. The evaluation results show the proposed scheme can raise the overall computation efficiency among the heterogeneous nodes in cloud.