基于进化方法的推测任务执行调度

D. Vinutha, G. Raju
{"title":"基于进化方法的推测任务执行调度","authors":"D. Vinutha, G. Raju","doi":"10.1109/ICAIT47043.2019.8987236","DOIUrl":null,"url":null,"abstract":"Hadoop is an open source framework to implement MapReduce. It stores and processes the data in distributed, highly scalable, parallel and fault tolerant environment. Job scheduling shows a significant role in optimizing the functioning of Hadoop. Hadoop default scheduler is not suitable for heterogeneous environment and not robust to identify the stragglers task which prolongs total execution time. Evolutionary approach based scheduler for speculative task execution is proposed in this paper. In this work we are proposing a new method to select the best nodes to run the speculative copy of the slow task. Two parameters such as network information and resource utilization are used to select the optimal nodes to execute the speculative copy of the stragglers task. Experiments have been conducted on web log file of academic website for obtaining the click count. Experimental results show that the execution time is reduced by 31% for 1 GB input data and 23% for 2 GB input data. On an average, the execution time is improved by 21% compared to conventional scheduler.","PeriodicalId":221994,"journal":{"name":"2019 1st International Conference on Advances in Information Technology (ICAIT)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evolutionary Approach based Scheduler for Speculative Task Execution\",\"authors\":\"D. Vinutha, G. Raju\",\"doi\":\"10.1109/ICAIT47043.2019.8987236\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hadoop is an open source framework to implement MapReduce. It stores and processes the data in distributed, highly scalable, parallel and fault tolerant environment. Job scheduling shows a significant role in optimizing the functioning of Hadoop. Hadoop default scheduler is not suitable for heterogeneous environment and not robust to identify the stragglers task which prolongs total execution time. Evolutionary approach based scheduler for speculative task execution is proposed in this paper. In this work we are proposing a new method to select the best nodes to run the speculative copy of the slow task. Two parameters such as network information and resource utilization are used to select the optimal nodes to execute the speculative copy of the stragglers task. Experiments have been conducted on web log file of academic website for obtaining the click count. Experimental results show that the execution time is reduced by 31% for 1 GB input data and 23% for 2 GB input data. On an average, the execution time is improved by 21% compared to conventional scheduler.\",\"PeriodicalId\":221994,\"journal\":{\"name\":\"2019 1st International Conference on Advances in Information Technology (ICAIT)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 1st International Conference on Advances in Information Technology (ICAIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIT47043.2019.8987236\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 1st International Conference on Advances in Information Technology (ICAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIT47043.2019.8987236","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

Hadoop是实现MapReduce的开源框架。它在分布式、高可扩展、并行和容错的环境中存储和处理数据。作业调度在优化Hadoop的功能方面发挥了重要作用。Hadoop默认调度器不适合异构环境,对于识别散列任务也不够健壮,这会延长总执行时间。提出了一种基于进化方法的推测任务执行调度方法。在这项工作中,我们提出了一种新的方法来选择最佳节点来运行慢任务的推测副本。利用网络信息和资源利用率两个参数选择最优节点来执行离散任务的推测拷贝。在学术网站的日志文件上进行了实验,获得了点击数。实验结果表明,当输入数据为1gb时,执行时间减少31%,当输入数据为2gb时,执行时间减少23%。与传统调度器相比,执行时间平均提高了21%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Evolutionary Approach based Scheduler for Speculative Task Execution
Hadoop is an open source framework to implement MapReduce. It stores and processes the data in distributed, highly scalable, parallel and fault tolerant environment. Job scheduling shows a significant role in optimizing the functioning of Hadoop. Hadoop default scheduler is not suitable for heterogeneous environment and not robust to identify the stragglers task which prolongs total execution time. Evolutionary approach based scheduler for speculative task execution is proposed in this paper. In this work we are proposing a new method to select the best nodes to run the speculative copy of the slow task. Two parameters such as network information and resource utilization are used to select the optimal nodes to execute the speculative copy of the stragglers task. Experiments have been conducted on web log file of academic website for obtaining the click count. Experimental results show that the execution time is reduced by 31% for 1 GB input data and 23% for 2 GB input data. On an average, the execution time is improved by 21% compared to conventional scheduler.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信