{"title":"Optimizing Hadoop parameter for speedup using Q-Learning Reinforcement Learning","authors":"Nandita Yambem, A. Nandakumar","doi":"10.1109/icecct52121.2021.9616965","DOIUrl":null,"url":null,"abstract":"Hadoop is the most popular open source big data processing platform which is being used in many big data analytics applications. The performance of Hadoop can be fine-tuned for application performance requirements by adjusting the value of the some of the configuration parameters. Various methods have been proposed in literature for fine tuning the configuration parameters of Hadoop. The relation between the Hadoop performance tuning parameters and speed up is dependent on the nature of the applications and environment dynamics. Tuning the parameters without consideration of these dynamics results in sub optimal configurations and lower performance.. Adaptive reinforcement learning using Q-Learning is proposed in this work to fine tune the configuration parameters with the objective of reducing the error between desired and achieved service level agreement (SLA).","PeriodicalId":155129,"journal":{"name":"2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icecct52121.2021.9616965","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Hadoop is the most popular open source big data processing platform which is being used in many big data analytics applications. The performance of Hadoop can be fine-tuned for application performance requirements by adjusting the value of the some of the configuration parameters. Various methods have been proposed in literature for fine tuning the configuration parameters of Hadoop. The relation between the Hadoop performance tuning parameters and speed up is dependent on the nature of the applications and environment dynamics. Tuning the parameters without consideration of these dynamics results in sub optimal configurations and lower performance.. Adaptive reinforcement learning using Q-Learning is proposed in this work to fine tune the configuration parameters with the objective of reducing the error between desired and achieved service level agreement (SLA).