Tzu-Chi Huang, Kuo-Chih Chu, Guo-Hao Huang, Yan-Chen Shen, C. Shieh
{"title":"Distributed control framework for mapreduce cloud on cloud computing","authors":"Tzu-Chi Huang, Kuo-Chih Chu, Guo-Hao Huang, Yan-Chen Shen, C. Shieh","doi":"10.1109/NOMS.2018.8406180","DOIUrl":null,"url":null,"abstract":"A MapReduce cloud becomes a key to the success of cloud computing today. However, a MapReduce cloud uses a single Master node as the brain to manage tasks distributed over Slave nodes for controlling the entire progress of the application execution. Accordingly, a MapReduce cloud easily overloads the Master node with reports sent from Slave nodes at run time to harm performance. Besides, a MapReduce cloud makes the Master node a single failure point to suspend the application execution when the Master node cannot work. A MapReduce cloud can use the Distributed Control Framework (DCF) proposed in this paper to improve both performance and fault tolerance, because DCF shifts most works of a Master node to a DCF Master Agent coexisting in each Slave node and allows Slave nodes to join or leave a cloud at run time without interrupting the application execution. According to observations on experiments with various applications in this paper, a MapReduce cloud can use DCF to have better performance and fault tolerance in comparison to a native MapReduce cloud.","PeriodicalId":19331,"journal":{"name":"NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium","volume":"34 1","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NOMS.2018.8406180","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
A MapReduce cloud becomes a key to the success of cloud computing today. However, a MapReduce cloud uses a single Master node as the brain to manage tasks distributed over Slave nodes for controlling the entire progress of the application execution. Accordingly, a MapReduce cloud easily overloads the Master node with reports sent from Slave nodes at run time to harm performance. Besides, a MapReduce cloud makes the Master node a single failure point to suspend the application execution when the Master node cannot work. A MapReduce cloud can use the Distributed Control Framework (DCF) proposed in this paper to improve both performance and fault tolerance, because DCF shifts most works of a Master node to a DCF Master Agent coexisting in each Slave node and allows Slave nodes to join or leave a cloud at run time without interrupting the application execution. According to observations on experiments with various applications in this paper, a MapReduce cloud can use DCF to have better performance and fault tolerance in comparison to a native MapReduce cloud.