{"title":"An implementation of heterogeneous architecture based MapReduce in the clouds","authors":"Yusong Tan, Wenzhu Wang, Q. Wu, Jie Lin","doi":"10.1109/CCIOT.2016.7868295","DOIUrl":null,"url":null,"abstract":"With the rapid development of computer technology, heterogeneous architecture based MapReduce (HA-MapReduce for short) is widely studied in the big data processing domain. Meanwhile, cloud computing is becoming an important alternative for providing computational infrastructure. Therefore, in this paper, we propose an implementation of HA-MapReduce in the cloud environment. First, we design a uniform MapReduce framework for heterogeneous architecture, which can utilize CPU and coprocessor cooperatively and efficiently. Second, we propose a coprocessor token mechanism for handling the coprocessor scalability and fault tolerance issues. Finally, we design a lightweight virtualization based cloud platform for low overhead and easy deployment. We deploy a CPU-MIC heterogeneous cluster for our HA-MapReduce and cloud platform. The experimental results show that our system is up to 1.21× and 1.38× faster than VM-based cloud platform, and 2.31× to 8.39× speedups than CPU-based Hadoop.","PeriodicalId":384484,"journal":{"name":"2016 2nd International Conference on Cloud Computing and Internet of Things (CCIOT)","volume":"486 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 2nd International Conference on Cloud Computing and Internet of Things (CCIOT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIOT.2016.7868295","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
With the rapid development of computer technology, heterogeneous architecture based MapReduce (HA-MapReduce for short) is widely studied in the big data processing domain. Meanwhile, cloud computing is becoming an important alternative for providing computational infrastructure. Therefore, in this paper, we propose an implementation of HA-MapReduce in the cloud environment. First, we design a uniform MapReduce framework for heterogeneous architecture, which can utilize CPU and coprocessor cooperatively and efficiently. Second, we propose a coprocessor token mechanism for handling the coprocessor scalability and fault tolerance issues. Finally, we design a lightweight virtualization based cloud platform for low overhead and easy deployment. We deploy a CPU-MIC heterogeneous cluster for our HA-MapReduce and cloud platform. The experimental results show that our system is up to 1.21× and 1.38× faster than VM-based cloud platform, and 2.31× to 8.39× speedups than CPU-based Hadoop.