Performance comparison of Hadoop Clusters configured on virtual machines and as a cloud service
Authors: M. F. Hyder, M. A. Ismail, Hameeza Ahmed
Venue: 2014 International Conference on Emerging Technologies (ICET)
Publication date: 2014-12-01
DOI: 10.1109/ICET.2014.7021017 (https://doi.org/10.1109/ICET.2014.7021017)
Citation count: 6
Abstract
Cloud computing is an emerging trend in online computing and resource management. OpenStack is one of the most widely used open source platforms for building public and private clouds, while Apache Hadoop is an open source framework used to process large data sets spread across clusters of computers. Both OpenStack and Apache Hadoop currently receive major attention in the research and open source communities. This paper focuses on integrating a Hadoop cluster into an OpenStack cloud as one of its services, and then compares the performance of that cluster with that of a Hadoop cluster configured separately on virtual machines. The results show the successful implementation of Hadoop as a cloud service and a performance improvement over the native virtual Hadoop cluster.