Towards Model Based Approach to Hadoop Deployment and Configuration

2015 12th Web Information System and Application Conference (WISA) Pub Date : 2015-09-11 DOI:10.1109/WISA.2015.65

Yicheng Huang, X. Lan, Xing Chen, Wenzhong Guo

{"title":"Towards Model Based Approach to Hadoop Deployment and Configuration","authors":"Yicheng Huang, X. Lan, Xing Chen, Wenzhong Guo","doi":"10.1109/WISA.2015.65","DOIUrl":null,"url":null,"abstract":"Hadoop is an open source software framework of distributed processing of big data. There are many kinds of services in Hadoop ecosystem, such as HDFS, Map-Reduce, HBase, Hive, Yarn, Flume, Spark, Storm, Zookeeper, and so on, which increase the complexity of deployment and configuration. It takes plenty of time to construct a Hadoop cluster. Although there are some management tools which help administrators deploy and configure Hadoop clusters automatically, they usually provide a fixed solution. So administrators couldn't construct their Hadoop clusters according to different management requirements by the tools. Software architecture acts as a bridge between requirements and implementations. It has been used to reduce the complexity and cost mainly resulted from the difficulties faced by understanding the large-scale and complex software system. This paper proposes a model based approach to Hadoop deployment and configuration which help administrators construct Hadoop clusters in a simple but powerful enough manner. First, we provide the unified models of Hadoop software architecture, according to the domain knowledge of current Hadoop deployment and configuration. Second, we provide a framework with a set of definable rules for domain experts to describe their solutions to deploy and configure Hadoop clusters. Thus, administrators can use various custom solutions to automatically deploy and configure their Hadoop clusters according to different management requirements. In addition, a real-world experiment demonstrates the feasibility, effectiveness and benefits of the new approach to Hadoop deployment and configuration.","PeriodicalId":198938,"journal":{"name":"2015 12th Web Information System and Application Conference (WISA)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 12th Web Information System and Application Conference (WISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISA.2015.65","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 14

Abstract

Hadoop is an open source software framework of distributed processing of big data. There are many kinds of services in Hadoop ecosystem, such as HDFS, Map-Reduce, HBase, Hive, Yarn, Flume, Spark, Storm, Zookeeper, and so on, which increase the complexity of deployment and configuration. It takes plenty of time to construct a Hadoop cluster. Although there are some management tools which help administrators deploy and configure Hadoop clusters automatically, they usually provide a fixed solution. So administrators couldn't construct their Hadoop clusters according to different management requirements by the tools. Software architecture acts as a bridge between requirements and implementations. It has been used to reduce the complexity and cost mainly resulted from the difficulties faced by understanding the large-scale and complex software system. This paper proposes a model based approach to Hadoop deployment and configuration which help administrators construct Hadoop clusters in a simple but powerful enough manner. First, we provide the unified models of Hadoop software architecture, according to the domain knowledge of current Hadoop deployment and configuration. Second, we provide a framework with a set of definable rules for domain experts to describe their solutions to deploy and configure Hadoop clusters. Thus, administrators can use various custom solutions to automatically deploy and configure their Hadoop clusters according to different management requirements. In addition, a real-world experiment demonstrates the feasibility, effectiveness and benefits of the new approach to Hadoop deployment and configuration.

查看原文本刊更多论文

基于模型的Hadoop部署与配置方法研究

Hadoop是一个开源的大数据分布式处理软件框架。Hadoop生态系统中有很多服务，如HDFS、Map-Reduce、HBase、Hive、Yarn、Flume、Spark、Storm、Zookeeper等，这增加了部署和配置的复杂性。构建Hadoop集群需要花费大量的时间。虽然有一些管理工具可以帮助管理员自动部署和配置Hadoop集群，但它们通常提供固定的解决方案。因此管理员无法通过工具根据不同的管理需求来构建自己的Hadoop集群。软件架构充当需求和实现之间的桥梁。它被用来降低复杂性和成本，这主要是由于大规模和复杂的软件系统难以理解所造成的。本文提出了一种基于模型的Hadoop部署和配置方法，帮助管理员以一种简单而强大的方式构建Hadoop集群。首先，根据目前Hadoop部署和配置的领域知识，给出了Hadoop软件架构的统一模型。其次，我们提供了一个框架，其中包含一组可定义的规则，供领域专家描述他们部署和配置Hadoop集群的解决方案。因此，管理员可以根据不同的管理需求，使用各种自定义解决方案自动部署和配置Hadoop集群。此外，一个真实世界的实验证明了Hadoop部署和配置新方法的可行性、有效性和优点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 12th Web Information System and Application Conference (WISA)

自引率

0.00%

发文量