Secure and multi-tenant Hadoop cluster - an experience

Paresh Wankhede, Nayanjyoti Paul
{"title":"Secure and multi-tenant Hadoop cluster - an experience","authors":"Paresh Wankhede, Nayanjyoti Paul","doi":"10.1109/ICGHPC.2016.7508069","DOIUrl":null,"url":null,"abstract":"Data Analytics and Data Discovery are the most important facets in today's Business Domain where customer centric business decisions are the key. With ever increasing rate of data captivation, curation, management and requirement of data analytics, Hadoop has accounted itself as a major player in providing the data analytics and data processing backbone for any Organization that deals with ever increasing nuances of data management and processing. With every organizational setup of Hadoop clusters, we find it an ever increasing challenge to setup, manage and operate multiple Hadoop clusters, for managing different projects or managing different Tenants (clients). This results in a higher client onboarding time on Hadoop, cost of project ownership and effort to setup and manage separate clusters for separate projects/clients/tenants. However with the current trend of data security, companies are apprehensive of building a single large cluster and onboarding multiple clients on same common Hadoop cluster. This paper demonstrates how to set up a multi-tenant cluster which is big in size, scalable enough and has short client onboarding time without any client having access/knowledge/information of any other clients. Security features are also implemented on this multi-tenant cluster for authentication and authorization, so that only right client members have access to their allocated Hadoop resources like RAM, CPU and disk size. This paper also demonstrates how to create a fully functional and operational multi-tenant cluster with security at its core, reduced Cluster Management, higher data & resource security to provide an optimized Hadoop based solution offering in terms of cost and effectiveness.","PeriodicalId":268630,"journal":{"name":"2016 2nd International Conference on Green High Performance Computing (ICGHPC)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 2nd International Conference on Green High Performance Computing (ICGHPC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICGHPC.2016.7508069","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Data Analytics and Data Discovery are the most important facets in today's Business Domain where customer centric business decisions are the key. With ever increasing rate of data captivation, curation, management and requirement of data analytics, Hadoop has accounted itself as a major player in providing the data analytics and data processing backbone for any Organization that deals with ever increasing nuances of data management and processing. With every organizational setup of Hadoop clusters, we find it an ever increasing challenge to setup, manage and operate multiple Hadoop clusters, for managing different projects or managing different Tenants (clients). This results in a higher client onboarding time on Hadoop, cost of project ownership and effort to setup and manage separate clusters for separate projects/clients/tenants. However with the current trend of data security, companies are apprehensive of building a single large cluster and onboarding multiple clients on same common Hadoop cluster. This paper demonstrates how to set up a multi-tenant cluster which is big in size, scalable enough and has short client onboarding time without any client having access/knowledge/information of any other clients. Security features are also implemented on this multi-tenant cluster for authentication and authorization, so that only right client members have access to their allocated Hadoop resources like RAM, CPU and disk size. This paper also demonstrates how to create a fully functional and operational multi-tenant cluster with security at its core, reduced Cluster Management, higher data & resource security to provide an optimized Hadoop based solution offering in terms of cost and effectiveness.
安全和多租户Hadoop集群——一种体验
数据分析和数据发现是当今业务领域中最重要的方面,以客户为中心的业务决策是关键。随着数据捕获、管理、管理和数据分析需求的不断增长,Hadoop已经成为为任何组织提供数据分析和数据处理骨干的主要参与者,这些组织处理日益增加的数据管理和处理的细微差别。随着Hadoop集群的每一次组织设置,我们发现设置,管理和操作多个Hadoop集群,管理不同的项目或管理不同的租户(客户端)是一个越来越大的挑战。这导致客户端在Hadoop上的登录时间更长,项目拥有成本更高,并且需要为不同的项目/客户端/租户设置和管理单独的集群。然而,随着当前数据安全的趋势,公司对构建一个大型集群并在同一个公共Hadoop集群上安装多个客户端感到担忧。本文演示了如何设置一个多租户集群,该集群规模大、可伸缩,并且客户端登录时间短,而任何客户端都不需要访问/了解其他任何客户端的信息。这个多租户集群还实现了安全特性,用于身份验证和授权,因此只有正确的客户端成员才能访问分配给他们的Hadoop资源,如RAM、CPU和磁盘大小。本文还演示了如何创建一个功能齐全、可操作的多租户集群,以安全性为核心,减少集群管理,提高数据和资源安全性,从而在成本和效率方面提供优化的基于Hadoop的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信