Deploying Network Key-Value SSDs to Disaggregate Resources in Big Data Processing Frameworks

2020 IEEE 39th International Performance Computing and Communications Conference (IPCCC) Pub Date : 2020-11-06 DOI:10.1109/IPCCC50635.2020.9391532

Mahsa Bayati, Harsh Roogi, Ron Lee, N. Mi

{"title":"Deploying Network Key-Value SSDs to Disaggregate Resources in Big Data Processing Frameworks","authors":"Mahsa Bayati, Harsh Roogi, Ron Lee, N. Mi","doi":"10.1109/IPCCC50635.2020.9391532","DOIUrl":null,"url":null,"abstract":"The exponential data generation embraces unstructured object storage systems as an effective solution to improve performance. Key-Value (KV) SSD object storage devices are unveiled to mitigate the shortcomings of traditional Key-Value stores on block devices, including device low-bandwidth utilization and KV-store resource-draining operations on the host CPU and block devices. Samsung KV-SSDs are built on top of NVMe over Fabric hardware, which supports storage remote access protocols (i.e., RDMA). Network Key-Value (NKV) is a software eco-system developed by Samsung that enables data distribution and storage disaggregation of KV-SSDs. Most widely used big data processing platforms, such as Hadoop, Presto, deploy Hadoop Distributed File System (HDFS) to take advantage of rapid data access by co-locating storage and compute nodes. The co-allocation of compute and storage node limits the scalability and utilization resources and thus increases the total cost of ownership. In this paper, we present a new storage disaggregation model for big data processing platforms. Our new system layout leverages resource disaggregation by separating compute infrastructure from storage infrastructure and utilizes the benefits of new evolving storage technology, i.e., KV-SSD, for large-scale data access and processing. The goal of this work is to facilitate independent scaling of storage and compute resources, and shift the data retrieval load from the hosts to storage nodes. We evaluate our designed architecture using TPC-DS benchmark. Our results show that the CPU load on compute nodes is non-negligibly released with sustaining the same performance compared to the conventional Hadoop with HDFS.","PeriodicalId":226034,"journal":{"name":"2020 IEEE 39th International Performance Computing and Communications Conference (IPCCC)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 39th International Performance Computing and Communications Conference (IPCCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPCCC50635.2020.9391532","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The exponential data generation embraces unstructured object storage systems as an effective solution to improve performance. Key-Value (KV) SSD object storage devices are unveiled to mitigate the shortcomings of traditional Key-Value stores on block devices, including device low-bandwidth utilization and KV-store resource-draining operations on the host CPU and block devices. Samsung KV-SSDs are built on top of NVMe over Fabric hardware, which supports storage remote access protocols (i.e., RDMA). Network Key-Value (NKV) is a software eco-system developed by Samsung that enables data distribution and storage disaggregation of KV-SSDs. Most widely used big data processing platforms, such as Hadoop, Presto, deploy Hadoop Distributed File System (HDFS) to take advantage of rapid data access by co-locating storage and compute nodes. The co-allocation of compute and storage node limits the scalability and utilization resources and thus increases the total cost of ownership. In this paper, we present a new storage disaggregation model for big data processing platforms. Our new system layout leverages resource disaggregation by separating compute infrastructure from storage infrastructure and utilizes the benefits of new evolving storage technology, i.e., KV-SSD, for large-scale data access and processing. The goal of this work is to facilitate independent scaling of storage and compute resources, and shift the data retrieval load from the hosts to storage nodes. We evaluate our designed architecture using TPC-DS benchmark. Our results show that the CPU load on compute nodes is non-negligibly released with sustaining the same performance compared to the conventional Hadoop with HDFS.

查看原文本刊更多论文

部署网络键值ssd，实现大数据处理框架下的资源分解

指数数据生成采用非结构化对象存储系统作为提高性能的有效解决方案。KV (Key-Value) SSD对象存储设备的出现，是为了解决传统的Key-Value存储在块设备上的带宽利用率低、主机CPU和块设备上的KV-store资源消耗大等缺点。三星kv - ssd是建立在NVMe之上的Fabric硬件，它支持存储远程访问协议(即RDMA)。网络键值(Network Key-Value, NKV)是三星电子开发的一种软件生态系统，可以实现kv - ssd的数据分发和存储分解。目前应用最广泛的大数据处理平台，如Hadoop、Presto等，都部署了HDFS (Hadoop Distributed File System)，通过存储和计算节点的共置，实现数据的快速访问。计算节点和存储节点的共同分配限制了可扩展性和资源利用率，从而增加了总体拥有成本。本文提出了一种新的大数据处理平台存储分解模型。我们的新系统布局通过将计算基础设施与存储基础设施分离来利用资源分解，并利用新的不断发展的存储技术的优势，例如KV-SSD，用于大规模数据访问和处理。这项工作的目标是促进存储和计算资源的独立扩展，并将数据检索负载从主机转移到存储节点。我们使用TPC-DS基准测试来评估我们设计的架构。我们的结果表明，与传统的Hadoop与HDFS相比，计算节点上的CPU负载得到了不可忽视的释放，并保持了相同的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE 39th International Performance Computing and Communications Conference (IPCCC)

自引率

0.00%

发文量