An Optimized Implementation for Concurrent LSM-Structured Key-Value Stores

2018 IEEE International Conference on Networking, Architecture and Storage (NAS) Pub Date : 2018-10-01 DOI:10.1109/NAS.2018.8515730

Li Liu, Hua Wang, Ke Zhou

{"title":"An Optimized Implementation for Concurrent LSM-Structured Key-Value Stores","authors":"Li Liu, Hua Wang, Ke Zhou","doi":"10.1109/NAS.2018.8515730","DOIUrl":null,"url":null,"abstract":"Log-Structured Merge Trees (LSM) based key-value (KV) stores such as LevelDB and HyperLevelDB, use a compaction strategy which brings frequent compaction operations, to store key-value items in sorted order. However, large numbers of compactions impose a negative impact on write and read performance for random data-intensive workloads. To remedy this problem, this paper presents OHDB, an optimization of HyperLevelDB for random data-intensive workloads. OHDB implements two stand-alone techniques in the disk component of LSM structure to optimize the concurrent compactions. One is dividing KV items by prefix at the first level in the disk component, to reduce the frequency of overlapping in key range among data files, and thus reduces the amount of compactions. The other is separating the first level in the disk component from the rest levels, and organizing them in two disks individually, to increase parallelism of disk writes of compactions. We evaluate three OHDBs which are OHDB with each of the technique and OHDB with the combination of both respectively, using micro-benchmarks with random write- intensive and read-intensive workloads. Experimental results show that OHDB reduces the amount of compactions by a factor of up to 4x, and improves the write and read performance for random data-intensive workloads under various settings.","PeriodicalId":115970,"journal":{"name":"2018 IEEE International Conference on Networking, Architecture and Storage (NAS)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Networking, Architecture and Storage (NAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NAS.2018.8515730","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Log-Structured Merge Trees (LSM) based key-value (KV) stores such as LevelDB and HyperLevelDB, use a compaction strategy which brings frequent compaction operations, to store key-value items in sorted order. However, large numbers of compactions impose a negative impact on write and read performance for random data-intensive workloads. To remedy this problem, this paper presents OHDB, an optimization of HyperLevelDB for random data-intensive workloads. OHDB implements two stand-alone techniques in the disk component of LSM structure to optimize the concurrent compactions. One is dividing KV items by prefix at the first level in the disk component, to reduce the frequency of overlapping in key range among data files, and thus reduces the amount of compactions. The other is separating the first level in the disk component from the rest levels, and organizing them in two disks individually, to increase parallelism of disk writes of compactions. We evaluate three OHDBs which are OHDB with each of the technique and OHDB with the combination of both respectively, using micro-benchmarks with random write- intensive and read-intensive workloads. Experimental results show that OHDB reduces the amount of compactions by a factor of up to 4x, and improves the write and read performance for random data-intensive workloads under various settings.

查看原文本刊更多论文

并发lsm结构键值存储的优化实现

基于日志结构合并树(LSM)的键值(KV)存储，如LevelDB和HyperLevelDB，使用压缩策略，带来频繁的压缩操作，按排序顺序存储键值项。但是，对于随机数据密集型工作负载，大量的压缩会对写入和读取性能产生负面影响。为了解决这个问题，本文提出了OHDB，一种针对随机数据密集型工作负载的HyperLevelDB优化。OHDB在LSM结构的磁盘组件中实现了两种独立的技术来优化并发压缩。一种是在磁盘组件的第一级按前缀划分KV项，以减少数据文件在键范围内重叠的频率，从而减少压缩量。另一种方法是将磁盘组件中的第一层与其余层分开，并将它们分别组织在两个磁盘中，以增加磁盘写压缩的并行性。我们使用随机写密集型和读密集型工作负载的微基准测试，分别评估了使用每种技术的OHDB和结合两者的OHDB的三个OHDB。实验结果表明，OHDB减少了多达4倍的压缩量，并在各种设置下提高了随机数据密集型工作负载的写入和读取性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE International Conference on Networking, Architecture and Storage (NAS)

自引率

0.00%

发文量