ROWE-tree: A Read-Optimized and Write-Efficient B+-tree for Persistent Memory

Proceedings of the 51st International Conference on Parallel Processing Pub Date : 2022-08-29 DOI:10.1145/3545008.3545043

Xiaomin Zou, Fang Wang, D. Feng, Tianjin Guan, Nan Su

{"title":"ROWE-tree: A Read-Optimized and Write-Efficient B+-tree for Persistent Memory","authors":"Xiaomin Zou, Fang Wang, D. Feng, Tianjin Guan, Nan Su","doi":"10.1145/3545008.3545043","DOIUrl":null,"url":null,"abstract":"Persistent memory (PM) exhibits a huge potential to provide B+-tree indexes with high performance, efficient persistence, and instant recovery. A large number of PM-optimized B+-tree indexes have been proposed, but most of them fail to provide high performance for both read and write operations because: (1) their designs of search optimization and insert improvement are often traded off against each other, and (2) they overlook the read/write interference problem of PM which incurs unpredictable performance degradation. In this paper, we propose ROWE-tree, a read-optimized and write-efficient B+-tree for PM. The designs of our ROWE-tree consist of three key points. First, we propose two techniques to make a good trade-off between write and read performance: self-verifying insertion, which reduces consistency overhead by using the key itself as a persist mark instead of additional metadata, and semi-sorted leaf nodes, which use append-only insertion to avoid the shifting overhead of sorting nodes but keep intra-cache-line items sorted to accelerate the lookup. Second, based on the observation that data accesses are highly skewed in real-world workloads, we build an in-DRAM cache of hot items to outsource accesses to hot items to DRAM. By doing so, we can alleviate the read/write interference of PM and significantly improve overall performance. Third, to cope with the dynamic changes of hot items, we exploit a lightweight mechanism to track such changes at run-time. Using Intel Optane DCPMM, our evaluations show that ROWE-tree obtains up to 3.86 × higher performance than the state-of-the-art PM B+-tree indexes under YCSB workloads.","PeriodicalId":360504,"journal":{"name":"Proceedings of the 51st International Conference on Parallel Processing","volume":"79 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 51st International Conference on Parallel Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3545008.3545043","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Persistent memory (PM) exhibits a huge potential to provide B+-tree indexes with high performance, efficient persistence, and instant recovery. A large number of PM-optimized B+-tree indexes have been proposed, but most of them fail to provide high performance for both read and write operations because: (1) their designs of search optimization and insert improvement are often traded off against each other, and (2) they overlook the read/write interference problem of PM which incurs unpredictable performance degradation. In this paper, we propose ROWE-tree, a read-optimized and write-efficient B+-tree for PM. The designs of our ROWE-tree consist of three key points. First, we propose two techniques to make a good trade-off between write and read performance: self-verifying insertion, which reduces consistency overhead by using the key itself as a persist mark instead of additional metadata, and semi-sorted leaf nodes, which use append-only insertion to avoid the shifting overhead of sorting nodes but keep intra-cache-line items sorted to accelerate the lookup. Second, based on the observation that data accesses are highly skewed in real-world workloads, we build an in-DRAM cache of hot items to outsource accesses to hot items to DRAM. By doing so, we can alleviate the read/write interference of PM and significantly improve overall performance. Third, to cope with the dynamic changes of hot items, we exploit a lightweight mechanism to track such changes at run-time. Using Intel Optane DCPMM, our evaluations show that ROWE-tree obtains up to 3.86 × higher performance than the state-of-the-art PM B+-tree indexes under YCSB workloads.

查看原文本刊更多论文

row -tree:持久内存的读优化和写高效B+树

持久内存(PM)在提供具有高性能、高效持久性和即时恢复的B+树索引方面显示出巨大的潜力。已经提出了大量的PM优化的B+树索引，但大多数索引不能同时提供高性能的读和写操作，因为:(1)它们的搜索优化设计和插入改进设计经常相互权衡，(2)它们忽略了PM的读写干扰问题，从而导致不可预测的性能下降。在本文中，我们提出了ROWE-tree，一个读优化和写高效的B+树。我们的row树的设计包括三个关键点。首先，我们提出了两种技术来在写和读性能之间做出良好的权衡:自验证插入，它通过使用键本身作为持久化标记而不是额外的元数据来减少一致性开销;半排序叶节点，它使用仅追加插入来避免排序节点的移动开销，但保持缓存内行项的排序以加速查找。其次，基于对数据访问在现实工作负载中高度倾斜的观察，我们构建了一个热项目的DRAM内缓存，将对热项目的访问外包给DRAM。通过这样做，我们可以减轻PM的读/写干扰，并显着提高整体性能。第三，为了处理热点项的动态变化，我们利用轻量级机制在运行时跟踪这些变化。使用英特尔Optane DCPMM，我们的评估表明，在YCSB工作负载下，ROWE-tree索引的性能比最先进的PM B+-tree索引高出3.86倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 51st International Conference on Parallel Processing

自引率

0.00%

发文量