Concurrent Expandable AMQs on the Basis of Quotient Filters

Bulletin of the Society of Sea Water Science, Japan Pub Date : 2019-11-19 DOI:10.4230/LIPIcs.SEA.2020.15

Tobias Maier, P. Sanders, Robert Williger

{"title":"Concurrent Expandable AMQs on the Basis of Quotient Filters","authors":"Tobias Maier, P. Sanders, Robert Williger","doi":"10.4230/LIPIcs.SEA.2020.15","DOIUrl":null,"url":null,"abstract":"A quotient filter is a cache efficient AMQ data structure. Depending on the fill degree of the filter most insertions and queries only need to access one or two consecutive cache lines. This makes quotient filters fast compared to the more commonly used Bloom filters that incur multiple cache misses. However, concurrent Bloom filters are easy to implement and can be implemented lock-free while concurrent quotient filters are not as simple. Usually concurrent quotient filters work by using an external array of locks -- each protecting a region of the table. Accessing this array incurs one additional cache miss per operation. We propose a new locking scheme that has no memory overhead. Using this new locking scheme we achieve 1.8 times higher speedups than with the common external locking scheme. \nAnother advantage of quotient filters over Bloom filters is that a quotient filter can change its size when it is becoming full. We implement this growing technique for our concurrent quotient filters and adapt it in a way that allows unbounded growing while keeping a bounded false positive rate. We call the resulting data structure a fully expandable quotient filter. Its design is similar to scalable Bloom filters, but we exploit some concepts inherent to quotient filters to improve the space efficiency and the query speed. \nWe also propose quotient filter variants that are aimed to reduce the number of status bits (2-status-bit variant) or to simplify concurrent implementations (linear probing quotient filter). The linear probing quotient filter even leads to a lock-free concurrent filter implementation. This is especially interesting, since we show that any lock-free implementation of another common quotient filter variant would incur significant overheads in the form of additional data fields or multiple passes over the accessed data.","PeriodicalId":9448,"journal":{"name":"Bulletin of the Society of Sea Water Science, Japan","volume":"115 ","pages":"15:1-15:13"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of the Society of Sea Water Science, Japan","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/LIPIcs.SEA.2020.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

A quotient filter is a cache efficient AMQ data structure. Depending on the fill degree of the filter most insertions and queries only need to access one or two consecutive cache lines. This makes quotient filters fast compared to the more commonly used Bloom filters that incur multiple cache misses. However, concurrent Bloom filters are easy to implement and can be implemented lock-free while concurrent quotient filters are not as simple. Usually concurrent quotient filters work by using an external array of locks -- each protecting a region of the table. Accessing this array incurs one additional cache miss per operation. We propose a new locking scheme that has no memory overhead. Using this new locking scheme we achieve 1.8 times higher speedups than with the common external locking scheme. Another advantage of quotient filters over Bloom filters is that a quotient filter can change its size when it is becoming full. We implement this growing technique for our concurrent quotient filters and adapt it in a way that allows unbounded growing while keeping a bounded false positive rate. We call the resulting data structure a fully expandable quotient filter. Its design is similar to scalable Bloom filters, but we exploit some concepts inherent to quotient filters to improve the space efficiency and the query speed. We also propose quotient filter variants that are aimed to reduce the number of status bits (2-status-bit variant) or to simplify concurrent implementations (linear probing quotient filter). The linear probing quotient filter even leads to a lock-free concurrent filter implementation. This is especially interesting, since we show that any lock-free implementation of another common quotient filter variant would incur significant overheads in the form of additional data fields or multiple passes over the accessed data.

查看原文本刊更多论文

基于商滤波器的并发可扩展amq

商过滤器是一种高速缓存高效的AMQ数据结构。根据过滤器的填充程度，大多数插入和查询只需要访问一个或两个连续的缓存行。这使得商过滤器比更常用的Bloom过滤器更快，后者会导致多次缓存丢失。然而，并发布隆过滤器很容易实现，并且可以实现无锁，而并发商过滤器则不那么简单。通常并发商过滤器通过使用外部锁数组来工作——每个锁保护表的一个区域。每次操作访问此数组都会导致一次额外的缓存丢失。我们提出了一种没有内存开销的新锁方案。使用这种新的锁定方案，我们获得了比普通外部锁定方案高1.8倍的速度。商过滤器相对于布隆过滤器的另一个优点是，商过滤器可以在已满时更改其大小。我们为我们的并发商过滤器实现了这种增长技术，并以一种允许无界增长同时保持有界假阳性率的方式进行了调整。我们将得到的数据结构称为完全可扩展商过滤器。它的设计类似于可扩展的布隆过滤器，但我们利用了商过滤器固有的一些概念来提高空间效率和查询速度。我们还提出了商滤波器变体，旨在减少状态位的数量(2-状态位变体)或简化并发实现(线性探测商滤波器)。线性探测商滤波器甚至导致无锁并发滤波器的实现。这一点特别有趣，因为我们展示了另一种常见的商过滤器变体的任何无锁实现都会以额外的数据字段或对访问的数据进行多次传递的形式产生显著的开销。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Bulletin of the Society of Sea Water Science, Japan

自引率

0.00%

发文量