Fast concurrent queues for x86 processors

ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming Pub Date : 2013-02-23 DOI:10.1145/2442516.2442527

Adam Morrison, Y. Afek

引用次数: 138

Abstract

Conventional wisdom in designing concurrent data structures is to use the most powerful synchronization primitive, namely compare-and-swap (CAS), and to avoid contended hot spots. In building concurrent FIFO queues, this reasoning has led researchers to propose combining-based concurrent queues. This paper takes a different approach, showing how to rely on fetch-and-add (F&A), a less powerful primitive that is available on x86 processors, to construct a nonblocking (lock-free) linearizable concurrent FIFO queue which, despite the F&A being a contended hot spot, outperforms combining-based implementations by 1.5x to 2.5x in all concurrency levels on an x86 server with four multicore processors, in both single-processor and multi-processor executions.

查看原文本刊更多论文

用于x86处理器的快速并发队列

设计并发数据结构的传统智慧是使用最强大的同步原语，即比较与交换(CAS)，并避免争用热点。在构建并发FIFO队列时，这一推理使研究人员提出了基于组合的并发队列。本文采用了一种不同的方法，展示了如何依赖于获取和添加(F&A)，这是x86处理器上可用的一种功能较弱的原语，来构建一个非阻塞(无锁)可线性化的并发FIFO队列，尽管F&A是一个争用热点，但在具有四个多核处理器的x86服务器上的所有并发级别上，在单处理器和多处理器执行中，该队列的性能比基于组合的实现高出1.5到2.5倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming

自引率

0.00%

发文量