Data-parallel finite-state machines

Proceedings of the 19th international conference on Architectural support for programming languages and operating systems Pub Date : 2014-02-24 DOI:10.1145/2541940.2541988

Todd Mytkowicz, Madan Musuvathi, Wolfram Schulte

引用次数: 82

Abstract

A finite-state machine (FSM) is an important abstraction for solving several problems, including regular-expression matching, tokenizing text, and Huffman decoding. FSM computations typically involve data-dependent iterations with unpredictable memory-access patterns making them difficult to parallelize. This paper describes a parallel algorithm for FSMs that breaks dependences across iterations by efficiently enumerating transitions from all possible states on each input symbol. This allows the algorithm to utilize various sources of data parallelism available on modern hardware, including vector instructions and multiple processors/cores. For instance, on benchmarks from three FSM applications: regular expressions, Huffman decoding, and HTML tokenization, the parallel algorithm achieves up to a 3x speedup over optimized sequential baselines on a single core, and linear speedups up to 21x on 8 cores.

查看原文本刊更多论文

数据并行有限状态机

有限状态机(FSM)是解决正则表达式匹配、文本标记化和霍夫曼解码等问题的重要抽象。FSM计算通常涉及数据依赖的迭代，具有不可预测的内存访问模式，这使得它们难以并行化。本文描述了一种FSMs并行算法，该算法通过有效地枚举每个输入符号上所有可能状态的转换来打破迭代之间的依赖。这允许该算法利用现代硬件上可用的各种数据并行性来源，包括矢量指令和多处理器/内核。例如，在三个FSM应用程序(正则表达式、Huffman解码和HTML标记化)的基准测试中，并行算法在单核上比优化的顺序基线实现了高达3倍的加速，在8核上实现了高达21倍的线性加速。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 19th international conference on Architectural support for programming languages and operating systems

自引率

0.00%

发文量