Pipelined Parallel Finite Automata Evaluation

2019 International Conference on Field-Programmable Technology (ICFPT) Pub Date : 2019-12-01 DOI:10.1109/ICFPT47387.2019.00021

Vipula Sateesh, Connor Mckeon, Jared Winograd, A. DeHon

引用次数: 5

Abstract

Finite automata are key compute models in modern computational theory and important building blocks for digital logic used for regular expression and protocol parsing, filtering, and control. Finite automata evaluation would seem to be a sequential operation, since we need to complete the evaluation of one state to know the next state in which to evaluate the logic. Nonetheless, parallel theory provides strategies for parallel finite automata evaluation. We show how to exploit this parallel evaluation strategy in practice on today's high capacity FPGAs, including a novel formulation for spatially pipelined evaluation. For non-deterministic finite automata (NFA) with S states, we can evaluate N inputs in a single cycle with O(N * S^2) BRAMs and O(N*S^3) LUTs. This allows us, for example, to consume 64 inputs on a 16 state NFA in a single cycle on the Xilinx XZCU9EG-ffvb1156-2-i SoC FPGA, achieving 47 GB/s (377 Gb/s) single stream throughput for 8b inputs. For a 40 Gb/s network link, we can support 28 state NFAs.

查看原文本刊更多论文

流水线并行有限自动机评估

有限自动机是现代计算理论中的关键计算模型，也是用于正则表达式和协议解析、过滤和控制的数字逻辑的重要组成部分。有限自动机评估似乎是一个顺序操作，因为我们需要完成一个状态的评估，以知道评估逻辑的下一个状态。然而，并行理论为并行有限自动机的评估提供了策略。我们展示了如何在当今的高容量fpga实践中利用这种并行评估策略，包括用于空间流水线评估的新公式。对于具有S状态的非确定性有限自动机(NFA)，我们可以用O(N*S^ 2)个bram和O(N*S^3)个LUTs在单个循环中评估N个输入。例如，这允许我们在Xilinx XZCU9EG-ffvb1156-2-i SoC FPGA上的单周期内消耗16状态NFA上的64个输入，为8b输入实现47 GB/s (377 GB/s)单流吞吐量。对于40gb /s的网络链路，可以支持28个状态的NFAs。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 International Conference on Field-Programmable Technology (ICFPT)

自引率

0.00%

发文量