High throughput filtering using FPGA-acceleration

Proceedings of the 22nd ACM International Conference on Information & Knowledge Management. Pub Date: 2013-10-27. DOI: 10.1145/2505515.2507866

W. Vanderbauwhede, Anton Frolov, L. Azzopardi, S. R. Chalamalasetti, M. Margala

Citations: 1

Abstract

With the rise in the amount of information being streamed across networks, there is a growing demand to vet its quality, type and content for various purposes such as spam, security and search. In this paper, we develop an energy-efficient, high-performance information filtering system that is capable of classifying a stream of incoming documents at high speed. The prototype parses a stream of documents using a multicore CPU and then performs classification using Field-Programmable Gate Arrays (FPGAs). We implemented a Naive Bayes classifier on our prototype and, on a large TREC data collection, compared it to an optimized CPU-based baseline. Our empirical findings show that we can classify documents at 10 Gb/s, which is up to 94 times faster than the CPU baseline (and up to 5 times faster than previous FPGA-based implementations). In future work, we aim to increase the throughput by another order of magnitude by implementing both the parser and the filter on the FPGA.
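The abstract does not describe the classifier's internals, so the following is only a minimal software sketch of how a Naive Bayes filter over a stream of parsed documents could look. It assumes a two-class problem (relevant vs. not relevant), documents already tokenized into term lists by the parsing stage, and Laplace smoothing; the names `train`, `score`, `filter_stream`, `alpha` and `threshold` are hypothetical and not taken from the paper. It is not the authors' FPGA design.

```python
import math
from collections import Counter


def train(docs, labels, alpha=1.0):
    """Estimate per-term log-likelihood ratios from labelled token lists.

    Assumes both classes (True and False) occur at least once in `labels`.
    `alpha` is a Laplace smoothing constant (an assumption of this sketch).
    """
    counts = {True: Counter(), False: Counter()}
    n_docs = Counter(labels)
    for tokens, label in zip(docs, labels):
        counts[label].update(tokens)

    vocab = set(counts[True]) | set(counts[False])
    totals = {
        c: sum(counts[c].values()) + alpha * len(vocab) for c in (True, False)
    }
    # One weight per term: log P(term | relevant) - log P(term | not relevant).
    weights = {
        t: math.log((counts[True][t] + alpha) / totals[True])
        - math.log((counts[False][t] + alpha) / totals[False])
        for t in vocab
    }
    prior = math.log(n_docs[True] / n_docs[False])
    return weights, prior


def score(tokens, weights, prior):
    """Sum the term weights of one document; unseen terms contribute 0."""
    return prior + sum(weights.get(t, 0.0) for t in tokens)


def filter_stream(stream, weights, prior, threshold=0.0):
    """Lazily classify each tokenized document in the stream."""
    for tokens in stream:
        yield score(tokens, weights, prior) > threshold


weights, prior = train(
    [["cheap", "pills"], ["meeting", "notes"]],
    [True, False],
)
decisions = list(
    filter_stream([["cheap", "pills", "today"], ["meeting", "notes"]], weights, prior)
)
```

Because each document's score is a running sum of per-term weights looked up in a table, the scoring step is a natural candidate for offloading to hardware, while tokenization stays on the CPU as in the prototype described above.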


