A New Hardware Accelerator for Data Sorting in Area & Energy Constrained Architectures

2019 IEEE 62nd International Midwest Symposium on Circuits and Systems (MWSCAS) Pub Date : 2019-08-01 DOI:10.1109/MWSCAS.2019.8885297

Amin Norollah, H. Beitollahi, A. Patooghy

引用次数: 0

Abstract

Sorting is one of the most important computational tasks in data processing applications. Recent studies show that the FPGA-based hardware accelerators are more efficient than the general-purpose processors and GPUs. By increasing the input records in the sorting network, the number of Compare-And-Swap (CAS) units would be increased, which in turn, will lead to increased resource consumption. In some applications, the number of available resources is limited. Thereby, it is necessary to optimize resource requirements while maintaining a sufficient level of performance. This paper presents a new sorting architecture that reduces the number of required resources compared to the state-of-the-art sorting architecture and achieves the desired performance using Unary processing. Results indicate that the proposed architecture increases throughput by 29.1% and reduces the number of LUTs by 42%, for sorting 8-input records, compared to other architecture.

查看原文本刊更多论文

一种用于区域和能量受限架构中数据排序的新型硬件加速器

排序是数据处理应用中最重要的计算任务之一。近年来的研究表明，基于fpga的硬件加速器比通用处理器和gpu的效率更高。通过增加排序网络中的输入记录，比较-交换(CAS)单元的数量将会增加，这反过来又会导致资源消耗的增加。在某些应用程序中，可用资源的数量是有限的。因此，有必要在保持足够的性能水平的同时优化资源需求。本文提出了一种新的排序体系结构，与最先进的排序体系结构相比，它减少了所需资源的数量，并使用一元处理实现了所需的性能。结果表明，与其他架构相比，所提出的架构在对8个输入记录进行排序时将吞吐量提高了29.1%，并将lut数量减少了42%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE 62nd International Midwest Symposium on Circuits and Systems (MWSCAS)

自引率

0.00%

发文量