Power-efficient re-gridding architecture for accelerating Non-uniform Fast Fourier Transform

2014 24th International Conference on Field Programmable Logic and Applications (FPL) Pub Date : 2014-10-20 DOI:10.1109/FPL.2014.6927451

Umer I. Cheema, G. Nash, R. Ansari, A. Khokhar

引用次数: 4

Abstract

This paper proposes a novel FPGA-based accelerator for the memory and compute-intense re-gridding process used in computation of Non-uniform Fast Fourier Transform (NuFFT). The re-gridding process interpolates arbitrary sampled data onto a uniform grid using an interpolation kernel function. This regridding step is considered one of the most time consuming step in entire NuFFT computation. We propose a memory-efficient technique based on the novel use of customizable hardware components such as FPGA block memory in First-In-First-Out (FIFO) configuration, fill-rate based arbiter, distributed RAM and an array of pipelined single precision floating point multipliers and adders. The proposed architecture exhibits high performance over a wide range of configurations and data-sizes. A speed-up of over 9.6 was achieved when compared with existing FPGA-based technique at a 7 times higher MFLOPS per watt. Compared to GPU based technique, over 6 times higher MFLOPS per watts were achieved.

查看原文本刊更多论文

加速非均匀快速傅里叶变换的节能重网格结构

针对非均匀快速傅里叶变换(NuFFT)计算中存储和计算密集型的重网格处理，提出了一种基于fpga的新型加速器。重网格过程使用插值核函数将任意采样数据插值到统一网格上。这一步被认为是整个NuFFT计算中最耗时的步骤之一。我们提出了一种基于可定制硬件组件的内存高效技术，如先进先出(FIFO)配置中的FPGA块存储器，基于填充率的仲裁器，分布式RAM和流水线单精度浮点乘法器和加法器阵列。所建议的体系结构在各种配置和数据大小上都表现出高性能。与现有的基于fpga的技术相比，速度提高了9.6以上，每瓦的MFLOPS提高了7倍。与基于GPU的技术相比，实现了每瓦6倍以上的MFLOPS。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 24th International Conference on Field Programmable Logic and Applications (FPL)

自引率

0.00%

发文量