An FPGA-specific approach to floating-point accumulation and sum-of-products

2008 International Conference on Field-Programmable Technology Pub Date : 2008-12-07 DOI:10.1109/FPT.2008.4762363

F. D. Dinechin, B. Pasca, O. Creţ, R. Tudoran

引用次数: 20

Abstract

This article studies two common situations where the flexibility of FPGAs allows one to design application-specific floating-point operators which are more efficient and more accurate than those offered by processors and GPUs. First, for applications involving the addition of a large number of floating-point values, an ad-hoc accumulator is proposed. By tailoring its parameters to the numerical requirements of the application, it can be made arbitrarily accurate, at an area cost comparable to that of a standard floating-point adder, and at a higher frequency. The second example is the sum-of-product operation, which is the building block of matrix computations. A novel architecture is proposed that feeds the previous accumulator out of a floating-point multiplier whose rounding logic has been removed, again improving the area/accuracy tradeoff. These architectures are implemented within the FloPoCo generator, freely available under the LGPL.

查看原文本刊更多论文

浮点累加和积和的fpga专用方法

本文研究了两种常见的情况，其中fpga的灵活性允许人们设计特定于应用程序的浮点运算符，这些运算符比处理器和gpu提供的运算符更高效、更准确。首先，针对涉及大量浮点数相加的应用，提出了一种自适应累加器。通过根据应用程序的数值要求调整其参数，可以使其精确到任意程度，其面积成本与标准浮点加法器相当，并且频率更高。第二个例子是乘积和运算，它是矩阵计算的组成部分。提出了一种新的架构，该架构将先前的累加器从去除舍入逻辑的浮点乘法器中取出，再次改善了面积/精度权衡。这些架构在FloPoCo生成器中实现，在LGPL下免费提供。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 International Conference on Field-Programmable Technology

自引率

0.00%

发文量