浮点累加和积和的fpga专用方法

2008 International Conference on Field-Programmable Technology Pub Date : 2008-12-07 DOI:10.1109/FPT.2008.4762363

F. D. Dinechin, B. Pasca, O. Creţ, R. Tudoran

{"title":"浮点累加和积和的fpga专用方法","authors":"F. D. Dinechin, B. Pasca, O. Creţ, R. Tudoran","doi":"10.1109/FPT.2008.4762363","DOIUrl":null,"url":null,"abstract":"This article studies two common situations where the flexibility of FPGAs allows one to design application-specific floating-point operators which are more efficient and more accurate than those offered by processors and GPUs. First, for applications involving the addition of a large number of floating-point values, an ad-hoc accumulator is proposed. By tailoring its parameters to the numerical requirements of the application, it can be made arbitrarily accurate, at an area cost comparable to that of a standard floating-point adder, and at a higher frequency. The second example is the sum-of-product operation, which is the building block of matrix computations. A novel architecture is proposed that feeds the previous accumulator out of a floating-point multiplier whose rounding logic has been removed, again improving the area/accuracy tradeoff. These architectures are implemented within the FloPoCo generator, freely available under the LGPL.","PeriodicalId":320925,"journal":{"name":"2008 International Conference on Field-Programmable Technology","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":"{\"title\":\"An FPGA-specific approach to floating-point accumulation and sum-of-products\",\"authors\":\"F. D. Dinechin, B. Pasca, O. Creţ, R. Tudoran\",\"doi\":\"10.1109/FPT.2008.4762363\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article studies two common situations where the flexibility of FPGAs allows one to design application-specific floating-point operators which are more efficient and more accurate than those offered by processors and GPUs. First, for applications involving the addition of a large number of floating-point values, an ad-hoc accumulator is proposed. By tailoring its parameters to the numerical requirements of the application, it can be made arbitrarily accurate, at an area cost comparable to that of a standard floating-point adder, and at a higher frequency. The second example is the sum-of-product operation, which is the building block of matrix computations. A novel architecture is proposed that feeds the previous accumulator out of a floating-point multiplier whose rounding logic has been removed, again improving the area/accuracy tradeoff. These architectures are implemented within the FloPoCo generator, freely available under the LGPL.\",\"PeriodicalId\":320925,\"journal\":{\"name\":\"2008 International Conference on Field-Programmable Technology\",\"volume\":\"67 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"20\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 International Conference on Field-Programmable Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FPT.2008.4762363\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Conference on Field-Programmable Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FPT.2008.4762363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 20

摘要

本文研究了两种常见的情况，其中fpga的灵活性允许人们设计特定于应用程序的浮点运算符，这些运算符比处理器和gpu提供的运算符更高效、更准确。首先，针对涉及大量浮点数相加的应用，提出了一种自适应累加器。通过根据应用程序的数值要求调整其参数，可以使其精确到任意程度，其面积成本与标准浮点加法器相当，并且频率更高。第二个例子是乘积和运算，它是矩阵计算的组成部分。提出了一种新的架构，该架构将先前的累加器从去除舍入逻辑的浮点乘法器中取出，再次改善了面积/精度权衡。这些架构在FloPoCo生成器中实现，在LGPL下免费提供。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An FPGA-specific approach to floating-point accumulation and sum-of-products

This article studies two common situations where the flexibility of FPGAs allows one to design application-specific floating-point operators which are more efficient and more accurate than those offered by processors and GPUs. First, for applications involving the addition of a large number of floating-point values, an ad-hoc accumulator is proposed. By tailoring its parameters to the numerical requirements of the application, it can be made arbitrarily accurate, at an area cost comparable to that of a standard floating-point adder, and at a higher frequency. The second example is the sum-of-product operation, which is the building block of matrix computations. A novel architecture is proposed that feeds the previous accumulator out of a floating-point multiplier whose rounding logic has been removed, again improving the area/accuracy tradeoff. These architectures are implemented within the FloPoCo generator, freely available under the LGPL.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2008 International Conference on Field-Programmable Technology

自引率

0.00%

发文量