Improved Approximate Multipliers for Single-Precision Floating-Point Hardware Design

2022 IEEE 13th Latin America Symposium on Circuits and System (LASCAS). Pub Date: 2022-03-01. DOI: 10.1109/LASCAS53948.2022.9789077

Patrícia U. L. da Costa, P. Pereira, B. Abreu, Guilherme Paim, E. Costa, S. Bampi


Citations: 4

Abstract

State-of-the-art integer approximate multipliers (AxMs) contain dispensable blocks when they are embedded in a floating-point (FP) architecture. This paper proposes and implements arithmetic simplifications that significantly improve four state-of-the-art AxMs for FP. Results for 32-bit FP (FP-32) show that the improved 24-bit integer AxMs (i.e., specific to FP) reduce area by about 4.2x up to 12.9x across the four AxMs, each compared with its original generic 24-bit integer AxM. We also perform an approximate computing (AxC) design space exploration (DSE) of FP-32 Least Mean Squares Adaptive Filter (LMS-AF) architectures employing the four improved AxM proposals. We present quality-energy and quality-area DSE trade-offs for an approximate FP-32 LMS-AF kernel as Pareto fronts, showing that fully functional harmonics elimination is maintained. Along the Pareto front, total energy reduction ranges from 43.4% (1.27x) to 70.3% (3.37x) w.r.t. the precise multiplier.
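To make the FP-32 setting concrete, the sketch below shows where a 24-bit integer multiplier sits inside a single-precision multiply: the two 24-bit significands (23 stored bits plus the hidden one) are multiplied into a product of up to 48 bits, of which only the upper bits survive into the result. This is an illustration of standard IEEE 754 arithmetic, not the paper's simplifications. The function names, the pluggable `int_mul` parameter, and the operand-truncating multiplier are assumptions made for this example; the sketch truncates instead of rounding and handles only normal, finite operands whose product stays in the normal range.

```python
import struct


def f32_bits(x: float) -> int:
    """Bit pattern of x after rounding to IEEE 754 single precision."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def bits_f32(u: int) -> float:
    """Single-precision value for a 32-bit pattern."""
    return struct.unpack("<f", struct.pack("<I", u))[0]


def exact_mul(x: int, y: int) -> int:
    """Reference 24x24-bit integer multiplier."""
    return x * y


def truncated_mul(k: int):
    """Hypothetical approximate multiplier: drops the k low bits of each operand."""
    def mul(x: int, y: int) -> int:
        return ((x >> k) * (y >> k)) << (2 * k)
    return mul


def fp32_mul(a: float, b: float, int_mul=exact_mul) -> float:
    """FP-32 multiply built around a pluggable 24-bit integer multiplier.

    Assumes normal, finite inputs and a result in the normal range.
    The result is truncated (no round-to-nearest).
    """
    ua, ub = f32_bits(a), f32_bits(b)
    sign = (ua ^ ub) >> 31
    exp = ((ua >> 23) & 0xFF) + ((ub >> 23) & 0xFF) - 127  # remove one bias
    ma = (ua & 0x7FFFFF) | 0x800000  # restore the hidden leading one
    mb = (ub & 0x7FFFFF) | 0x800000
    prod = int_mul(ma, mb)           # lies in [2^46, 2^48)
    if prod >> 47:                   # significand product in [2, 4): renormalize
        prod >>= 1
        exp += 1
    frac = (prod >> 23) & 0x7FFFFF   # keep the 23 bits below the hidden one
    return bits_f32((sign << 31) | (exp << 23) | frac)


if __name__ == "__main__":
    print(fp32_mul(1.5, 2.5))  # 3.75
    print(fp32_mul(1.1, 3.3), fp32_mul(1.1, 3.3, truncated_mul(8)))
```

Because the low 23 bits of the product are discarded after normalization, errors introduced in the least significant part of the integer product have limited effect on the FP result, which is one reason a multiplier specialized for FP can be smaller than a generic integer one.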
