Design of a Quadruple Precision Floating-Point Fused Multiply-Add Unit Based on 4-Way SIMD Device

Jun He, Biao Wang, Ying Zhu
{"title":"Design of a Quadruple Precision Floating-Point Fused Multiply-Add Unit Based on 4-Way SIMD Device","authors":"Jun He, Biao Wang, Ying Zhu","doi":"10.1109/NAS.2013.42","DOIUrl":null,"url":null,"abstract":"Aiming to decrease the hardware cost of floating-point quadruple precision fused multiply-add (QPFMA) unit, a new QPFMA unit is designed and realized based on a 4-way SIMD device, which supports 64-bit×4 floating-point double precision fused multiply-add (DPFMA). The new QPFMA supports four kinds of FMA operations, multiplication, addition, subtraction and comparison, with the operation latency of 7 cycles. By decomposing the 113-bit×113-bit multiplication of quadruple precision fractions into four 57-bit×57-bit multiplications to share the 53-bit×53-bit multipliers of the 4-way SIMD DPFMA, the hardware cost of the new QPFMA is reduced sharply. After the new QPFMA is synthesized in 65nm cell library, the results show that it has significant advantages both in area and latency, with frequency at 1.1GHz, area 42.71% of a classic QPFMA unit, operation latency decreased by 3 cycles.","PeriodicalId":213334,"journal":{"name":"2013 IEEE Eighth International Conference on Networking, Architecture and Storage","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Eighth International Conference on Networking, Architecture and Storage","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NAS.2013.42","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Aiming to decrease the hardware cost of floating-point quadruple precision fused multiply-add (QPFMA) unit, a new QPFMA unit is designed and realized based on a 4-way SIMD device, which supports 64-bit×4 floating-point double precision fused multiply-add (DPFMA). The new QPFMA supports four kinds of FMA operations, multiplication, addition, subtraction and comparison, with the operation latency of 7 cycles. By decomposing the 113-bit×113-bit multiplication of quadruple precision fractions into four 57-bit×57-bit multiplications to share the 53-bit×53-bit multipliers of the 4-way SIMD DPFMA, the hardware cost of the new QPFMA is reduced sharply. After the new QPFMA is synthesized in 65nm cell library, the results show that it has significant advantages both in area and latency, with frequency at 1.1GHz, area 42.71% of a classic QPFMA unit, operation latency decreased by 3 cycles.
基于4路SIMD器件的四精度浮点融合乘加单元设计
为了降低浮点四精度融合乘加(QPFMA)单元的硬件成本,基于4路SIMD器件设计并实现了一种支持64-bit×4浮点双精度融合乘加(DPFMA)的新型QPFMA单元。新的QPFMA支持乘法、加法、减法和比较四种FMA运算,运算延迟为7个周期。通过将四倍精度分数的113-bit×113-bit乘法分解为4个57-bit×57-bit乘法来共享四路SIMD DPFMA的53-bit×53-bit乘法,大大降低了新型QPFMA的硬件成本。在65nm细胞库中合成了新的QPFMA后,结果表明,该QPFMA在面积和延迟方面都具有显著优势,频率为1.1GHz,面积为经典QPFMA单元的42.71%,工作延迟降低了3个周期。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信