How orthogonal are we? A note on fast and accurate inner product computation in the floating-point arithmetic
B. Cyganek, K. Wiatr
2019 First International Conference on Societal Automation (SA), September 2019
DOI: 10.1109/sa47457.2019.8938060
Citations: 0
Abstract
The multiply-and-accumulate (MAC) operation is among the most fundamental in digital signal processing. It constitutes the key operation in digital filters as well as in object classification, to name just two applications. Recently we faced implementation issues related to tensor decomposition into orthogonal factors, as well as tensor projections onto orthogonal tensor bases; these also rely heavily on MAC operations. However, a serious trade-off between numerical accuracy and operation speed can be observed when MAC operations are implemented in floating-point arithmetic, whether realized on a hardware or a software platform. If not approached carefully, this can lead to significant numerical errors, which are frequently overlooked by inexperienced engineers relying heavily on standard libraries. In this paper we revisit and discuss some computational aspects of floating-point implementations of MAC operations in big-data signal processing tasks on various platforms and in different operation modes (serial, parallel, software, hardware). Algorithms such as simple summation, the Kahan algorithm, and some hybrid solutions are analyzed with respect to their accuracy, simplicity of implementation, resource consumption, and speed of execution.
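The accuracy gap the abstract alludes to is easy to reproduce. The sketch below (my illustration, not code from the paper) compares a naive left-to-right inner product with one accumulated via Kahan's compensated summation; the function names `naive_dot` and `kahan_dot` are mine, and `math.fsum` serves as a correctly rounded reference:

```python
import math

def naive_dot(x, y):
    """Inner product with plain left-to-right accumulation."""
    acc = 0.0
    for xi, yi in zip(x, y):
        acc += xi * yi
    return acc

def kahan_dot(x, y):
    """Inner product accumulated with Kahan's compensated summation.

    The compensation term c captures the low-order bits that are
    rounded away each time a small product is added to the large
    running sum, and feeds them back into the next addition.
    """
    acc = 0.0
    c = 0.0
    for xi, yi in zip(x, y):
        term = xi * yi - c
        t = acc + term        # low-order bits of term may be lost here
        c = (t - acc) - term  # algebraically zero; in floats, the lost part
        acc = t
    return acc

# One million products of 0.1 * 1.0: naive accumulation drifts away from
# the correctly rounded reference, while the compensated version stays
# within a few ulps of it.
x = [0.1] * 10**6
y = [1.0] * 10**6
ref = math.fsum(xi * yi for xi, yi in zip(x, y))
print("naive error:", abs(naive_dot(x, y) - ref))
print("kahan error:", abs(kahan_dot(x, y) - ref))
```

Kahan's trick costs three extra floating-point additions per term but keeps the accumulated error essentially independent of the vector length, which is why the paper weighs it against plain summation on speed as well as accuracy.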