{"title":"NEDA: a new distributed arithmetic architecture and its application to one dimensional discrete cosine transform","authors":"W. Pan, A. Shams, M. Bayoumi","doi":"10.1109/SIPS.1999.822321","DOIUrl":null,"url":null,"abstract":"Conventional Distributed Arithmetic (DA) is popular in ASIC design and it features on-chip ROM to achieve high speed and regularity. In this paper, a new DA architecture called NEDA is proposed aimed at reducing the cost metrics of power and area while maintaining high speed and accuracy in Digital Signal Processing (DSP) applications. Mathematical analysis proves that NEDA can implement inner product of vectors in the form of 2's complement numbers using only additions, followed by a small number of shifts at the final stage. Comparative study shows that NEDA outperforms widely-used approaches such as MAC and DA in many aspects. Being a high speed architecture free of ROM, multiplication and subtraction, NEDA can also expose the redundancy existing in the adder array consisting of entries of 0 and 1. A hardware compression scheme is introduced to generate a butterfly structure with minimum number of additions. NEDA-based architecture for one dimensional DCT core is presented as an example.","PeriodicalId":275030,"journal":{"name":"1999 IEEE Workshop on Signal Processing Systems. SiPS 99. Design and Implementation (Cat. No.99TH8461)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"1999 IEEE Workshop on Signal Processing Systems. SiPS 99. Design and Implementation (Cat. No.99TH8461)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIPS.1999.822321","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 31
Abstract
Conventional Distributed Arithmetic (DA) is popular in ASIC design and it features on-chip ROM to achieve high speed and regularity. In this paper, a new DA architecture called NEDA is proposed aimed at reducing the cost metrics of power and area while maintaining high speed and accuracy in Digital Signal Processing (DSP) applications. Mathematical analysis proves that NEDA can implement inner product of vectors in the form of 2's complement numbers using only additions, followed by a small number of shifts at the final stage. Comparative study shows that NEDA outperforms widely-used approaches such as MAC and DA in many aspects. Being a high speed architecture free of ROM, multiplication and subtraction, NEDA can also expose the redundancy existing in the adder array consisting of entries of 0 and 1. A hardware compression scheme is introduced to generate a butterfly structure with minimum number of additions. NEDA-based architecture for one dimensional DCT core is presented as an example.