{"title":"Improved fused floating point add-subtract and multiply-add unit for FFT implementation","authors":"P. Palsodkar, A. Gurjar","doi":"10.1109/ICDCSYST.2014.6926157","DOIUrl":null,"url":null,"abstract":"This paper describes the design and implementation of user defined fused floating-point arithmetic operations that can be used to implement Radix 2 Fast Fourier Transform (FFT) for complex numbers used in Digital Signal Processing (DSP-C) processors. The design is implemented and simulated by targeting Xilinx vertex 5 FPGA device. This paper describes the optimization of fused floating point modules in terms of area, delay, power and energy. Here we have achieved reduction in area (in terms of LUT required) by 27.09%, reduced delay by 7.10%, reduction in power consumption by 11 % and energy is reduced by 26.22% as compared to discrete implementation.","PeriodicalId":252016,"journal":{"name":"2014 2nd International Conference on Devices, Circuits and Systems (ICDCS)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 2nd International Conference on Devices, Circuits and Systems (ICDCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDCSYST.2014.6926157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
This paper describes the design and implementation of user defined fused floating-point arithmetic operations that can be used to implement Radix 2 Fast Fourier Transform (FFT) for complex numbers used in Digital Signal Processing (DSP-C) processors. The design is implemented and simulated by targeting Xilinx vertex 5 FPGA device. This paper describes the optimization of fused floating point modules in terms of area, delay, power and energy. Here we have achieved reduction in area (in terms of LUT required) by 27.09%, reduced delay by 7.10%, reduction in power consumption by 11 % and energy is reduced by 26.22% as compared to discrete implementation.