Hardware efficient design and implementation of multiplierless FIR filters using Sparse PSO on FPGA and ASIC

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal Pub Date : 2025-08-22 DOI:10.1016/j.vlsi.2025.102519

Teena Soni , A. Kumar , Ila Sharma , Manoj Kumar Panda

{"title":"Hardware efficient design and implementation of multiplierless FIR filters using Sparse PSO on FPGA and ASIC","authors":"Teena Soni , A. Kumar , Ila Sharma , Manoj Kumar Panda","doi":"10.1016/j.vlsi.2025.102519","DOIUrl":null,"url":null,"abstract":"<div><div>Efficient design and implementation of the FIR filter is the requirement of most of the DSP subsystems. This article presents a sparse particle swarm optimization (SPSO)-based FIR filters. A comparative analysis of the design of FIR filter is performed using accelerated PSO, quantum PSO, hybrid firefly PSO, and hybrid gravitational search PSO. The proposed design method outperforms existing methods in the literature. This work also proposes two efficient architectures for FIR filter implementation: improved parallel distributed arithmetic with carry-save adder tree and radix-2<em>r</em> arithmetic with carry-save adder tree. The FPGA and ASIC implementation of the proposed architectures are performed. A 45 nm Nangate open cell library is used for ASIC implementation and a Basys 3 FPGA board is used for FPGA implementation. The hardware metrics such as resource utilization, power consumption, and delay are compared with existing architectures. The proposed parallel DA with CSA tree architecture provides 44.03 %, 54.91 %, and 77.33 % average reduction in slice LUTs, slices utilization, and power consumption, respectively, and the proposed radix-2<em>r</em> FIR with CSA tree architecture provides an average reduction of 47.11 % in slice LUTs and 56.37 % slices utilization for FPGA implementation. For ASIC implementation, the average reduction of 71.74 % in area and 67.27 % in area-power product (APP) using the proposed parallel DA with CSA tree architecture is obtained. The proposed radix-2<em>r</em>-based FIR with CSA tree architecture provides an average reduction of 87.24 % in area, 59.74 % in area-delay product ADP, and 90.98 % in APP, compared with architectures in previous works.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"105 ","pages":"Article 102519"},"PeriodicalIF":2.5000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Integration-The Vlsi Journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167926025001762","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

Abstract

Efficient design and implementation of the FIR filter is the requirement of most of the DSP subsystems. This article presents a sparse particle swarm optimization (SPSO)-based FIR filters. A comparative analysis of the design of FIR filter is performed using accelerated PSO, quantum PSO, hybrid firefly PSO, and hybrid gravitational search PSO. The proposed design method outperforms existing methods in the literature. This work also proposes two efficient architectures for FIR filter implementation: improved parallel distributed arithmetic with carry-save adder tree and radix-2r arithmetic with carry-save adder tree. The FPGA and ASIC implementation of the proposed architectures are performed. A 45 nm Nangate open cell library is used for ASIC implementation and a Basys 3 FPGA board is used for FPGA implementation. The hardware metrics such as resource utilization, power consumption, and delay are compared with existing architectures. The proposed parallel DA with CSA tree architecture provides 44.03 %, 54.91 %, and 77.33 % average reduction in slice LUTs, slices utilization, and power consumption, respectively, and the proposed radix-2r FIR with CSA tree architecture provides an average reduction of 47.11 % in slice LUTs and 56.37 % slices utilization for FPGA implementation. For ASIC implementation, the average reduction of 71.74 % in area and 67.27 % in area-power product (APP) using the proposed parallel DA with CSA tree architecture is obtained. The proposed radix-2r-based FIR with CSA tree architecture provides an average reduction of 87.24 % in area, 59.74 % in area-delay product ADP, and 90.98 % in APP, compared with architectures in previous works.

查看原文本刊更多论文

基于稀疏PSO的无乘法器FIR滤波器在FPGA和ASIC上的硬件高效设计与实现

FIR滤波器的高效设计和实现是大多数DSP子系统的要求。本文提出了一种基于稀疏粒子群优化（SPSO）的FIR滤波器。对加速粒子群、量子粒子群、混合萤火虫粒子群和混合引力搜索粒子群的FIR滤波器设计进行了比较分析。所提出的设计方法优于现有的文献方法。本文还提出了两种有效的FIR滤波器实现架构：改进的带进位加法器树的并行分布式算法和带进位加法器树的基数-2r算法。对所提出的体系结构进行了FPGA和ASIC实现。采用45nm的Nangate开放单元库实现ASIC，采用Basys 3 FPGA板实现FPGA。将硬件指标（如资源利用率、功耗和延迟）与现有体系结构进行比较。采用CSA树形结构的并行数据处理在片lut、片利用率和功耗方面分别平均降低44.03%、54.91%和77.33%，采用CSA树形结构的radix-2r FIR在FPGA实现方面平均降低47.11%的片lut和56.37%的片利用率。在ASIC实现中，采用CSA树形结构的并行数据处理，面积平均减少71.74%，面积功率产品（APP）平均减少67.27%。与以往的结构相比，本文提出的基于基数2r的CSA树结构FIR的面积平均减少了87.24%，面积延迟产品ADP平均减少了59.74%，APP平均减少了90.98%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Integration-The Vlsi Journal 工程技术-工程：电子与电气

CiteScore

3.80

自引率

5.30%

发文量

107

审稿时长

6 months

期刊介绍： Integration''s aim is to cover every aspect of the VLSI area, with an emphasis on cross-fertilization between various fields of science, and the design, verification, test and applications of integrated circuits and systems, as well as closely related topics in process and device technologies. Individual issues will feature peer-reviewed tutorials and articles as well as reviews of recent publications. The intended coverage of the journal can be assessed by examining the following (non-exclusive) list of topics: Specification methods and languages; Analog/Digital Integrated Circuits and Systems; VLSI architectures; Algorithms, methods and tools for modeling, simulation, synthesis and verification of integrated circuits and systems of any complexity; Embedded systems; High-level synthesis for VLSI systems; Logic synthesis and finite automata; Testing, design-for-test and test generation algorithms; Physical design; Formal verification; Algorithms implemented in VLSI systems; Systems engineering; Heterogeneous systems.