Teena Soni , A. Kumar , Ila Sharma , Manoj Kumar Panda
{"title":"Hardware efficient design and implementation of multiplierless FIR filters using Sparse PSO on FPGA and ASIC","authors":"Teena Soni , A. Kumar , Ila Sharma , Manoj Kumar Panda","doi":"10.1016/j.vlsi.2025.102519","DOIUrl":null,"url":null,"abstract":"<div><div>Efficient design and implementation of the FIR filter is the requirement of most of the DSP subsystems. This article presents a sparse particle swarm optimization (SPSO)-based FIR filters. A comparative analysis of the design of FIR filter is performed using accelerated PSO, quantum PSO, hybrid firefly PSO, and hybrid gravitational search PSO. The proposed design method outperforms existing methods in the literature. This work also proposes two efficient architectures for FIR filter implementation: improved parallel distributed arithmetic with carry-save adder tree and radix-2<em>r</em> arithmetic with carry-save adder tree. The FPGA and ASIC implementation of the proposed architectures are performed. A 45 nm Nangate open cell library is used for ASIC implementation and a Basys 3 FPGA board is used for FPGA implementation. The hardware metrics such as resource utilization, power consumption, and delay are compared with existing architectures. The proposed parallel DA with CSA tree architecture provides 44.03 %, 54.91 %, and 77.33 % average reduction in slice LUTs, slices utilization, and power consumption, respectively, and the proposed radix-2<em>r</em> FIR with CSA tree architecture provides an average reduction of 47.11 % in slice LUTs and 56.37 % slices utilization for FPGA implementation. For ASIC implementation, the average reduction of 71.74 % in area and 67.27 % in area-power product (APP) using the proposed parallel DA with CSA tree architecture is obtained. The proposed radix-2<em>r</em>-based FIR with CSA tree architecture provides an average reduction of 87.24 % in area, 59.74 % in area-delay product ADP, and 90.98 % in APP, compared with architectures in previous works.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"105 ","pages":"Article 102519"},"PeriodicalIF":2.5000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Integration-The Vlsi Journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167926025001762","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Efficient design and implementation of the FIR filter is the requirement of most of the DSP subsystems. This article presents a sparse particle swarm optimization (SPSO)-based FIR filters. A comparative analysis of the design of FIR filter is performed using accelerated PSO, quantum PSO, hybrid firefly PSO, and hybrid gravitational search PSO. The proposed design method outperforms existing methods in the literature. This work also proposes two efficient architectures for FIR filter implementation: improved parallel distributed arithmetic with carry-save adder tree and radix-2r arithmetic with carry-save adder tree. The FPGA and ASIC implementation of the proposed architectures are performed. A 45 nm Nangate open cell library is used for ASIC implementation and a Basys 3 FPGA board is used for FPGA implementation. The hardware metrics such as resource utilization, power consumption, and delay are compared with existing architectures. The proposed parallel DA with CSA tree architecture provides 44.03 %, 54.91 %, and 77.33 % average reduction in slice LUTs, slices utilization, and power consumption, respectively, and the proposed radix-2r FIR with CSA tree architecture provides an average reduction of 47.11 % in slice LUTs and 56.37 % slices utilization for FPGA implementation. For ASIC implementation, the average reduction of 71.74 % in area and 67.27 % in area-power product (APP) using the proposed parallel DA with CSA tree architecture is obtained. The proposed radix-2r-based FIR with CSA tree architecture provides an average reduction of 87.24 % in area, 59.74 % in area-delay product ADP, and 90.98 % in APP, compared with architectures in previous works.
期刊介绍:
Integration''s aim is to cover every aspect of the VLSI area, with an emphasis on cross-fertilization between various fields of science, and the design, verification, test and applications of integrated circuits and systems, as well as closely related topics in process and device technologies. Individual issues will feature peer-reviewed tutorials and articles as well as reviews of recent publications. The intended coverage of the journal can be assessed by examining the following (non-exclusive) list of topics:
Specification methods and languages; Analog/Digital Integrated Circuits and Systems; VLSI architectures; Algorithms, methods and tools for modeling, simulation, synthesis and verification of integrated circuits and systems of any complexity; Embedded systems; High-level synthesis for VLSI systems; Logic synthesis and finite automata; Testing, design-for-test and test generation algorithms; Physical design; Formal verification; Algorithms implemented in VLSI systems; Systems engineering; Heterogeneous systems.