Hardware-efficient approximate multiplier architectures for media processing applications

IF 0.7 4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC
Anil Kumar Uppugunduru, Syed Ershad Ahmed
{"title":"Hardware-efficient approximate multiplier architectures for media processing applications","authors":"Anil Kumar Uppugunduru, Syed Ershad Ahmed","doi":"10.1108/CW-07-2020-0147","DOIUrl":null,"url":null,"abstract":"\nPurpose\nMultipliers that form the basic building blocks in most of the error-resilient media processing applications are computationally intensive and power-hungry modules. Therefore, improving the multiplier’s performance in terms of area, critical path delay and power has become an important research area. This paper aims to propose two improved multiplier designs based on a new approximate compressor circuit to reduce the hardware complexity at the partial product reduction stage. The proposed approximate 4:2 compressor design significantly reduces the overall hardware cost of the multiplier. The error introduced by the approximate compressor is reduced using a new technique of assigning inputs to the compressors in the partial product reduction structure.\n\n\nDesign/methodology/approach\nThe multiplier designs implemented using the proposed approximate 4:2 compressor are targeted for error-resilient applications. For fair comparisons, various multiplier designs, including the proposed one, are implemented in MATLAB. The quality analysis is carried out using standard images, and metrics such as structural similarity index are computed to quantify the result of proposed designs with the existing architectures. Next, Verilog gate-level designs are synthesized to compute area, delay and power to prove the efficacy of the proposed designs.\n\n\nFindings\nExhaustive error and hardware analysis have been carried out for the existing and proposed multiplier architectures. Error analysis carried out using MATLAB proves that the proposed designs achieve better quality metrics than existing designs. Hardware results show that area, the power consumed and critical path delay are reduced up to 39.8%, 51.7% and 15.9%, respectively, compared to the existing designs. Toward the end, the proposed designs impact is quantified and compared with existing designs on real-time image sharpening and image multiplication applications.\n\n\nOriginality/value\nThe area, delay and power metrics of the multiplier can be improved using an approximate compressor in an error-resilient application. Accordingly, in this work, a new compressor is proposed that reduces the hardware complexity in the multiplier architecture. However, the proposed approximate compressor, while reducing the computational complexity, tends to introduce error in the multiplier. The error introduced by the approximate compressor is reduced using a new technique of assigning inputs to the compressors in the partial product reduction structure. With the help of the approximate compressor and a technique of input realignment, hardware efficient and highly accurate multiplier designs are achieved.\n","PeriodicalId":50693,"journal":{"name":"Circuit World","volume":" ","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2021-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Circuit World","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1108/CW-07-2020-0147","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 1

Abstract

Purpose Multipliers that form the basic building blocks in most of the error-resilient media processing applications are computationally intensive and power-hungry modules. Therefore, improving the multiplier’s performance in terms of area, critical path delay and power has become an important research area. This paper aims to propose two improved multiplier designs based on a new approximate compressor circuit to reduce the hardware complexity at the partial product reduction stage. The proposed approximate 4:2 compressor design significantly reduces the overall hardware cost of the multiplier. The error introduced by the approximate compressor is reduced using a new technique of assigning inputs to the compressors in the partial product reduction structure. Design/methodology/approach The multiplier designs implemented using the proposed approximate 4:2 compressor are targeted for error-resilient applications. For fair comparisons, various multiplier designs, including the proposed one, are implemented in MATLAB. The quality analysis is carried out using standard images, and metrics such as structural similarity index are computed to quantify the result of proposed designs with the existing architectures. Next, Verilog gate-level designs are synthesized to compute area, delay and power to prove the efficacy of the proposed designs. Findings Exhaustive error and hardware analysis have been carried out for the existing and proposed multiplier architectures. Error analysis carried out using MATLAB proves that the proposed designs achieve better quality metrics than existing designs. Hardware results show that area, the power consumed and critical path delay are reduced up to 39.8%, 51.7% and 15.9%, respectively, compared to the existing designs. Toward the end, the proposed designs impact is quantified and compared with existing designs on real-time image sharpening and image multiplication applications. Originality/value The area, delay and power metrics of the multiplier can be improved using an approximate compressor in an error-resilient application. Accordingly, in this work, a new compressor is proposed that reduces the hardware complexity in the multiplier architecture. However, the proposed approximate compressor, while reducing the computational complexity, tends to introduce error in the multiplier. The error introduced by the approximate compressor is reduced using a new technique of assigning inputs to the compressors in the partial product reduction structure. With the help of the approximate compressor and a technique of input realignment, hardware efficient and highly accurate multiplier designs are achieved.
媒体处理应用的硬件高效近似乘法器架构
在大多数防错误媒体处理应用程序中,构成基本构建块的倍增器是计算密集型和耗电的模块。因此,提高乘法器在面积、关键路径延迟和功率方面的性能成为一个重要的研究领域。本文旨在提出两种基于近似压缩电路的乘法器改进设计,以降低部分积约简阶段的硬件复杂度。所提出的近似4:2压缩器设计显着降低了乘法器的总体硬件成本。在部分积约简结构中,采用分配输入给压缩机的新技术,减小了近似压缩机引入的误差。设计/方法/方法使用提议的近似4:2压缩器实现的乘法器设计针对的是容错应用。为了公平比较,我们在MATLAB中实现了各种乘法器设计,包括本文提出的乘法器。使用标准图像进行质量分析,并计算结构相似指数等度量来量化与现有架构相匹配的建议设计的结果。其次,综合Verilog门级设计,计算面积、延迟和功耗,以证明所提出设计的有效性。对现有的和提出的乘法器结构进行了详尽的误差和硬件分析。利用MATLAB进行的误差分析证明,所提出的设计比现有设计实现了更好的质量指标。硬件结果表明,与现有设计相比,该设计的面积、功耗和关键路径延迟分别降低了39.8%、51.7%和15.9%。最后,将提出的设计对实时图像锐化和图像倍增应用的影响进行量化并与现有设计进行比较。原创性/价值乘法器的面积、延迟和功率指标可以在容错应用中使用近似压缩器来改进。因此,本文提出了一种新的压缩器,以降低乘法器结构中的硬件复杂度。然而,所提出的近似压缩器在降低计算复杂度的同时,容易在乘法器中引入误差。在部分积约简结构中,采用分配输入给压缩机的新技术,减小了近似压缩机引入的误差。利用近似压缩器和输入调整技术,实现了高硬件效率和高精度的乘法器设计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Circuit World
Circuit World 工程技术-材料科学:综合
CiteScore
2.60
自引率
0.00%
发文量
33
审稿时长
>12 weeks
期刊介绍: Circuit World is a platform for state of the art, technical papers and editorials in the areas of electronics circuit, component, assembly, and product design, manufacture, test, and use, including quality, reliability and safety. The journal comprises the multidisciplinary study of the various theories, methodologies, technologies, processes and applications relating to todays and future electronics. Circuit World provides a comprehensive and authoritative information source for research, application and current awareness purposes. Circuit World covers a broad range of topics, including: • Circuit theory, design methodology, analysis and simulation • Digital, analog, microwave and optoelectronic integrated circuits • Semiconductors, passives, connectors and sensors • Electronic packaging of components, assemblies and products • PCB design technologies and processes (controlled impedance, high-speed PCBs, laminates and lamination, laser processes and drilling, moulded interconnect devices, multilayer boards, optical PCBs, single- and double-sided boards, soldering and solderable finishes) • Design for X (including manufacturability, quality, reliability, maintainability, sustainment, safety, reuse, disposal) • Internet of Things (IoT).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信