Speed, Power and Area Optimized Monotonic Asynchronous Array Multipliers

IF 1.8 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC

Journal of Low Power Electronics and Applications Pub Date : 2023-12-24 DOI:10.3390/jlpea14010001

Padmanabhan Balasubramanian, Nikos Mastorakis

{"title":"Speed, Power and Area Optimized Monotonic Asynchronous Array Multipliers","authors":"Padmanabhan Balasubramanian, Nikos Mastorakis","doi":"10.3390/jlpea14010001","DOIUrl":null,"url":null,"abstract":"Multiplication is a fundamental arithmetic operation in electronic processing units such as microprocessors and digital signal processors as it plays an important role in various computational tasks and applications. There exist many designs of synchronous multipliers in the literature. However, in the domain of Input–Output Mode (IOM) asynchronous design, there is relatively less published research on multipliers. Some existing works have considered quasi-delay-insensitive (QDI) asynchronous implementations of multipliers. However, the QDI asynchronous design paradigm, in general, is not area- and speed-efficient. This article presents an efficient alternative implementation of IOM asynchronous multipliers based on the concept of monotonic Boolean networks. The array multiplier architecture has been considered for demonstrating the usefulness of our proposition. The building blocks of the multiplier, such as the partial product generator, half adder, and full adder, were implemented monotonically. The popular dual-rail encoding scheme was considered for encoding the multiplier inputs and outputs, and four-phase return-to-zero handshaking (RZH) and return-to-one handshaking (ROH) were separately considered for communication. Compared to the best of the existing QDI asynchronous array multipliers, the proposed monotonic asynchronous array multiplier achieves the following reductions in design metrics: (i) a 40.1% (44.3%) reduction in cycle time (which is the asynchronous equivalent of synchronous clock timing), a 37.7% (37.7%) reduction in area, and a 4% (4.5%) reduction in power for 4 × 4 multiplication corresponding to RZH (ROH), and (ii) a 58.1% (60.2%) reduction in cycle time, a 45.2% (45.2%) reduction in area, and a 10.3% (11%) reduction in power for 8 × 8 multiplication corresponding to RZH (ROH). The multipliers were implemented using a 28 nm CMOS process technology.","PeriodicalId":38100,"journal":{"name":"Journal of Low Power Electronics and Applications","volume":"36 1","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2023-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Low Power Electronics and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/jlpea14010001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Multiplication is a fundamental arithmetic operation in electronic processing units such as microprocessors and digital signal processors as it plays an important role in various computational tasks and applications. There exist many designs of synchronous multipliers in the literature. However, in the domain of Input–Output Mode (IOM) asynchronous design, there is relatively less published research on multipliers. Some existing works have considered quasi-delay-insensitive (QDI) asynchronous implementations of multipliers. However, the QDI asynchronous design paradigm, in general, is not area- and speed-efficient. This article presents an efficient alternative implementation of IOM asynchronous multipliers based on the concept of monotonic Boolean networks. The array multiplier architecture has been considered for demonstrating the usefulness of our proposition. The building blocks of the multiplier, such as the partial product generator, half adder, and full adder, were implemented monotonically. The popular dual-rail encoding scheme was considered for encoding the multiplier inputs and outputs, and four-phase return-to-zero handshaking (RZH) and return-to-one handshaking (ROH) were separately considered for communication. Compared to the best of the existing QDI asynchronous array multipliers, the proposed monotonic asynchronous array multiplier achieves the following reductions in design metrics: (i) a 40.1% (44.3%) reduction in cycle time (which is the asynchronous equivalent of synchronous clock timing), a 37.7% (37.7%) reduction in area, and a 4% (4.5%) reduction in power for 4 × 4 multiplication corresponding to RZH (ROH), and (ii) a 58.1% (60.2%) reduction in cycle time, a 45.2% (45.2%) reduction in area, and a 10.3% (11%) reduction in power for 8 × 8 multiplication corresponding to RZH (ROH). The multipliers were implemented using a 28 nm CMOS process technology.

查看原文本刊更多论文

速度、功率和面积优化的单调异步阵列乘法器

乘法是微处理器和数字信号处理器等电子处理单元的基本算术运算，在各种计算任务和应用中发挥着重要作用。文献中有许多同步乘法器的设计。然而，在输入输出模式（IOM）异步设计领域，有关乘法器的研究成果相对较少。现有的一些著作考虑了乘法器的准延迟不敏感（QDI）异步实现。然而，QDI 异步设计范例一般不具有面积和速度效率。本文基于单调布尔网络的概念，提出了一种高效的 IOM 异步乘法器替代实现方法。我们考虑了阵列乘法器架构，以证明我们的主张是有用的。乘法器的构件，如部分乘积发生器、半加法器和全加法器，都是单调实现的。乘法器输入和输出的编码采用了流行的双轨编码方案，通信方面则分别采用了四相归零握手（RZH）和归一握手（ROH）。与现有最佳 QDI 异步阵列乘法器相比，所提出的单调异步阵列乘法器实现了以下设计指标的降低：(i) 周期时间（相当于同步时钟时序的异步时间）降低了 40.1%（44.3%）；(ii) 单调异步阵列乘法器的输入输出时间降低了 37.7%（37.7%）。7% (37.7%)，RZH (ROH) 对应的 4 × 4 乘法的功耗降低了 4% (4.5%)；(ii) RZH (ROH) 对应的 8 × 8 乘法的周期时间降低了 58.1% (60.2%)，面积降低了 45.2% (45.2%)，功耗降低了 10.3% (11%)。乘法器采用 28 纳米 CMOS 工艺技术实现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊