{"title":"LTE:轻量级、定时高效的不等规模多项式乘法加速器","authors":"Yazheng Tu;Tianyou Bao;Pengzhou He;Leonel Sousa;Jiafeng Xie","doi":"10.1109/TCSII.2024.3458871","DOIUrl":null,"url":null,"abstract":"Integer polynomial multiplication has been frequently used in post-quantum cryptography and fully homomorphic encryption systems. Particularly, there exists a special polynomial multiplication, where the polynomial degree can be a power of two and the coefficients of the two input polynomials are unequal-sized (difficult to deploy fast algorithm for implementation efficiency). Following this direction, in this brief, we propose a novel hardware-implemented Lightweight and Timing-Efficient (LTE) integer polynomial multiplication design framework. We proposed two new algorithms for efficient implementation of the targeted polynomial multiplication. Accordingly, we presented two hardware accelerators with the help of several new hardware design techniques. The final implementation showcases the proposed accelerators’ superior performance, e.g., the proposed Accelerator-I \n<inline-formula> <tex-math>$(v=512)$ </tex-math></inline-formula>\n has 44.7% less equivalent area-delay product (EADP) than the state-of-the-art design for \n<inline-formula> <tex-math>$n=4,096$ </tex-math></inline-formula>\n, while the proposed Accelerator-II has at least 38.7% less ADP than the competing designs for \n<inline-formula> <tex-math>$n=1,024$ </tex-math></inline-formula>\n. The proposed strategy is highly efficient and can be extended for other usage.","PeriodicalId":13101,"journal":{"name":"IEEE Transactions on Circuits and Systems II: Express Briefs","volume":"72 1","pages":"253-257"},"PeriodicalIF":4.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LTE: Lightweight and Timing-Efficient Unequal-Sized Polynomial Multiplication Accelerators\",\"authors\":\"Yazheng Tu;Tianyou Bao;Pengzhou He;Leonel Sousa;Jiafeng Xie\",\"doi\":\"10.1109/TCSII.2024.3458871\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Integer polynomial multiplication has been frequently used in post-quantum cryptography and fully homomorphic encryption systems. Particularly, there exists a special polynomial multiplication, where the polynomial degree can be a power of two and the coefficients of the two input polynomials are unequal-sized (difficult to deploy fast algorithm for implementation efficiency). Following this direction, in this brief, we propose a novel hardware-implemented Lightweight and Timing-Efficient (LTE) integer polynomial multiplication design framework. We proposed two new algorithms for efficient implementation of the targeted polynomial multiplication. Accordingly, we presented two hardware accelerators with the help of several new hardware design techniques. The final implementation showcases the proposed accelerators’ superior performance, e.g., the proposed Accelerator-I \\n<inline-formula> <tex-math>$(v=512)$ </tex-math></inline-formula>\\n has 44.7% less equivalent area-delay product (EADP) than the state-of-the-art design for \\n<inline-formula> <tex-math>$n=4,096$ </tex-math></inline-formula>\\n, while the proposed Accelerator-II has at least 38.7% less ADP than the competing designs for \\n<inline-formula> <tex-math>$n=1,024$ </tex-math></inline-formula>\\n. The proposed strategy is highly efficient and can be extended for other usage.\",\"PeriodicalId\":13101,\"journal\":{\"name\":\"IEEE Transactions on Circuits and Systems II: Express Briefs\",\"volume\":\"72 1\",\"pages\":\"253-257\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Circuits and Systems II: Express Briefs\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10678754/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Circuits and Systems II: Express Briefs","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10678754/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
摘要
整数多项式乘法在后量子密码和全同态加密系统中得到了广泛的应用。特别是,存在一种特殊的多项式乘法,其中多项式次数可以是2的幂,并且两个输入多项式的系数大小不等(为了实现效率难以部署快速算法)。遵循这个方向,在本文中,我们提出了一种新的硬件实现的轻量级和定时效率(LTE)整数多项式乘法设计框架。我们提出了两种新的算法来有效地实现目标多项式乘法。因此,我们在几种新的硬件设计技术的帮助下,提出了两种硬件加速器。最终的实现展示了所提议的加速器的优越性能,例如,所提议的加速器- i $(v=512)$的等效面积延迟产品(EADP)比最先进的设计(n=4,096$)少44.7%,而所提议的加速器- ii的ADP比竞争设计(n=1,024$)至少少38.7%。该策略效率高,可扩展到其他用途。
LTE: Lightweight and Timing-Efficient Unequal-Sized Polynomial Multiplication Accelerators
Integer polynomial multiplication has been frequently used in post-quantum cryptography and fully homomorphic encryption systems. Particularly, there exists a special polynomial multiplication, where the polynomial degree can be a power of two and the coefficients of the two input polynomials are unequal-sized (difficult to deploy fast algorithm for implementation efficiency). Following this direction, in this brief, we propose a novel hardware-implemented Lightweight and Timing-Efficient (LTE) integer polynomial multiplication design framework. We proposed two new algorithms for efficient implementation of the targeted polynomial multiplication. Accordingly, we presented two hardware accelerators with the help of several new hardware design techniques. The final implementation showcases the proposed accelerators’ superior performance, e.g., the proposed Accelerator-I
$(v=512)$
has 44.7% less equivalent area-delay product (EADP) than the state-of-the-art design for
$n=4,096$
, while the proposed Accelerator-II has at least 38.7% less ADP than the competing designs for
$n=1,024$
. The proposed strategy is highly efficient and can be extended for other usage.
期刊介绍:
TCAS II publishes brief papers in the field specified by the theory, analysis, design, and practical implementations of circuits, and the application of circuit techniques to systems and to signal processing. Included is the whole spectrum from basic scientific theory to industrial applications. The field of interest covered includes:
Circuits: Analog, Digital and Mixed Signal Circuits and Systems
Nonlinear Circuits and Systems, Integrated Sensors, MEMS and Systems on Chip, Nanoscale Circuits and Systems, Optoelectronic
Circuits and Systems, Power Electronics and Systems
Software for Analog-and-Logic Circuits and Systems
Control aspects of Circuits and Systems.