FPGA上GF(p)椭圆曲线点乘法的高效并行硬件架构

IF 1.6 3区 工程技术 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC
Ruoyu Wu, Binchao Yu, Zhaofeng Chen, Xiangyu Li, Guanzhong Tian
{"title":"FPGA上GF(p)椭圆曲线点乘法的高效并行硬件架构","authors":"Ruoyu Wu,&nbsp;Binchao Yu,&nbsp;Zhaofeng Chen,&nbsp;Xiangyu Li,&nbsp;Guanzhong Tian","doi":"10.1002/cta.4495","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Elliptic curve point multiplication is a critical component in elliptic curve cryptography, serving as a key computational module for fast and secure encryption. To enhance computational performance, parallelism and pipelining are commonly employed techniques in hardware design. However, the stringent computational dependencies in elliptic curve cryptography and the complex combinational logic required for large integer multiplication present significant challenges for area-efficient hardware design, leading to performance bottlenecks. This paper aims to improve system frequency by employing multilevel pipeline partitioning to optimize the critical path. We implement an efficient, low-latency, high-radix modular multiplication unit by combining parallel multiplication with deep pipelining. Building on this high-frequency, high-throughput modular multiplication unit, we propose a high-performance ECPM architecture. Multistage pipelines are inserted into each unit of the ECPM to maintain high-frequency operation across the entire system. Additionally, by analyzing and refining the combined point quadruple-point addition computation flow and data dependencies, we introduce a no-idle parallel modular multiplier architecture, which improves cycle efficiency per computation iteration by 9<i>%</i>. Experimental results on the Virtex-7 FPGA platform demonstrate that, compared with related works, the proposed ECPM architecture achieves a 30<i>%</i> improvement in Area \n<span></span><math>\n <mo>×</mo></math> Time.</p>\n </div>","PeriodicalId":13874,"journal":{"name":"International Journal of Circuit Theory and Applications","volume":"53 10","pages":"5839-5850"},"PeriodicalIF":1.6000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient Parallel Hardware Architecture for GF(p) Elliptic Curve Point Multiplication on FPGA\",\"authors\":\"Ruoyu Wu,&nbsp;Binchao Yu,&nbsp;Zhaofeng Chen,&nbsp;Xiangyu Li,&nbsp;Guanzhong Tian\",\"doi\":\"10.1002/cta.4495\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Elliptic curve point multiplication is a critical component in elliptic curve cryptography, serving as a key computational module for fast and secure encryption. To enhance computational performance, parallelism and pipelining are commonly employed techniques in hardware design. However, the stringent computational dependencies in elliptic curve cryptography and the complex combinational logic required for large integer multiplication present significant challenges for area-efficient hardware design, leading to performance bottlenecks. This paper aims to improve system frequency by employing multilevel pipeline partitioning to optimize the critical path. We implement an efficient, low-latency, high-radix modular multiplication unit by combining parallel multiplication with deep pipelining. Building on this high-frequency, high-throughput modular multiplication unit, we propose a high-performance ECPM architecture. Multistage pipelines are inserted into each unit of the ECPM to maintain high-frequency operation across the entire system. Additionally, by analyzing and refining the combined point quadruple-point addition computation flow and data dependencies, we introduce a no-idle parallel modular multiplier architecture, which improves cycle efficiency per computation iteration by 9<i>%</i>. Experimental results on the Virtex-7 FPGA platform demonstrate that, compared with related works, the proposed ECPM architecture achieves a 30<i>%</i> improvement in Area \\n<span></span><math>\\n <mo>×</mo></math> Time.</p>\\n </div>\",\"PeriodicalId\":13874,\"journal\":{\"name\":\"International Journal of Circuit Theory and Applications\",\"volume\":\"53 10\",\"pages\":\"5839-5850\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2025-03-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Circuit Theory and Applications\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cta.4495\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Circuit Theory and Applications","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cta.4495","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

椭圆曲线点乘法是椭圆曲线密码学的关键组成部分,是实现快速、安全加密的关键计算模块。为了提高计算性能,并行和流水线是硬件设计中常用的技术。然而,椭圆曲线密码学中严格的计算依赖性和大整数乘法所需的复杂组合逻辑对面积高效的硬件设计提出了重大挑战,导致性能瓶颈。本文旨在通过多级管道划分优化关键路径来提高系统频率。我们将并行乘法与深度流水线相结合,实现了一个高效、低延迟、高基数的模块化乘法单元。基于这种高频、高通量的模块化乘法单元,我们提出了一种高性能ECPM架构。ECPM的每个单元都插入了多级管道,以保持整个系统的高频运行。此外,通过分析和细化组合点四点加法的计算流程和数据依赖关系,我们引入了一种无空闲并行模块化乘法器架构,每次计算迭代的周期效率提高了9%。在Virtex-7 FPGA平台上的实验结果表明,与相关工作相比,所提出的ECPM架构在Area × Time上提高了30%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Efficient Parallel Hardware Architecture for GF(p) Elliptic Curve Point Multiplication on FPGA

Efficient Parallel Hardware Architecture for GF(p) Elliptic Curve Point Multiplication on FPGA

Elliptic curve point multiplication is a critical component in elliptic curve cryptography, serving as a key computational module for fast and secure encryption. To enhance computational performance, parallelism and pipelining are commonly employed techniques in hardware design. However, the stringent computational dependencies in elliptic curve cryptography and the complex combinational logic required for large integer multiplication present significant challenges for area-efficient hardware design, leading to performance bottlenecks. This paper aims to improve system frequency by employing multilevel pipeline partitioning to optimize the critical path. We implement an efficient, low-latency, high-radix modular multiplication unit by combining parallel multiplication with deep pipelining. Building on this high-frequency, high-throughput modular multiplication unit, we propose a high-performance ECPM architecture. Multistage pipelines are inserted into each unit of the ECPM to maintain high-frequency operation across the entire system. Additionally, by analyzing and refining the combined point quadruple-point addition computation flow and data dependencies, we introduce a no-idle parallel modular multiplier architecture, which improves cycle efficiency per computation iteration by 9%. Experimental results on the Virtex-7 FPGA platform demonstrate that, compared with related works, the proposed ECPM architecture achieves a 30% improvement in Area × Time.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Circuit Theory and Applications
International Journal of Circuit Theory and Applications 工程技术-工程:电子与电气
CiteScore
3.60
自引率
34.80%
发文量
277
审稿时长
4.5 months
期刊介绍: The scope of the Journal comprises all aspects of the theory and design of analog and digital circuits together with the application of the ideas and techniques of circuit theory in other fields of science and engineering. Examples of the areas covered include: Fundamental Circuit Theory together with its mathematical and computational aspects; Circuit modeling of devices; Synthesis and design of filters and active circuits; Neural networks; Nonlinear and chaotic circuits; Signal processing and VLSI; Distributed, switched and digital circuits; Power electronics; Solid state devices. Contributions to CAD and simulation are welcome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信