Effect of Loop-Unrolling in Hardware Reconfigurable Implementations of RC5-192 Encryption Algorithm

O. Elkeelany, S. Nimmagadda
{"title":"Effect of Loop-Unrolling in Hardware Reconfigurable Implementations of RC5-192 Encryption Algorithm","authors":"O. Elkeelany, S. Nimmagadda","doi":"10.1109/TPSD.2008.4562753","DOIUrl":null,"url":null,"abstract":"The objective of this paper is to determine the effects of 'loop-unrolling' design concept, on the performance of hardware based implementations of the RC5 encryption algorithm. An effort has been done to determine the best value of the number of unrolled loops to implement the RC5 algorithm using 192-bit encryption key. The various models tested were based on single-custom processor with: no-loop-unrolling; loop unrolled implementations with different unrolling sizes. In this research, various performance measures are considered, such as the maximum frequency of operation, circuit size, throughput, and energy consumption. To achieve proper comparison results, all models were developed and mapped to the same hardware reconfigurable chip, a Field Programmable Gate Array (FPGA). The performance parameters of each model were evaluated to determine the best hardware model. Verilog hardware description language was used to model and test all implementations. While no-loop-unrolling provided the least circuit size, the 3-loop-unrolled approach provided the highest throughput, amongst all tested implementations. A throughput speed up of 24% was achieved compared to a reference system implemented in a different target device using a Xilinx FPGA family. Comparing our implementations on the same Altera FPGA family, a maximum throughput speed up of 50% was achieved.","PeriodicalId":410786,"journal":{"name":"2008 IEEE Region 5 Conference","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Region 5 Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TPSD.2008.4562753","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

The objective of this paper is to determine the effects of 'loop-unrolling' design concept, on the performance of hardware based implementations of the RC5 encryption algorithm. An effort has been done to determine the best value of the number of unrolled loops to implement the RC5 algorithm using 192-bit encryption key. The various models tested were based on single-custom processor with: no-loop-unrolling; loop unrolled implementations with different unrolling sizes. In this research, various performance measures are considered, such as the maximum frequency of operation, circuit size, throughput, and energy consumption. To achieve proper comparison results, all models were developed and mapped to the same hardware reconfigurable chip, a Field Programmable Gate Array (FPGA). The performance parameters of each model were evaluated to determine the best hardware model. Verilog hardware description language was used to model and test all implementations. While no-loop-unrolling provided the least circuit size, the 3-loop-unrolled approach provided the highest throughput, amongst all tested implementations. A throughput speed up of 24% was achieved compared to a reference system implemented in a different target device using a Xilinx FPGA family. Comparing our implementations on the same Altera FPGA family, a maximum throughput speed up of 50% was achieved.
环展开在RC5-192加密算法硬件可重构实现中的作用
本文的目的是确定“循环展开”设计概念对基于硬件实现的RC5加密算法的性能的影响。为了使用192位加密密钥实现RC5算法,已经努力确定展开循环数量的最佳值。测试的各种模型基于单一定制处理器,具有:无循环展开;具有不同展开大小的循环展开实现。在这项研究中,考虑了各种性能指标,如最大操作频率,电路尺寸,吞吐量和能耗。为了获得适当的比较结果,所有模型都被开发并映射到相同的硬件可重构芯片,即现场可编程门阵列(FPGA)。对每种型号的性能参数进行了评估,以确定最佳硬件型号。使用Verilog硬件描述语言对所有实现进行建模和测试。在所有测试的实现中,无环路展开提供了最小的电路尺寸,而3环路展开方法提供了最高的吞吐量。与使用Xilinx FPGA系列在不同目标设备上实现的参考系统相比,吞吐量速度提高了24%。比较我们在同一Altera FPGA系列上的实现,最大吞吐量速度提高了50%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信