Effect of Loop-Unrolling in Hardware Reconfigurable Implementations of RC5-192 Encryption Algorithm

2008 IEEE Region 5 Conference Pub Date : 2008-04-17 DOI:10.1109/TPSD.2008.4562753

O. Elkeelany, S. Nimmagadda

{"title":"Effect of Loop-Unrolling in Hardware Reconfigurable Implementations of RC5-192 Encryption Algorithm","authors":"O. Elkeelany, S. Nimmagadda","doi":"10.1109/TPSD.2008.4562753","DOIUrl":null,"url":null,"abstract":"The objective of this paper is to determine the effects of 'loop-unrolling' design concept, on the performance of hardware based implementations of the RC5 encryption algorithm. An effort has been done to determine the best value of the number of unrolled loops to implement the RC5 algorithm using 192-bit encryption key. The various models tested were based on single-custom processor with: no-loop-unrolling; loop unrolled implementations with different unrolling sizes. In this research, various performance measures are considered, such as the maximum frequency of operation, circuit size, throughput, and energy consumption. To achieve proper comparison results, all models were developed and mapped to the same hardware reconfigurable chip, a Field Programmable Gate Array (FPGA). The performance parameters of each model were evaluated to determine the best hardware model. Verilog hardware description language was used to model and test all implementations. While no-loop-unrolling provided the least circuit size, the 3-loop-unrolled approach provided the highest throughput, amongst all tested implementations. A throughput speed up of 24% was achieved compared to a reference system implemented in a different target device using a Xilinx FPGA family. Comparing our implementations on the same Altera FPGA family, a maximum throughput speed up of 50% was achieved.","PeriodicalId":410786,"journal":{"name":"2008 IEEE Region 5 Conference","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Region 5 Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TPSD.2008.4562753","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

The objective of this paper is to determine the effects of 'loop-unrolling' design concept, on the performance of hardware based implementations of the RC5 encryption algorithm. An effort has been done to determine the best value of the number of unrolled loops to implement the RC5 algorithm using 192-bit encryption key. The various models tested were based on single-custom processor with: no-loop-unrolling; loop unrolled implementations with different unrolling sizes. In this research, various performance measures are considered, such as the maximum frequency of operation, circuit size, throughput, and energy consumption. To achieve proper comparison results, all models were developed and mapped to the same hardware reconfigurable chip, a Field Programmable Gate Array (FPGA). The performance parameters of each model were evaluated to determine the best hardware model. Verilog hardware description language was used to model and test all implementations. While no-loop-unrolling provided the least circuit size, the 3-loop-unrolled approach provided the highest throughput, amongst all tested implementations. A throughput speed up of 24% was achieved compared to a reference system implemented in a different target device using a Xilinx FPGA family. Comparing our implementations on the same Altera FPGA family, a maximum throughput speed up of 50% was achieved.

查看原文本刊更多论文

环展开在RC5-192加密算法硬件可重构实现中的作用

本文的目的是确定“循环展开”设计概念对基于硬件实现的RC5加密算法的性能的影响。为了使用192位加密密钥实现RC5算法，已经努力确定展开循环数量的最佳值。测试的各种模型基于单一定制处理器，具有:无循环展开;具有不同展开大小的循环展开实现。在这项研究中，考虑了各种性能指标，如最大操作频率，电路尺寸，吞吐量和能耗。为了获得适当的比较结果，所有模型都被开发并映射到相同的硬件可重构芯片，即现场可编程门阵列(FPGA)。对每种型号的性能参数进行了评估，以确定最佳硬件型号。使用Verilog硬件描述语言对所有实现进行建模和测试。在所有测试的实现中，无环路展开提供了最小的电路尺寸，而3环路展开方法提供了最高的吞吐量。与使用Xilinx FPGA系列在不同目标设备上实现的参考系统相比，吞吐量速度提高了24%。比较我们在同一Altera FPGA系列上的实现，最大吞吐量速度提高了50%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 IEEE Region 5 Conference

自引率

0.00%

发文量