Hardware acceleration for programs in SSA form

2013 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES) Pub Date : 2013-09-29 DOI:10.1109/CASES.2013.6662518

Manuel Mohr, Artjom Grudnitsky, Tobias Modschiedler, L. Bauer, Sebastian Hack, J. Henkel

引用次数: 6

Abstract

Register allocation is one of the most time-consuming parts of the compilation process. Depending on the quality of the register allocation, a large amount of shuffle code to move values between registers is generated. In this paper, we propose a processor architecture extension to provide register file permutations by which the shuffle code can be implemented more efficiently. We present compiler support to utilize this extension, an evaluation regarding performance and compilation time using the SPEC CINT2000 benchmark, as well as an analysis of area and frequency overhead of our architecture implementation. We find that using our extension, the number of executed instructions is reduced by up to 5.1 % while the compilation time is unaffected.

查看原文本刊更多论文

SSA形式的程序的硬件加速

寄存器分配是编译过程中最耗时的部分之一。根据寄存器分配的质量，会生成大量的shuffle代码来在寄存器之间移动值。在本文中，我们提出了一个处理器架构扩展，以提供寄存器文件排列，通过该排列可以更有效地实现shuffle代码。我们提供了编译器支持来利用这个扩展，使用SPEC CINT2000基准对性能和编译时间进行了评估，以及对架构实现的面积和频率开销进行了分析。我们发现，使用我们的扩展，执行指令的数量最多减少了5.1%，而编译时间不受影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES)

自引率

0.00%

发文量