Hardware-Software Codesign for Embedded Numerical Acceleration

2013 IEEE 21st Annual International Symposium on Field-Programmable Custom Computing Machines Pub Date : 2013-04-28 DOI:10.1109/FCCM.2013.27

Ranko Sredojevic, A. Wright, V. Stojanović

引用次数: 0

Abstract

In this work we aim to strike a balance between performance, power consumption and design effort for complex digital signal processing within the power and size constraints of embedded systems. Looking across the design stack, from algorithm formulation down to accelerator microarchitecture, we show that a high degree of flexibility and design reuse can be achieved without much performance sacrifice. The foundation of our design is a numerical accelerator template. Extensively parameterized, it allows us to develop the design while postponing microarchitectural decisions until program is known. Statically scheduling compiler provides a link between the algorithm and template instantiation parameters. Results show that the derived design can significantly outperform embedded processors for similar power cost and also approach the high-performance processor performance for a fraction of the power cost.

查看原文本刊更多论文

嵌入式数值加速的软硬件协同设计

在这项工作中，我们的目标是在嵌入式系统的功率和尺寸限制下，在复杂数字信号处理的性能、功耗和设计努力之间取得平衡。纵观整个设计堆栈，从算法制定到加速器微架构，我们表明，在不牺牲太多性能的情况下，可以实现高度的灵活性和设计重用。我们设计的基础是一个数值加速器模板。广泛参数化，它允许我们开发设计，同时推迟微架构决策，直到程序已知。静态调度编译器提供了算法和模板实例化参数之间的链接。结果表明，该设计以相似的功耗成本显著优于嵌入式处理器，并且以一小部分功耗成本接近高性能处理器的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 IEEE 21st Annual International Symposium on Field-Programmable Custom Computing Machines

自引率

0.00%

发文量