Exploring the tradeoffs between programmability and efficiency in data-parallel accelerators

2011 38th Annual International Symposium on Computer Architecture (ISCA) Pub Date : 2011-06-04 DOI:10.1145/2000064.2000080

Yunsup Lee, Rimas Avizienis, Alex Bishara, R. Xia, Derek Lockhart, C. Batten, K. Asanović

引用次数: 54

Abstract

We present a taxonomy and modular implementation approach for data-parallel accelerators, including the MIMD, vector-SIMD, subword-SIMD, SIMT, and vector-thread (VT) architectural design patterns. We have developed a new VT microarchitecture, Maven, based on the traditional vector-SIMD microarchitecture that is considerably simpler to implement and easier to program than previous VT designs. Using an extensive design-space exploration of full VLSI implementations of many accelerator design points, we evaluate the varying tradeoffs between programmability and implementation efficiency among the MIMD, vector-SIMD, and VT patterns on a workload of microbenchmarks and compiled application kernels. We find the vector cores provide greater efficiency than the MIMD cores, even on fairly irregular kernels. Our results suggest that the Maven VT microarchitecture is superior to the traditional vector-SIMD architecture, providing both greater efficiency and easier programmability.

查看原文本刊更多论文

探索数据并行加速器中可编程性和效率之间的权衡

我们提出了一种数据并行加速器的分类和模块化实现方法，包括MIMD、向量- simd、子词- simd、SIMT和向量-线程(VT)架构设计模式。我们基于传统的矢量simd微体系结构开发了一种新的VT微体系结构Maven，它比以前的VT设计更容易实现和编程。通过对许多加速器设计点的全VLSI实现进行广泛的设计空间探索，我们评估了在微基准测试和编译的应用程序内核的工作负载上，MIMD、矢量simd和VT模式之间的可编程性和实现效率之间的不同权衡。我们发现矢量内核比MIMD内核提供更高的效率，即使在相当不规则的内核上也是如此。我们的结果表明，Maven VT微体系结构优于传统的矢量simd体系结构，提供更高的效率和更容易的可编程性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 38th Annual International Symposium on Computer Architecture (ISCA)

自引率

0.00%

发文量