G-Scalar: Cost-Effective Generalized Scalar Execution Architecture for Power-Efficient GPUs

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) Pub Date : 2017-02-01 DOI:10.1109/HPCA.2017.51

Zhenhong Liu, S. Gilani, M. Annavaram, N. Kim

{"title":"G-Scalar: Cost-Effective Generalized Scalar Execution Architecture for Power-Efficient GPUs","authors":"Zhenhong Liu, S. Gilani, M. Annavaram, N. Kim","doi":"10.1109/HPCA.2017.51","DOIUrl":null,"url":null,"abstract":"The GPU has provide higher throughput by integrating more execution resources into a single chip without unduly compromising power efficiency. With the power wall challenge, however, increasing the throughput will require significant improvement in power efficiency. To accomplish this goal, we propose G-Scalar, a cost-effective generalized scalar execution architecture for GPUs in this paper. G-Scalar offers two key advantages over prior architectures supporting scalar execution for only non-divergent arithmetic/logic instructions. First, G-Scalar is more power-efficient as it can also support scalar execution of divergent and special-function instructions, the fraction of which in contemporary GPU applications has notably increased. Second, G-Scalar is less expensive as it can share most of its hardware resources with register value compression, of which adoption has been strongly promoted to reduce high power consumption of accessing the large register file. Compared with the baseline and previous scalar architectures, G-Scalar improves power efficiency by 24% and 15%, respectively, at a negligible cost.","PeriodicalId":118950,"journal":{"name":"2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)","volume":"134 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCA.2017.51","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 17

Abstract

The GPU has provide higher throughput by integrating more execution resources into a single chip without unduly compromising power efficiency. With the power wall challenge, however, increasing the throughput will require significant improvement in power efficiency. To accomplish this goal, we propose G-Scalar, a cost-effective generalized scalar execution architecture for GPUs in this paper. G-Scalar offers two key advantages over prior architectures supporting scalar execution for only non-divergent arithmetic/logic instructions. First, G-Scalar is more power-efficient as it can also support scalar execution of divergent and special-function instructions, the fraction of which in contemporary GPU applications has notably increased. Second, G-Scalar is less expensive as it can share most of its hardware resources with register value compression, of which adoption has been strongly promoted to reduce high power consumption of accessing the large register file. Compared with the baseline and previous scalar architectures, G-Scalar improves power efficiency by 24% and 15%, respectively, at a negligible cost.

查看原文本刊更多论文

G-Scalar:高效能gpu的高效通用标量执行架构

GPU通过将更多的执行资源集成到单个芯片中而不会过度影响功率效率，从而提供更高的吞吐量。然而，面对功率墙的挑战，提高吞吐量将需要显著提高功率效率。为了实现这一目标，本文提出了一种具有成本效益的gpu通用标量执行架构G-Scalar。与只支持非发散算术/逻辑指令的标量执行的先前体系结构相比，G-Scalar提供了两个关键优势。首先，G-Scalar更节能，因为它还可以支持发散和特殊功能指令的标量执行，这在当代GPU应用程序中的比例显着增加。其次，G-Scalar的成本较低，因为它可以与寄存器值压缩共享大部分硬件资源，它的采用得到了大力推广，以减少访问大型寄存器文件的高功耗。与基线和以前的标量架构相比，G-Scalar的功耗效率分别提高了24%和15%，而成本可以忽略不计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

自引率

0.00%

发文量