Power optimizations for transport triggered SIMD processors

Joonas Multanen, T. Viitanen, Henry Linjamaki, Heikki O. Kultala, P. Jääskeläinen, J. Takala, L. Koskinen, Jesse Simonsson, H. Berg, K. Raiskila, Tommi Zetterman
{"title":"Power optimizations for transport triggered SIMD processors","authors":"Joonas Multanen, T. Viitanen, Henry Linjamaki, Heikki O. Kultala, P. Jääskeläinen, J. Takala, L. Koskinen, Jesse Simonsson, H. Berg, K. Raiskila, Tommi Zetterman","doi":"10.1109/SAMOS.2015.7363689","DOIUrl":null,"url":null,"abstract":"Power consumption in modern processor design is a key aspect. Optimizing the processor for power leads to direct savings in battery energy consumption in case of mobile devices. At the same time, many mobile applications demand high computational performance. In case of large scale computing, low power compute devices help in thermal design and in reducing the electricity bill. This paper presents a case study of a customized low power vector processor design that was synthesized on a 28 nm process technology. The processor has a programmer exposed datapath based on the transport triggered architecture programming model. The paper's focus is on the RTL and microarchitecture level power optimizations applied to the design. Using semiautomated interconnection network and register file optimization algorithm, up to 27% of power savings were achieved. Using this as a baseline and applying register file datapath gating, register file banking and enabling clock gating of individual pipeline stages in pipelined function units, up to 26% of power and energy savings could be achieved with only a 3% area overhead. On top of this, for the measured radio applications, the exposed datapath architecture helped to achieve approximately 18% power improvement in comparison to a VLIW-like architecture by utilizing optimizations unique to transport triggered architectures.","PeriodicalId":346802,"journal":{"name":"2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SAMOS.2015.7363689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Power consumption in modern processor design is a key aspect. Optimizing the processor for power leads to direct savings in battery energy consumption in case of mobile devices. At the same time, many mobile applications demand high computational performance. In case of large scale computing, low power compute devices help in thermal design and in reducing the electricity bill. This paper presents a case study of a customized low power vector processor design that was synthesized on a 28 nm process technology. The processor has a programmer exposed datapath based on the transport triggered architecture programming model. The paper's focus is on the RTL and microarchitecture level power optimizations applied to the design. Using semiautomated interconnection network and register file optimization algorithm, up to 27% of power savings were achieved. Using this as a baseline and applying register file datapath gating, register file banking and enabling clock gating of individual pipeline stages in pipelined function units, up to 26% of power and energy savings could be achieved with only a 3% area overhead. On top of this, for the measured radio applications, the exposed datapath architecture helped to achieve approximately 18% power improvement in comparison to a VLIW-like architecture by utilizing optimizations unique to transport triggered architectures.
传输触发SIMD处理器的电源优化
功耗是现代处理器设计中的一个关键方面。在移动设备的情况下,优化处理器的功率可以直接节省电池的能耗。同时,许多移动应用对计算性能的要求也很高。在大规模计算的情况下,低功耗计算设备有助于热设计和减少电费。本文介绍了一个基于28纳米工艺技术合成的定制低功耗矢量处理器设计的案例研究。处理器有一个程序员公开的基于传输触发架构编程模型的数据路径。本文的重点是RTL和微架构级的功率优化应用于设计。采用半自动互连网络和注册文件优化算法,可节省高达27%的电力。以此为基准,在流水线功能单元中应用寄存器文件数据路径门控、寄存器文件银行和启用单个流水线阶段的时钟门控,可以实现高达26%的电力和能源节约,而面积开销仅为3%。最重要的是,对于测量的无线电应用程序,通过利用传输触发体系结构特有的优化,与类似vliw的体系结构相比,公开的数据路径体系结构帮助实现了大约18%的功率改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信