Characterizing the impact of different memory-intensity levels

R. Kotla, A. Devgan, S. Ghiasi, T. Keller, F. Rawson
{"title":"Characterizing the impact of different memory-intensity levels","authors":"R. Kotla, A. Devgan, S. Ghiasi, T. Keller, F. Rawson","doi":"10.1109/WWC.2004.1437388","DOIUrl":null,"url":null,"abstract":"Applications on today's high-end processors typically make varying load demands over time. A single application may have many different phases during its lifetime, and workload mixes show interleaved phases. This work examines and uses the differences between memory- and CPU-intensive phases to reduce power. Today's processors provide resources that are underutilized during memory-intensive phases, consuming power while producing little incremental gain in performance. This work examines a deployed system consisting of identical cores with a goal of running each one at a different effective frequency. The initial goal is to find the appropriate frequency at which to run each phase. This paper demonstrates that memory intensity directly affects the throughput of applications. The results indicate that simple metrics such as IPC (instructions per cycle) cannot be used to determine what frequency to run a phase. Instead, it uses performance counters which directly monitor memory behavior to identify. Memory-intensive phases can then be run on a slower core without incurring significant performance penalties. The key result of the paper is the introduction of a very simple, online model that uses the performance counter data to predict the performance of a program phase at any particular frequency setting. The information from this model allows a scheduler to decide which core to use to execute the program phase. Using a sophisticated power model for the processor family shows that this approach significantly reduces power consumption. The model was evaluated using a subset of SPECCPU and the SPECjbb and TPC-W benchmarks. It predicts performance with an average error of less than 10%. The power modeling shows that memory-intensive benchmarks achieve up to-a 58%, power reduction at a performance loss of less than 20% when run at 80% of nominal frequency.","PeriodicalId":240633,"journal":{"name":"IEEE International Workshop on Workload Characterization, 2004. WWC-7. 2004","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"44","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE International Workshop on Workload Characterization, 2004. WWC-7. 2004","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WWC.2004.1437388","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 44

Abstract

Applications on today's high-end processors typically make varying load demands over time. A single application may have many different phases during its lifetime, and workload mixes show interleaved phases. This work examines and uses the differences between memory- and CPU-intensive phases to reduce power. Today's processors provide resources that are underutilized during memory-intensive phases, consuming power while producing little incremental gain in performance. This work examines a deployed system consisting of identical cores with a goal of running each one at a different effective frequency. The initial goal is to find the appropriate frequency at which to run each phase. This paper demonstrates that memory intensity directly affects the throughput of applications. The results indicate that simple metrics such as IPC (instructions per cycle) cannot be used to determine what frequency to run a phase. Instead, it uses performance counters which directly monitor memory behavior to identify. Memory-intensive phases can then be run on a slower core without incurring significant performance penalties. The key result of the paper is the introduction of a very simple, online model that uses the performance counter data to predict the performance of a program phase at any particular frequency setting. The information from this model allows a scheduler to decide which core to use to execute the program phase. Using a sophisticated power model for the processor family shows that this approach significantly reduces power consumption. The model was evaluated using a subset of SPECCPU and the SPECjbb and TPC-W benchmarks. It predicts performance with an average error of less than 10%. The power modeling shows that memory-intensive benchmarks achieve up to-a 58%, power reduction at a performance loss of less than 20% when run at 80% of nominal frequency.
描述不同内存强度水平的影响
当今高端处理器上的应用程序通常会随着时间的推移产生不同的负载需求。单个应用程序在其生命周期中可能有许多不同的阶段,并且工作负载混合显示交错的阶段。这项工作检查并使用内存密集型阶段和cpu密集型阶段之间的差异来降低功耗。目前的处理器提供的资源在内存密集型阶段未得到充分利用,消耗大量电力,而在性能方面几乎没有增加。这项工作检查了一个由相同核心组成的部署系统,目标是以不同的有效频率运行每个核心。最初的目标是找到运行每个阶段的适当频率。本文论证了内存强度直接影响应用程序的吞吐量。结果表明,像IPC(每周期指令数)这样的简单指标不能用于确定运行阶段的频率。相反,它使用直接监视内存行为的性能计数器来识别。然后,内存密集型阶段可以在较慢的核心上运行,而不会产生明显的性能损失。本文的关键成果是引入了一个非常简单的在线模型,该模型使用性能计数器数据来预测任何特定频率设置下程序相位的性能。来自该模型的信息允许调度器决定使用哪个核心来执行程序阶段。为处理器系列使用复杂的功耗模型表明,这种方法可以显著降低功耗。该模型使用SPECCPU和SPECjbb和TPC-W基准的一个子集进行评估。它预测性能的平均误差小于10%。功耗建模显示,当以80%的标称频率运行时,内存密集型基准测试可实现高达58%的功耗降低,而性能损失不到20%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信