Interval-based models for run-time DVFS orchestration in superscalar processors

Proceedings of the 7th ACM international conference on Computing frontiers Pub Date : 2010-05-17 DOI:10.1145/1787275.1787338

G. Keramidas, Vasileios Spiliopoulos, S. Kaxiras

{"title":"Interval-based models for run-time DVFS orchestration in superscalar processors","authors":"G. Keramidas, Vasileios Spiliopoulos, S. Kaxiras","doi":"10.1145/1787275.1787338","DOIUrl":null,"url":null,"abstract":"We develop two simple interval-based models for dynamic superscalar processors. These models allow us to: i) predict with great accuracy performance and power consumption under various frequency and voltage combinations and ii) implement targeted DVFS policies at run-time. The models analyze program execution in intervals - steady-state and miss-event intervals. Intervals are signalled by miss events (L2-misses in our case) that upset the \"steady state\" execution of the program and are ended when the pipeline reaches again a steady state. The first model is fed by an approximation of the stall cycles (the time the processor instruction window is blocked) due to long-latency L2-misses. The second model improves on this approximation using as input the occupancy of the L2's miss-handling registers (MSHRs). Despite their simplicity these models prove to be accurate in predicting the performance (and energy) for any target frequency/voltage setting, yielding average errors of 2.1% and 0.2% respectively. Besides modelling, we show that the methodology we propose is powerful enough to implement (at run-time) various DVFS policies: \"operate at optimal EDP\" or \"ED2P,\" or even \"reduce ED2P within specific performance constraints.\" Approaches based on the two models require minimal hardware cost: two counters for measuring the duration of the steady state and the miss-event intervals and some comparison logic. To validate our methodology we use a cycle-accurate simulator and the benchmarks provided by the SPEC2K suite. Our results indicate that our proposed run-time mechanism is able to orchestrate different DVFS policies with great success yielding negligible errors - bellow 1.5% on average.","PeriodicalId":151791,"journal":{"name":"Proceedings of the 7th ACM international conference on Computing frontiers","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"93","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 7th ACM international conference on Computing frontiers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1787275.1787338","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 93

Abstract

We develop two simple interval-based models for dynamic superscalar processors. These models allow us to: i) predict with great accuracy performance and power consumption under various frequency and voltage combinations and ii) implement targeted DVFS policies at run-time. The models analyze program execution in intervals - steady-state and miss-event intervals. Intervals are signalled by miss events (L2-misses in our case) that upset the "steady state" execution of the program and are ended when the pipeline reaches again a steady state. The first model is fed by an approximation of the stall cycles (the time the processor instruction window is blocked) due to long-latency L2-misses. The second model improves on this approximation using as input the occupancy of the L2's miss-handling registers (MSHRs). Despite their simplicity these models prove to be accurate in predicting the performance (and energy) for any target frequency/voltage setting, yielding average errors of 2.1% and 0.2% respectively. Besides modelling, we show that the methodology we propose is powerful enough to implement (at run-time) various DVFS policies: "operate at optimal EDP" or "ED2P," or even "reduce ED2P within specific performance constraints." Approaches based on the two models require minimal hardware cost: two counters for measuring the duration of the steady state and the miss-event intervals and some comparison logic. To validate our methodology we use a cycle-accurate simulator and the benchmarks provided by the SPEC2K suite. Our results indicate that our proposed run-time mechanism is able to orchestrate different DVFS policies with great success yielding negligible errors - bellow 1.5% on average.

查看原文本刊更多论文

超标量处理器中运行时DVFS编排的基于间隔的模型

我们为动态超标量处理器开发了两个简单的基于区间的模型。这些模型使我们能够:i)在各种频率和电压组合下非常准确地预测性能和功耗，ii)在运行时实施有针对性的DVFS策略。该模型分析了程序执行的间隔-稳态和非事件间隔。间隔由miss事件(在我们的例子中是l2 -miss)表示，它打乱了程序的“稳定状态”执行，并在管道再次达到稳定状态时结束。第一个模型是由长延迟l2缺失导致的失速周期(处理器指令窗口被阻塞的时间)的近似值提供的。第二个模型在这个近似的基础上进行了改进，使用L2丢失处理寄存器(MSHRs)的占用作为输入。尽管这些模型简单，但在预测任何目标频率/电压设置的性能(和能量)方面都是准确的，平均误差分别为2.1%和0.2%。除了建模之外，我们还表明，我们提出的方法足够强大，可以在运行时实现各种DVFS策略:“以最佳EDP运行”或“ED2P”，甚至“在特定性能约束下减少ED2P”。基于这两种模型的方法需要最小的硬件成本:两个用于测量稳定状态持续时间和缺失事件间隔的计数器以及一些比较逻辑。为了验证我们的方法，我们使用了一个周期精确的模拟器和SPEC2K套件提供的基准测试。我们的结果表明，我们提出的运行时机制能够成功地编排不同的DVFS策略，误差可以忽略不计——平均误差低于1.5%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 7th ACM international conference on Computing frontiers

自引率

0.00%

发文量