PowerSpector: Towards Energy Efficiency with Calling-Context-Aware Profiling

2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS) Pub Date : 2022-05-01 DOI:10.1109/ipdps53621.2022.00126

Xin You, Hailong Yang, Zhibo Xuan, Zhongzhi Luan, D. Qian

{"title":"PowerSpector: Towards Energy Efficiency with Calling-Context-Aware Profiling","authors":"Xin You, Hailong Yang, Zhibo Xuan, Zhongzhi Luan, D. Qian","doi":"10.1109/ipdps53621.2022.00126","DOIUrl":null,"url":null,"abstract":"Energy efficiency has become one of the major concerns in high-performance computing systems towards exascale. On mainstream systems, dynamic voltage and frequency scaling (DVFS) and uncore frequency scaling (UFS) are two popular techniques to trade-off performance and power consumption to achieve better energy efficiency. However, the existing system software is oblivious to application characteristics and thus misses the opportunity for fine-grained power management. Meanwhile, manually instrumenting applications with power management codes are prohibitive due to heavy engineering efforts and thus hardly portable across platforms. In this paper, we propose Powerspector, a fine-grained code profiling and optimization tool with calling context awareness to automatically explore the opportunity for optimizing energy efficiency. The design of Powerspector consists of three phases, including significant region detection, performance profiling and power modeling, and frequency optimization. The first phase automatically identifies the profitable regions for frequency optimization. Then, the second phase guides the core/uncore frequency optimization with power models. The third phase injects frequency optimization codes targeting each significant code region across different calling contexts automatically. The experiment results demonstrate that Powerspector can achieve 1.13×(1.00×), 1.28×(1.09×), and 1.17×(1.06×) improvement on energy efficiency compared to static(region-based) tuning on Haswell, Broadwell, and Skylake platforms, respectively.","PeriodicalId":321801,"journal":{"name":"2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ipdps53621.2022.00126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Energy efficiency has become one of the major concerns in high-performance computing systems towards exascale. On mainstream systems, dynamic voltage and frequency scaling (DVFS) and uncore frequency scaling (UFS) are two popular techniques to trade-off performance and power consumption to achieve better energy efficiency. However, the existing system software is oblivious to application characteristics and thus misses the opportunity for fine-grained power management. Meanwhile, manually instrumenting applications with power management codes are prohibitive due to heavy engineering efforts and thus hardly portable across platforms. In this paper, we propose Powerspector, a fine-grained code profiling and optimization tool with calling context awareness to automatically explore the opportunity for optimizing energy efficiency. The design of Powerspector consists of three phases, including significant region detection, performance profiling and power modeling, and frequency optimization. The first phase automatically identifies the profitable regions for frequency optimization. Then, the second phase guides the core/uncore frequency optimization with power models. The third phase injects frequency optimization codes targeting each significant code region across different calling contexts automatically. The experiment results demonstrate that Powerspector can achieve 1.13×(1.00×), 1.28×(1.09×), and 1.17×(1.06×) improvement on energy efficiency compared to static(region-based) tuning on Haswell, Broadwell, and Skylake platforms, respectively.

查看原文本刊更多论文

PowerSpector:通过调用上下文感知分析实现能源效率

能源效率已经成为高性能计算系统迈向百亿亿次的主要关注点之一。在主流系统中，动态电压和频率缩放(DVFS)和非核心频率缩放(UFS)是两种流行的技术，可以权衡性能和功耗以实现更好的能源效率。然而，现有的系统软件忽略了应用程序的特性，因此错过了进行细粒度电源管理的机会。同时，使用电源管理代码手动检测应用程序是禁止的，因为需要大量的工程工作，因此很难跨平台移植。在本文中，我们提出了Powerspector，一个细粒度的代码分析和优化工具，具有调用上下文感知功能，可以自动探索优化能源效率的机会。Powerspector的设计包括三个阶段，包括显著区域检测、性能分析和功率建模以及频率优化。第一阶段自动识别频率优化的有利区域。然后，第二阶段利用功率模型指导核心/非核心频率优化。第三阶段针对不同调用上下文中的每个重要码区自动注入频率优化代码。实验结果表明，与Haswell、Broadwell和Skylake平台上的静态(基于区域的)调谐相比，Powerspector的能效分别提高了1.13倍(1.00倍)、1.28倍(1.09倍)和1.17倍(1.06倍)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

自引率

0.00%

发文量