Evaluation of Energy Characteristics of MPI Communication Primitives with RAPL

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum Pub Date : 2013-05-20 DOI:10.1109/IPDPSW.2013.243

Akshay Venkatesh, K. Kandalla, D. Panda

{"title":"Evaluation of Energy Characteristics of MPI Communication Primitives with RAPL","authors":"Akshay Venkatesh, K. Kandalla, D. Panda","doi":"10.1109/IPDPSW.2013.243","DOIUrl":null,"url":null,"abstract":"The energy consumed by modern supercomputing systems continues to grow at an alarming rate. The Message Passing Interface (MPI) has been the de facto programming model for parallel applications and MPI libraries have been designed to achieve the best communication performance on modern architectures. However, the performance and energy trade-offs of these designs have not been studied. Hence, it is critical to understand the energy consumption characteristics of MPI routines and the performance-energy trade-offs of various protocols and designs that are used in MPI libraries. The first hurdle in achieving this objective is to design a framework that can be used to measure energy consumption of various components during communication operations. The RAPL interface allows users to measure energy across various domains on the Intel Sandy-Bridge processor, in a low-overhead, non-intrusive manner. However, this interface has certain limitations and cannot be directly used to measure energy profiles of MPI operations in a fine-grained manner. In this paper, we propose a novel methodology to address these limitations. We propose a new shared-memory window-based solution to accurately measure the aggregate energy consumed by all processes engaged in MPI operations. Using our proposed framework, we demonstrate the impact of various communication protocols and progress mechanisms on the energy consumption. Our evaluations demonstrate that the kernel-based solutions can potentially lead to lower energy consumption for intra-node communication operations. Further, our framework also reveals possible energy bottlenecks in scaling important collective operations, such as, MPI All reduce. In addition, we also use our proposed framework to study the energy consumption characteristics of MPI calls in the NAS-IS benchmark and we infer that the choice of progress mechanism can lead to about 6% energy savings for the processors.","PeriodicalId":234552,"journal":{"name":"2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum","volume":"2 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW.2013.243","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

Abstract

The energy consumed by modern supercomputing systems continues to grow at an alarming rate. The Message Passing Interface (MPI) has been the de facto programming model for parallel applications and MPI libraries have been designed to achieve the best communication performance on modern architectures. However, the performance and energy trade-offs of these designs have not been studied. Hence, it is critical to understand the energy consumption characteristics of MPI routines and the performance-energy trade-offs of various protocols and designs that are used in MPI libraries. The first hurdle in achieving this objective is to design a framework that can be used to measure energy consumption of various components during communication operations. The RAPL interface allows users to measure energy across various domains on the Intel Sandy-Bridge processor, in a low-overhead, non-intrusive manner. However, this interface has certain limitations and cannot be directly used to measure energy profiles of MPI operations in a fine-grained manner. In this paper, we propose a novel methodology to address these limitations. We propose a new shared-memory window-based solution to accurately measure the aggregate energy consumed by all processes engaged in MPI operations. Using our proposed framework, we demonstrate the impact of various communication protocols and progress mechanisms on the energy consumption. Our evaluations demonstrate that the kernel-based solutions can potentially lead to lower energy consumption for intra-node communication operations. Further, our framework also reveals possible energy bottlenecks in scaling important collective operations, such as, MPI All reduce. In addition, we also use our proposed framework to study the energy consumption characteristics of MPI calls in the NAS-IS benchmark and we infer that the choice of progress mechanism can lead to about 6% energy savings for the processors.

查看原文本刊更多论文

用RAPL评价MPI通信原语的能量特性

现代超级计算系统消耗的能量继续以惊人的速度增长。消息传递接口(Message Passing Interface, MPI)已经成为并行应用程序事实上的编程模型，MPI库的设计目的是在现代体系结构上实现最佳的通信性能。然而，这些设计的性能和能量权衡尚未得到研究。因此，了解MPI例程的能耗特征以及MPI库中使用的各种协议和设计的性能-能量权衡是至关重要的。实现这一目标的第一个障碍是设计一个框架，该框架可用于测量通信操作期间各种组件的能耗。RAPL接口允许用户以低开销、非侵入式的方式测量英特尔Sandy-Bridge处理器上不同域的能量。然而，该接口有一定的局限性，不能直接用于细粒度测量MPI操作的能量分布。在本文中，我们提出了一种新的方法来解决这些限制。我们提出了一种新的基于共享内存窗口的解决方案，以准确地测量参与MPI操作的所有进程消耗的总能量。使用我们提出的框架，我们展示了各种通信协议和进度机制对能耗的影响。我们的评估表明，基于内核的解决方案可以潜在地降低节点内通信操作的能耗。此外，我们的框架还揭示了在扩展重要的集体操作时可能存在的能源瓶颈，例如MPI全部减少。此外，我们还使用我们提出的框架研究了NAS-IS基准中MPI调用的能耗特征，我们推断出进程机制的选择可以使处理器节省约6%的能源。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum

自引率

0.00%

发文量