A Simulation Framework to Automatically Analyze the Communication-Computation Overlap in Scientific Applications

2010 IEEE International Conference on Cluster Computing Pub Date : 2010-09-20 DOI:10.1109/CLUSTER.2010.33

V. Subotic, J. Sancho, J. Labarta, M. Valero

{"title":"A Simulation Framework to Automatically Analyze the Communication-Computation Overlap in Scientific Applications","authors":"V. Subotic, J. Sancho, J. Labarta, M. Valero","doi":"10.1109/CLUSTER.2010.33","DOIUrl":null,"url":null,"abstract":"Overlapping communication and computation has been devised as an attractive technique to alleviate the huge application's network requirements at large scale. Overlapping will allow to fully or partially hide the long communication delays suffered when transferring messages through the network. This will relax the application's network requirements, and hence allow to deploy more cost-effective network designs. However, today's scientific applications make little use of overlapping. In addition, there is no support to analyze how overlap could impact the performance of real scientific applications. In this paper we address this issue by presenting a simulation framework to automatically analyze the benefits of communication-computation overlap. The simulation framework consists of a binary translation tool (Valgrind), a distributed machine simulator (Dimemas), and a visualization tool (Paraver). Valgrind instruments the legacy MPI application and generates the execution traces, then Dimemas uses the obtained traces and reconstructs the application's time-behavior on a configurable parallel platform, and finally Paraver visualizes the obtained time-behaviors. Our simulation methodology brings two new features into the study of overlap: 1) automatic simulation of the overlapped execution - as there is no need for code restructuring in applications; and 2) visualization of simulated time behaviors, that further allows useful comparisons of the non-overlapped and the overlapped executions.","PeriodicalId":152171,"journal":{"name":"2010 IEEE International Conference on Cluster Computing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Cluster Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLUSTER.2010.33","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Overlapping communication and computation has been devised as an attractive technique to alleviate the huge application's network requirements at large scale. Overlapping will allow to fully or partially hide the long communication delays suffered when transferring messages through the network. This will relax the application's network requirements, and hence allow to deploy more cost-effective network designs. However, today's scientific applications make little use of overlapping. In addition, there is no support to analyze how overlap could impact the performance of real scientific applications. In this paper we address this issue by presenting a simulation framework to automatically analyze the benefits of communication-computation overlap. The simulation framework consists of a binary translation tool (Valgrind), a distributed machine simulator (Dimemas), and a visualization tool (Paraver). Valgrind instruments the legacy MPI application and generates the execution traces, then Dimemas uses the obtained traces and reconstructs the application's time-behavior on a configurable parallel platform, and finally Paraver visualizes the obtained time-behaviors. Our simulation methodology brings two new features into the study of overlap: 1) automatic simulation of the overlapped execution - as there is no need for code restructuring in applications; and 2) visualization of simulated time behaviors, that further allows useful comparisons of the non-overlapped and the overlapped executions.

查看原文本刊更多论文

科学应用中通信-计算重叠自动分析的仿真框架

重叠通信和计算是一种有吸引力的技术，可以缓解大规模应用程序对网络的需求。重叠将允许完全或部分隐藏在通过网络传输消息时遭受的长时间通信延迟。这将放宽应用程序的网络要求，从而允许部署更具成本效益的网络设计。然而，今天的科学应用很少使用重叠。此外，没有支持分析重叠如何影响真正的科学应用程序的性能。在本文中，我们通过提出一个仿真框架来自动分析通信-计算重叠的好处来解决这个问题。仿真框架由二进制翻译工具(Valgrind)、分布式机器模拟器(Dimemas)和可视化工具(Paraver)组成。Valgrind检测遗留的MPI应用程序并生成执行轨迹，然后Dimemas使用获得的轨迹并在可配置的并行平台上重建应用程序的时间行为，最后Paraver将获得的时间行为可视化。我们的模拟方法为重叠研究带来了两个新特性:1)自动模拟重叠执行-因为不需要在应用程序中重构代码;2)模拟时间行为的可视化，这进一步允许对非重叠和重叠执行进行有用的比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 IEEE International Conference on Cluster Computing

自引率

0.00%

发文量