An Overhead Analysis of MPI Profiling and Tracing Tools

Sascha Hunold, Jordy I. Ajanohoun, Ioannis Vardas, Jesper Larsson Träff
DOI: 10.1145/3526063.3535353
Published in: Proceedings of the 2nd Workshop on Performance EngineeRing, Modelling, Analysis, and VisualizatiOn Strategy, 2022-06-27

Abstract

MPI performance analysis tools are important instruments for finding performance bottlenecks in large-scale MPI applications. These tools commonly support either the profiling or the tracing of parallel applications. Depending on the type of analysis, the use of such a performance analysis tool may entail a significant runtime overhead on the monitored parallel application. However, overheads can occur in different stages of the performance analysis with varying severity; for example, the overhead of initializing an MPI context is typically less problematic than that of monitoring a large number of short-lived MPI function calls. In this work, we precisely define the different types of overheads that performance engineers may encounter when applying performance analysis tools. In the context of performance tuning, it is crucial to avoid delaying individual events (e.g., function calls) when monitoring MPI applications, as otherwise performance bottlenecks may not show up in the same spot as when running the applications without a performance analysis tool. We empirically examine the different types of overheads associated with popular performance analysis tools for a set of well-known proxy applications and categorize the tools according to our findings. Our study shows that although the investigated MPI profiling and tracing tools exhibit distinct overhead footprints, they hardly influence the net time of an MPI application, i.e., the time between the MPI_Init and MPI_Finalize calls. Performance engineers should be aware of all types of overheads associated with each tool to avoid very costly batch jobs.