ALCF MPI Benchmarks: Understanding Machine-Specific Communication Behavior
V. Morozov, Jiayuan Meng, V. Vishwanath, J. Hammond, Kalyan Kumaran, M. Papka
2012 41st International Conference on Parallel Processing Workshops, September 10, 2012
DOI: 10.1109/ICPPW.2012.7
Citations: 12
Abstract
As systems grow larger and computation is spread across more nodes, efficient data communication becomes increasingly important for achieving high throughput and low power consumption in high performance computing systems. However, communication efficacy depends not only on application-specific communication patterns, but also on machine-specific communication subsystems, node architectures, and even the runtime communication libraries. In fact, different hardware systems lead to different tradeoffs with respect to communication mechanisms, which can affect the choice of application implementation. We present a set of MPI-based benchmarks to better understand the communication behavior of hardware systems and guide the performance tuning of scientific applications. We further apply these benchmarks to three clusters and present several interesting lessons from our experience.
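The paper's benchmark suite is not reproduced here, but a minimal MPI ping-pong sketch illustrates the kind of machine-specific point-to-point measurement it describes; the message sizes, iteration count, and rank pairing below are illustrative assumptions, not the authors' code.

```c
/* Minimal MPI ping-pong sketch: measures point-to-point latency and
 * bandwidth between ranks 0 and 1 for a range of message sizes.
 * Illustrative only; not the ALCF benchmark suite itself. */
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);
    if (size < 2) {
        if (rank == 0) fprintf(stderr, "need at least 2 ranks\n");
        MPI_Finalize();
        return 1;
    }

    const int iters = 1000;  /* round trips per message size (assumed) */
    for (int bytes = 8; bytes <= (1 << 20); bytes *= 4) {
        char *buf = malloc(bytes);
        MPI_Barrier(MPI_COMM_WORLD);
        double t0 = MPI_Wtime();
        for (int i = 0; i < iters; i++) {
            if (rank == 0) {
                MPI_Send(buf, bytes, MPI_CHAR, 1, 0, MPI_COMM_WORLD);
                MPI_Recv(buf, bytes, MPI_CHAR, 1, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            } else if (rank == 1) {
                MPI_Recv(buf, bytes, MPI_CHAR, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
                MPI_Send(buf, bytes, MPI_CHAR, 0, 0, MPI_COMM_WORLD);
            }
        }
        double t1 = MPI_Wtime();
        if (rank == 0) {
            /* half the round-trip time approximates one-way latency */
            double one_way_us = (t1 - t0) / (2.0 * iters) * 1e6;
            double bw_mb_s = bytes / (one_way_us * 1e-6) / 1e6;
            printf("%8d bytes  %10.2f us  %10.2f MB/s\n", bytes, one_way_us, bw_mb_s);
        }
        free(buf);
    }

    MPI_Finalize();
    return 0;
}
```

Running two ranks on the same node versus on separate nodes (e.g., `mpirun -n 2 ./pingpong` with an appropriate host placement) exposes the machine-specific differences in intra-node and inter-node communication that benchmarks of this kind are meant to characterize.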