Comprehensive throughput evaluation of LANs in clusters of PCs with Switchbench - or how to bring your switch to its knees

IEEE International. 2005 Proceedings of the IEEE Workload Characterization Symposium, 2005. Pub Date : 2005-11-07 DOI:10.1109/IISWC.2005.1526012

F. Rauch

{"title":"Comprehensive throughput evaluation of LANs in clusters of PCs with Switchbench - or how to bring your switch to its knees","authors":"F. Rauch","doi":"10.1109/IISWC.2005.1526012","DOIUrl":null,"url":null,"abstract":"Understanding the performance of parallel applications for prevalent clusters of commodity PCs is still not an easy task: One must understand performance characteristics of all subsystems in the cluster machine, besides the inherently required knowledge about the applications' behaviour. While there are already many benchmarks that characterise a single node's subsystems like CPU, memory and I/O. as well as a few to evaluate its network interface with point-to-point data streams, there are to the best of our knowledge no benchmarks available that characterise a cluster network or LAN as a whole. We present Switchbench (2005), a set of three microbenchmarks that thoroughly evaluate the throughput characteristics of networks for clusters. A first microbenchmark tests the basic processing limitations of the switches, by sending and receiving data concurrently at maximum throughputs on all network interfaces. A second microbenchmark tests arbitrary communication patterns by pairwise connecting nodes for high-speed throughput tests. A third and slightly more realistic microbenchmark executes an all-to-all personalised communication (AAPC) algorithm to test many different patterns and critical bisections in the network. The microbenchmarks already proved to be extremely useful in a previous study to experimentally quantify performance limitations in different networks of clusters of PCs with up to 128 nodes. We also establish the suitability of our microbenchmarks by comparing their results with two application benchmarks. The benchmarks consist of two C programs supported by shell scripts to start the programs on all nodes of the cluster with the correct execution parameters to automatically scale the workloads from a few nodes up to the full cluster size.","PeriodicalId":275514,"journal":{"name":"IEEE International. 2005 Proceedings of the IEEE Workload Characterization Symposium, 2005.","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE International. 2005 Proceedings of the IEEE Workload Characterization Symposium, 2005.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISWC.2005.1526012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Understanding the performance of parallel applications for prevalent clusters of commodity PCs is still not an easy task: One must understand performance characteristics of all subsystems in the cluster machine, besides the inherently required knowledge about the applications' behaviour. While there are already many benchmarks that characterise a single node's subsystems like CPU, memory and I/O. as well as a few to evaluate its network interface with point-to-point data streams, there are to the best of our knowledge no benchmarks available that characterise a cluster network or LAN as a whole. We present Switchbench (2005), a set of three microbenchmarks that thoroughly evaluate the throughput characteristics of networks for clusters. A first microbenchmark tests the basic processing limitations of the switches, by sending and receiving data concurrently at maximum throughputs on all network interfaces. A second microbenchmark tests arbitrary communication patterns by pairwise connecting nodes for high-speed throughput tests. A third and slightly more realistic microbenchmark executes an all-to-all personalised communication (AAPC) algorithm to test many different patterns and critical bisections in the network. The microbenchmarks already proved to be extremely useful in a previous study to experimentally quantify performance limitations in different networks of clusters of PCs with up to 128 nodes. We also establish the suitability of our microbenchmarks by comparing their results with two application benchmarks. The benchmarks consist of two C programs supported by shell scripts to start the programs on all nodes of the cluster with the correct execution parameters to automatically scale the workloads from a few nodes up to the full cluster size.

查看原文本刊更多论文

综合吞吐量评估的局域网集群的pc与交换机-或如何使您的交换机膝盖

了解通用商用pc集群的并行应用程序的性能仍然不是一件容易的事情:除了固有的应用程序行为知识之外，还必须了解集群机器中所有子系统的性能特征。虽然已经有许多基准测试来描述单个节点的子系统，如CPU、内存和I/O。以及一些使用点对点数据流来评估其网络接口的基准，据我们所知，没有可用的基准来描述集群网络或LAN作为一个整体的特征。我们提出了Switchbench(2005)，这是一组三个微基准，可以彻底评估集群网络的吞吐量特征。第一个微基准测试通过在所有网络接口上以最大吞吐量并发发送和接收数据来测试交换机的基本处理限制。第二个微基准通过成对连接节点来测试任意通信模式，以进行高速吞吐量测试。第三个更现实的微基准执行全对全个性化通信(AAPC)算法，以测试网络中的许多不同模式和关键平分。在之前的一项研究中，微基准测试已经被证明是非常有用的，它可以实验性地量化不同网络中多达128个节点的pc集群的性能限制。我们还通过将微基准测试的结果与两个应用程序基准测试进行比较来确定微基准测试的适用性。基准测试由shell脚本支持的两个C程序组成，它们使用正确的执行参数在集群的所有节点上启动程序，从而自动将工作负载从几个节点扩展到整个集群大小。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE International. 2005 Proceedings of the IEEE Workload Characterization Symposium, 2005.

自引率

0.00%

发文量