Minimizing Network Contention in InfiniBand Clusters with a QoS-Aware Data-Staging Framework

2012 IEEE International Conference on Cluster Computing Pub Date : 2012-09-24 DOI:10.1109/CLUSTER.2012.90

R. Rajachandrasekar, Jai Jaswani, H. Subramoni, D. Panda

{"title":"Minimizing Network Contention in InfiniBand Clusters with a QoS-Aware Data-Staging Framework","authors":"R. Rajachandrasekar, Jai Jaswani, H. Subramoni, D. Panda","doi":"10.1109/CLUSTER.2012.90","DOIUrl":null,"url":null,"abstract":"The rapid growth of supercomputing systems, both in scale and complexity, has been accompanied by degradation in system efficiencies. The sheer abundance of resources including millions of cores, vast amounts of physical memory and high-bandwidth networks are heavily under-utilized. This happens when the resources are time-shared amongst parallel applications that are scheduled to run on a subset of compute nodes in an exclusive manner. Several space-sharing techniques that have been proposed in the literature allow parallel applications to be co-located on compute nodes and share resources with each other. Although this leads to better system efficiencies, it also causes contention for system resources. In this work, we specifically address the problem of network contention, caused due to the sharing of network resources by parallel applications and file systems simultaneously. We leverage the Quality-of-Service (QoS) capabilities of the widely used Infini Band interconnect to enhance our data-staging file system, making it QoS-aware. This is a user-level framework that is agnostic of the file system and MPI implementation. Using this file system, we demonstrate the isolation of file system traffic from MPI communication traffic, thereby reducing the network contention. Experimental results show that MPI point-to-point latency can be reduced by up to 320 microseconds, and the bandwidth improved by up to 674MB/s in the presence of contention with I/O traffic. Furthermore, we were able to reduce the runtime of the AWP-ODC MPI application by about 9.89% in the presence of network contention, and also reduce the time spent in communication by the NAS CG kernel by 23.46%.","PeriodicalId":143579,"journal":{"name":"2012 IEEE International Conference on Cluster Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Cluster Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLUSTER.2012.90","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

Abstract

The rapid growth of supercomputing systems, both in scale and complexity, has been accompanied by degradation in system efficiencies. The sheer abundance of resources including millions of cores, vast amounts of physical memory and high-bandwidth networks are heavily under-utilized. This happens when the resources are time-shared amongst parallel applications that are scheduled to run on a subset of compute nodes in an exclusive manner. Several space-sharing techniques that have been proposed in the literature allow parallel applications to be co-located on compute nodes and share resources with each other. Although this leads to better system efficiencies, it also causes contention for system resources. In this work, we specifically address the problem of network contention, caused due to the sharing of network resources by parallel applications and file systems simultaneously. We leverage the Quality-of-Service (QoS) capabilities of the widely used Infini Band interconnect to enhance our data-staging file system, making it QoS-aware. This is a user-level framework that is agnostic of the file system and MPI implementation. Using this file system, we demonstrate the isolation of file system traffic from MPI communication traffic, thereby reducing the network contention. Experimental results show that MPI point-to-point latency can be reduced by up to 320 microseconds, and the bandwidth improved by up to 674MB/s in the presence of contention with I/O traffic. Furthermore, we were able to reduce the runtime of the AWP-ODC MPI application by about 9.89% in the presence of network contention, and also reduce the time spent in communication by the NAS CG kernel by 23.46%.

查看原文本刊更多论文

基于qos感知的数据分级框架最小化ib集群中的网络争用

超级计算系统在规模和复杂性上的快速增长，伴随着系统效率的下降。大量的资源，包括数以百万计的核心、大量的物理内存和高带宽网络，都没有得到充分利用。当资源在并行应用程序之间分时共享时，就会发生这种情况，这些并行应用程序计划以排他性的方式在计算节点的子集上运行。文献中提出的几种空间共享技术允许并行应用程序位于计算节点上并彼此共享资源。尽管这可以提高系统效率，但也会引起系统资源的争用。在这项工作中，我们专门解决了由于并行应用程序和文件系统同时共享网络资源而引起的网络争用问题。我们利用广泛使用的Infini Band互连的服务质量(QoS)功能来增强我们的数据分级文件系统，使其具有QoS意识。这是一个用户级框架，与文件系统和MPI实现无关。使用这个文件系统，我们演示了文件系统流量与MPI通信流量的隔离，从而减少了网络争用。实验结果表明，在存在I/O争用的情况下，MPI点对点延迟最多可减少320微秒，带宽最多可提高674MB/s。此外，在存在网络争用的情况下，我们能够将AWP-ODC MPI应用程序的运行时减少约9.89%，并将NAS CG内核的通信时间减少23.46%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 IEEE International Conference on Cluster Computing

自引率

0.00%

发文量