Multiplexing Endpoints of HCA for Scaling MPI Applications: Design and Performance Evaluation with uDAPL

2010 IEEE International Conference on Cluster Computing. Pub Date: 2010-09-20. DOI: 10.1109/CLUSTER.2010.22

Jasjit Singh, Yogeshwar Sonawane


Citations: 2

Abstract

With the ever-increasing demand for computing power, the number of nodes deployed in cluster-based supercomputers keeps growing. An MPI-based parallel application sets up a reliable connection between every pair of processes, using endpoints, before any communication takes place. Its scalability is therefore limited by scarce hardware resources such as the Endpoints (equivalent to Queue Pairs) on the Host Channel Adapter (HCA) of a high-speed interconnect. In this paper, we propose a novel approach of multiplexing hardware endpoints (hweps) to extend scalability. (a) We discuss the critical design issues of the multiplexing technique, which differentiates a hwep from its software counterpart (swep) and enables multiple sweps to share one hwep. (b) We introduce the concept of a Virtual Identifier (VID), which ensures that the connection between hardware endpoints is strictly one-to-one. (c) We also present a static mapping scheme that offsets the overheads incurred by multiplexing. The User Direct Access Programming Library (uDAPL) defines a single set of APIs for all RDMA-capable transports. We have incorporated the proposed multiplexing technique into a uDAPL implementation. Using this approach, we are able to scale MPI applications beyond the limit imposed by the HCA with no visible performance degradation.
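To make the idea of several sweps sharing one hwep more concrete, the following is a minimal sketch in Python. It is not the authors' design: the abstract does not say how the static mapping is computed or how a VID is assigned and used. The modulo mapping rule, the choice of the peer rank as the VID, the use of the VID as a demultiplexing tag, and the names `Multiplexer`, `Hwep`, `Swep`, `connect`, and `deliver` are all assumptions made purely for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Swep:
    """Software endpoint: one per remote peer (hypothetical model)."""
    peer_rank: int
    vid: int          # assumed: identifier unique among sweps on this node
    hwep_index: int   # the shared hardware endpoint this swep rides on
    inbox: list = field(default_factory=list)


@dataclass
class Hwep:
    """Hardware endpoint shared by several sweps (hypothetical model)."""
    index: int
    sweps: dict = field(default_factory=dict)  # vid -> Swep


class Multiplexer:
    """Toy multiplexer: many sweps over a fixed pool of hweps."""

    def __init__(self, num_hweps):
        self.hweps = [Hwep(i) for i in range(num_hweps)]

    def connect(self, peer_rank):
        # Assumed static mapping: the hwep is fixed by the peer rank,
        # so no lookup or negotiation is needed on the send path.
        hwep = self.hweps[peer_rank % len(self.hweps)]
        vid = peer_rank  # assumption: the peer rank doubles as the VID
        swep = Swep(peer_rank=peer_rank, vid=vid, hwep_index=hwep.index)
        hwep.sweps[vid] = swep
        return swep

    def deliver(self, hwep_index, vid, payload):
        # Demultiplex an arriving message to the swep that owns the VID.
        swep = self.hweps[hwep_index].sweps[vid]
        swep.inbox.append(payload)
        return swep


if __name__ == "__main__":
    mux = Multiplexer(num_hweps=4)       # pretend the HCA offers 4 hweps
    sweps = [mux.connect(rank) for rank in range(10)]  # 10 peers
    mux.deliver(sweps[5].hwep_index, sweps[5].vid, "hello")
    print(sweps[5].hwep_index, sweps[5].inbox)
```

In this toy model ten peer connections are carried over four hardware endpoints, which is the kind of scaling beyond the HCA limit that the abstract describes. A real implementation would sit inside the uDAPL provider and would also have to handle connection setup, completion handling, and flow control, none of which is modeled here.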

