用软件工作队列方法扩展HCA端点的工作队列:基于uDAPL的实验评估

Jasjit Singh, Eva Mishra, Yogeshwar Sonawane
{"title":"用软件工作队列方法扩展HCA端点的工作队列:基于uDAPL的实验评估","authors":"Jasjit Singh, Eva Mishra, Yogeshwar Sonawane","doi":"10.1109/CyberC.2011.66","DOIUrl":null,"url":null,"abstract":"With an ever increasing demand for computing power, number of nodes to be deployed in a high performance cluster is increasing. This has put stress on various hardware resources like endpoints (equivalent to Queue Pairs), memory etc. One of the factors limiting number of endpoints with a Host Channel Adaptor (HCA) is the amount of total descriptor space available in the form of on-board memory. For the same amount of descriptor space available, number of endpoints supported by an HCA is inversely proportional to the depth of work queue (WQ) for an endpoint (EP). Therefore to support more number of endpoints, depth of WQ has to be decreased. In this paper, we present an optimal approach of extending WQ using Software Work Queue (SWQ). SWQ is an extension to the work queue of EP on HCA. It provides a larger pool of descriptors (i.e. work requests) per EP to an application (e.g. uDAPL application, MPI library) than that available in the hardware. The advantage of this technique lies in the fact that more number of endpoints are supported by HCA without costing the depth of the WQ. Experimental evaluation suggests positive impact as requests in SWQ are given to hardware work queue (HWQ) as soon as it gets depleted.","PeriodicalId":227472,"journal":{"name":"2011 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Extending Work Queue of HCA Endpoint Using Software Work Queue Approach: Experimental Evaluation with uDAPL\",\"authors\":\"Jasjit Singh, Eva Mishra, Yogeshwar Sonawane\",\"doi\":\"10.1109/CyberC.2011.66\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With an ever increasing demand for computing power, number of nodes to be deployed in a high performance cluster is increasing. This has put stress on various hardware resources like endpoints (equivalent to Queue Pairs), memory etc. One of the factors limiting number of endpoints with a Host Channel Adaptor (HCA) is the amount of total descriptor space available in the form of on-board memory. For the same amount of descriptor space available, number of endpoints supported by an HCA is inversely proportional to the depth of work queue (WQ) for an endpoint (EP). Therefore to support more number of endpoints, depth of WQ has to be decreased. In this paper, we present an optimal approach of extending WQ using Software Work Queue (SWQ). SWQ is an extension to the work queue of EP on HCA. It provides a larger pool of descriptors (i.e. work requests) per EP to an application (e.g. uDAPL application, MPI library) than that available in the hardware. The advantage of this technique lies in the fact that more number of endpoints are supported by HCA without costing the depth of the WQ. Experimental evaluation suggests positive impact as requests in SWQ are given to hardware work queue (HWQ) as soon as it gets depleted.\",\"PeriodicalId\":227472,\"journal\":{\"name\":\"2011 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CyberC.2011.66\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CyberC.2011.66","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

随着对计算能力的需求不断增长,需要部署在高性能集群中的节点数量也在不断增加。这给端点(相当于队列对)、内存等各种硬件资源带来了压力。限制主机通道适配器(HCA)端点数量的因素之一是以板载内存形式提供的总描述符空间数量。对于相同数量的可用描述符空间,HCA支持的端点数量与端点(EP)的工作队列深度(WQ)成反比。因此,为了支持更多的端点,必须降低WQ的深度。本文提出了一种利用软件工作队列(SWQ)扩展WQ的最佳方法。SWQ是对HCA上EP工作队列的扩展。它为应用程序(例如uDAPL应用程序,MPI库)提供了比硬件中可用的更大的描述符池(即工作请求)。这种技术的优点在于HCA可以支持更多的端点,而不需要花费WQ的深度。实验评估表明,一旦硬件工作队列(HWQ)耗尽,SWQ中的请求就会被分配给硬件工作队列(HWQ),从而产生积极的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Extending Work Queue of HCA Endpoint Using Software Work Queue Approach: Experimental Evaluation with uDAPL
With an ever increasing demand for computing power, number of nodes to be deployed in a high performance cluster is increasing. This has put stress on various hardware resources like endpoints (equivalent to Queue Pairs), memory etc. One of the factors limiting number of endpoints with a Host Channel Adaptor (HCA) is the amount of total descriptor space available in the form of on-board memory. For the same amount of descriptor space available, number of endpoints supported by an HCA is inversely proportional to the depth of work queue (WQ) for an endpoint (EP). Therefore to support more number of endpoints, depth of WQ has to be decreased. In this paper, we present an optimal approach of extending WQ using Software Work Queue (SWQ). SWQ is an extension to the work queue of EP on HCA. It provides a larger pool of descriptors (i.e. work requests) per EP to an application (e.g. uDAPL application, MPI library) than that available in the hardware. The advantage of this technique lies in the fact that more number of endpoints are supported by HCA without costing the depth of the WQ. Experimental evaluation suggests positive impact as requests in SWQ are given to hardware work queue (HWQ) as soon as it gets depleted.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信