全对全操作中网络竞争效应的实验研究

2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE) Pub Date : 2018-10-01 DOI:10.1109/APEIE.2018.8546166

E. N. Peryshkova, M. Kurnosov

{"title":"全对全操作中网络竞争效应的实验研究","authors":"E. N. Peryshkova, M. Kurnosov","doi":"10.1109/APEIE.2018.8546166","DOIUrl":null,"url":null,"abstract":"Interconnection networks of modern highperformance distributed computer systems are now deep hierarchical. In such systems, communication time between processors depends on their replacement in a computer system. In large-scale NUMA/SMP computer clusters network switches form the first level of this hierarchy and the second level is represented by a shared memory of computer nodes. In this paper we present a benchmark for estimating the message passing time when MPI-processes share the interconnection network. We studied the dependence of the execution time of the All-to-all collective operation on the message size and the number of processes sharing the communication channel. Authors developed a software for predicting the completion time of the All-to-all operation depending on the nodes allocation determined by the Resources and Jobs Management System. The software uses the results of an experimental estimate of the performance degradation for the MPI_Send/MPI_Recv operations during simultaneous (concurrent) usage of the communication channel by a set of processes. In future, we will develop structurally oriented algorithms for the determined nodes allocation.","PeriodicalId":147830,"journal":{"name":"2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Experimental Study of Network Contention Effects on All-to-All Operation\",\"authors\":\"E. N. Peryshkova, M. Kurnosov\",\"doi\":\"10.1109/APEIE.2018.8546166\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Interconnection networks of modern highperformance distributed computer systems are now deep hierarchical. In such systems, communication time between processors depends on their replacement in a computer system. In large-scale NUMA/SMP computer clusters network switches form the first level of this hierarchy and the second level is represented by a shared memory of computer nodes. In this paper we present a benchmark for estimating the message passing time when MPI-processes share the interconnection network. We studied the dependence of the execution time of the All-to-all collective operation on the message size and the number of processes sharing the communication channel. Authors developed a software for predicting the completion time of the All-to-all operation depending on the nodes allocation determined by the Resources and Jobs Management System. The software uses the results of an experimental estimate of the performance degradation for the MPI_Send/MPI_Recv operations during simultaneous (concurrent) usage of the communication channel by a set of processes. In future, we will develop structurally oriented algorithms for the determined nodes allocation.\",\"PeriodicalId\":147830,\"journal\":{\"name\":\"2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE)\",\"volume\":\"81 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APEIE.2018.8546166\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APEIE.2018.8546166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

现代高性能分布式计算机系统的互连网络现在是深度分层的。在这样的系统中，处理器之间的通信时间取决于它们在计算机系统中的替换。在大规模NUMA/SMP计算机集群中，网络交换机构成该层次结构的第一级，第二级由计算机节点的共享内存表示。本文提出了一种估计mpi进程共享互连网络时消息传递时间的基准。研究了全对全集体操作的执行时间对消息大小和共享通信通道的进程数的依赖关系。根据资源作业管理系统确定的节点分配情况，开发了全对全作业完成时间预测软件。该软件使用了一组进程同时(并发)使用通信通道期间MPI_Send/MPI_Recv操作性能下降的实验估计结果。未来，我们将开发面向结构的算法来确定节点的分配。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Experimental Study of Network Contention Effects on All-to-All Operation

Interconnection networks of modern highperformance distributed computer systems are now deep hierarchical. In such systems, communication time between processors depends on their replacement in a computer system. In large-scale NUMA/SMP computer clusters network switches form the first level of this hierarchy and the second level is represented by a shared memory of computer nodes. In this paper we present a benchmark for estimating the message passing time when MPI-processes share the interconnection network. We studied the dependence of the execution time of the All-to-all collective operation on the message size and the number of processes sharing the communication channel. Authors developed a software for predicting the completion time of the All-to-all operation depending on the nodes allocation determined by the Resources and Jobs Management System. The software uses the results of an experimental estimate of the performance degradation for the MPI_Send/MPI_Recv operations during simultaneous (concurrent) usage of the communication channel by a set of processes. In future, we will develop structurally oriented algorithms for the determined nodes allocation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE)

自引率

0.00%

发文量