{"title":"全对全操作中网络竞争效应的实验研究","authors":"E. N. Peryshkova, M. Kurnosov","doi":"10.1109/APEIE.2018.8546166","DOIUrl":null,"url":null,"abstract":"Interconnection networks of modern highperformance distributed computer systems are now deep hierarchical. In such systems, communication time between processors depends on their replacement in a computer system. In large-scale NUMA/SMP computer clusters network switches form the first level of this hierarchy and the second level is represented by a shared memory of computer nodes. In this paper we present a benchmark for estimating the message passing time when MPI-processes share the interconnection network. We studied the dependence of the execution time of the All-to-all collective operation on the message size and the number of processes sharing the communication channel. Authors developed a software for predicting the completion time of the All-to-all operation depending on the nodes allocation determined by the Resources and Jobs Management System. The software uses the results of an experimental estimate of the performance degradation for the MPI_Send/MPI_Recv operations during simultaneous (concurrent) usage of the communication channel by a set of processes. In future, we will develop structurally oriented algorithms for the determined nodes allocation.","PeriodicalId":147830,"journal":{"name":"2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Experimental Study of Network Contention Effects on All-to-All Operation\",\"authors\":\"E. N. Peryshkova, M. Kurnosov\",\"doi\":\"10.1109/APEIE.2018.8546166\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Interconnection networks of modern highperformance distributed computer systems are now deep hierarchical. In such systems, communication time between processors depends on their replacement in a computer system. In large-scale NUMA/SMP computer clusters network switches form the first level of this hierarchy and the second level is represented by a shared memory of computer nodes. In this paper we present a benchmark for estimating the message passing time when MPI-processes share the interconnection network. We studied the dependence of the execution time of the All-to-all collective operation on the message size and the number of processes sharing the communication channel. Authors developed a software for predicting the completion time of the All-to-all operation depending on the nodes allocation determined by the Resources and Jobs Management System. The software uses the results of an experimental estimate of the performance degradation for the MPI_Send/MPI_Recv operations during simultaneous (concurrent) usage of the communication channel by a set of processes. In future, we will develop structurally oriented algorithms for the determined nodes allocation.\",\"PeriodicalId\":147830,\"journal\":{\"name\":\"2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE)\",\"volume\":\"81 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APEIE.2018.8546166\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APEIE.2018.8546166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Experimental Study of Network Contention Effects on All-to-All Operation
Interconnection networks of modern highperformance distributed computer systems are now deep hierarchical. In such systems, communication time between processors depends on their replacement in a computer system. In large-scale NUMA/SMP computer clusters network switches form the first level of this hierarchy and the second level is represented by a shared memory of computer nodes. In this paper we present a benchmark for estimating the message passing time when MPI-processes share the interconnection network. We studied the dependence of the execution time of the All-to-all collective operation on the message size and the number of processes sharing the communication channel. Authors developed a software for predicting the completion time of the All-to-all operation depending on the nodes allocation determined by the Resources and Jobs Management System. The software uses the results of an experimental estimate of the performance degradation for the MPI_Send/MPI_Recv operations during simultaneous (concurrent) usage of the communication channel by a set of processes. In future, we will develop structurally oriented algorithms for the determined nodes allocation.