Yutaka Watanabe, M. Sato, Miwako Tsuji, H. Murai, T. Boku
Published in: 2022 IEEE/ACM Parallel Applications Workshop: Alternatives To MPI+X (PAW-ATM), 2022-11-01
DOI: 10.1109/PAW-ATM56565.2022.00010 (https://doi.org/10.1109/PAW-ATM56565.2022.00010)
Citations: 0
Design and Performance Evaluation of UCX for Tofu-D Interconnect with OpenSHMEM-UCX on Fugaku
The partitioned global address space (PGAS) model with one-sided communication has recently attracted attention as an easy and intuitive way to describe remote data access across nodes. PGAS can be implemented on top of remote direct memory access (RDMA), which provides lightweight one-sided communication and low-overhead synchronization semantics. In this paper, to enable portable, lightweight, and efficient one-sided communication on the Fugaku supercomputer, we designed and implemented Unified Communication X (UCX) for Tofu Interconnect D (Tofu-D). An evaluation using OpenSHMEM-UCX and OSHMPI indicates that OpenSHMEM over UCX on Tofu-D achieves lower latency and better efficiency than OpenSHMEM over MPI, and that it benefits several applications based on PGAS models.
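To make the one-sided semantics mentioned in the abstract concrete, the following is a toy, single-process sketch in Python. It is not OpenSHMEM, UCX, or anything taken from the paper: the names `ToyPGAS`, `PE`, `alloc`, `put`, `get`, and `ring_shift` are hypothetical and exist only for illustration. The point it tries to show is that each processing element (PE) owns a partition of a shared address space, and that a put or get is issued by the initiator alone, with no matching receive on the target.

```python
from dataclasses import dataclass, field


@dataclass
class PE:
    """One processing element (PE) holding its own partition of the address space."""
    rank: int
    heap: dict = field(default_factory=dict)  # symmetric variables: name -> list of values


class ToyPGAS:
    """Toy, single-process model of a partitioned global address space (illustrative only)."""

    def __init__(self, n_pes: int):
        self.pes = [PE(rank) for rank in range(n_pes)]

    def alloc(self, name: str, size: int) -> None:
        # Symmetric allocation: the same name and size exist on every PE.
        for pe in self.pes:
            pe.heap[name] = [0] * size

    def put(self, name: str, values: list, target: int, offset: int = 0) -> None:
        # One-sided write: only the initiator calls this; the target runs no receive.
        buf = self.pes[target].heap[name]
        if offset < 0 or offset + len(values) > len(buf):
            raise IndexError("put outside the symmetric buffer")
        buf[offset:offset + len(values)] = values

    def get(self, name: str, source: int, offset: int, count: int) -> list:
        # One-sided read: the initiator copies data out of the source PE's partition.
        buf = self.pes[source].heap[name]
        if offset < 0 or offset + count > len(buf):
            raise IndexError("get outside the symmetric buffer")
        return buf[offset:offset + count]


def ring_shift(n_pes: int) -> list:
    """Each PE puts its own rank into the inbox of its right-hand neighbour."""
    pgas = ToyPGAS(n_pes)
    pgas.alloc("inbox", 1)
    for me in range(n_pes):
        right = (me + 1) % n_pes
        pgas.put("inbox", [me], target=right)
    # Read back every PE's inbox to observe the result of the puts.
    return [pgas.get("inbox", source=pe, offset=0, count=1)[0] for pe in range(n_pes)]


if __name__ == "__main__":
    print(ring_shift(4))
```

In a real PGAS runtime the puts would travel over the interconnect (for example via RDMA) and would need explicit synchronization before the target reads its inbox; the sequential loop here hides that step.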