{"title":"Message from the Workshop Chairs","authors":"E. Gahleitner, Wolfram Wöß","doi":"10.1109/DEXA.2006.92","DOIUrl":null,"url":null,"abstract":"A Benchmark Fast of Multithreaded Communication Performance” new benchmark for multi-threaded communication, of Multi-GPU MPI Collective Communications on Large FFT Computation” the key area of GPU enabled MPI collectives. Improvements to Allreduce” by Bienz et al. explores the topic of collective communications and whether they can be accelerated by taking into account process placement on multiple nodes and taking advantage of local node shared memory communication in an intelligent way to accelerate all reduce. Gawande et al. MPI to provide a high-performance implementation of a PGAS runtime built on top of MPI two-sided communication in “Accelerating the Global Arrays ComEx Runtime Using Multiple Progress Ranks”. Margolin et al’s Library for Collective Operations in MPI” used underlying RDMA network operations to accelerate large scale collective operations in MPI. Finally, Schuchart et al. explore the possibility of using MPI RMA as a basis for an active message solution in MPI-3 RMA for Active Messages”.","PeriodicalId":107291,"journal":{"name":"DEXA Workshops","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DEXA Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEXA.2006.92","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A Benchmark Fast of Multithreaded Communication Performance” new benchmark for multi-threaded communication, of Multi-GPU MPI Collective Communications on Large FFT Computation” the key area of GPU enabled MPI collectives. Improvements to Allreduce” by Bienz et al. explores the topic of collective communications and whether they can be accelerated by taking into account process placement on multiple nodes and taking advantage of local node shared memory communication in an intelligent way to accelerate all reduce. Gawande et al. MPI to provide a high-performance implementation of a PGAS runtime built on top of MPI two-sided communication in “Accelerating the Global Arrays ComEx Runtime Using Multiple Progress Ranks”. Margolin et al’s Library for Collective Operations in MPI” used underlying RDMA network operations to accelerate large scale collective operations in MPI. Finally, Schuchart et al. explore the possibility of using MPI RMA as a basis for an active message solution in MPI-3 RMA for Active Messages”.
“大型FFT计算下多GPU MPI集体通信的多线程通信新基准”是GPU支持MPI集体的关键领域。Bienz等人的“Allreduce的改进”探讨了集体通信的主题,以及是否可以通过考虑多个节点上的进程放置和利用本地节点共享内存通信以智能的方式加速all reduce来加速集体通信。Gawande等人。MPI将在“使用多个进度等级加速全局阵列ComEx运行时”中提供基于MPI双边通信的PGAS运行时的高性能实现。Margolin等人的“MPI集体操作库”使用底层RDMA网络操作来加速MPI中的大规模集体操作。最后,Schuchart等人在“MPI-3 RMA for active Messages”中探讨了使用MPI RMA作为活动消息解决方案基础的可能性。