Adaptive transport service selection for MPI with InfiniBand network
Masamichi Takagi, Norio Yamaguchi, Balazs Gerofi, A. Hori, Y. Ishikawa
Workshop on Exascale MPI, November 15, 2015. DOI: 10.1145/2831129.2831132

Abstract: We propose a method that adaptively selects the InfiniBand transport service used for each pair of source and destination peers, improving performance while limiting the memory consumption of the MPI library. Two major IB transport services are available, Reliable Connection (RC) and Dynamically Connected (DC), and one of them is chosen per source-destination pair. RC is faster than DC for all communication patterns except when there are many active RCs, but it consumes a large amount of memory when there are many processes. DC consumes less memory than RC, but its performance drops when a peer sends messages to many different destinations or when many DCs send messages to the same destination DC. The library should therefore find the best mapping of RCs and DCs to source-destination pairs according to the application's communication pattern. Our method finds a good mapping by comparing the potential latency benefits of candidate mappings. It achieves a 13%-19% latency reduction compared to a DC-only configuration in micro-benchmarks representing communication patterns that are problematic for RC or DC with 64 processes.
Practical resilient cases for FA-MPI, a transactional fault-tolerant MPI
Amin Hassani, A. Skjellum, P. Bangalore, R. Brightwell
Workshop on Exascale MPI, November 15, 2015. DOI: 10.1145/2831129.2831130

Abstract: MPI is insufficient when confronting failures. FA-MPI (Fault-Aware MPI) provides extensions to the MPI standard designed to enable data-parallel applications to achieve resilience without sacrificing scalability. FA-MPI introduces transactions as a novel extension to the MPI message-passing model; transactions support failure detection, isolation, mitigation, and recovery via application-driven policies. Because overlapping communication and I/O with computation through non-blocking operations is increasingly important for extracting the full performance of modern machines, we emphasize fault-tolerant, non-blocking communication operations plus a set of nestable, lightweight transactional TryBlock API extensions able to exploit system and application hierarchy. This strategy enables applications to run to completion with higher probability than they otherwise would. We modified two proxy applications, MiniFE and LULESH, by adding FA-MPI semantics to them, and we present performance and overhead results for 1K MPI processes.
A data streaming model in MPI
I. Peng, S. Markidis, E. Laure, Daniel J. Holmes, Mark Bull
Workshop on Exascale MPI, November 15, 2015. DOI: 10.1145/2831129.2831131

Abstract: The data streaming model is an effective way to tackle the challenges of data-intensive applications. As traditional HPC applications generate large volumes of data and more data-intensive applications move to HPC infrastructures, it is necessary to investigate the feasibility of combining the message-passing and streaming programming models. MPI, the de facto standard for programming HPC systems, cannot intuitively express the communication patterns and functional operations required by streaming models. In this work, we designed and implemented a data streaming library, MPIStream, atop MPI to allocate data producers and consumers, to stream data continuously or irregularly, and to process data at run time. In the same spirit as the STREAM benchmark, we developed a parallel stream benchmark to measure the data processing rate. The performance of the library depends largely on the size of the stream element, the number of data producers and consumers, and the computational intensity of processing one stream element. With 2,048 data producers and 2,048 data consumers in the parallel benchmark, MPIStream achieved a 200 GB/s processing rate on a Blue Gene/Q supercomputer. We illustrate that a streaming library for HPC applications can effectively enable irregular parallel I/O, application monitoring, and threshold collective operations.
{"title":"Overtime: a tool for analyzing performance variation due to network interference","authors":"Ryan E. Grant, K. Pedretti, A. Gentile","doi":"10.1145/2831129.2831133","DOIUrl":"https://doi.org/10.1145/2831129.2831133","url":null,"abstract":"Shared networks create unique challenges in obtaining consistent performance across jobs for large systems when not using exclusive system-wide allocations. In order to provide good system utilization, resource managers allocate system space to multiple jobs. These multiple independent node allocations can interfere with each other through their shared network. This work provides a method of observing and measuring the impact of network contention due to interference from other jobs through a continually running benchmark application and the use of network performance counters. This is the first work to measure network interference using specially designed benchmarks and network performance counters.","PeriodicalId":417011,"journal":{"name":"Workshop on Exascale MPI","volume":"31 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125758962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Preparing for exascale: modeling MPI for many-core systems using fine-grain queues
P. Bridges, Matthew G. F. Dosanjh, Ryan E. Grant, A. Skjellum, Shane Farmer, R. Brightwell
Workshop on Exascale MPI, November 15, 2015. DOI: 10.1145/2831129.2831134

Abstract: This paper presents a fine-grain queueing model of MPI point-to-point messaging performance for use in the design and analysis of current and future large-scale computing systems. In particular, the model seeks to capture the key performance behavior of MPI communication on many-core systems. We demonstrate that the model encompasses key MPI performance characteristics, such as the short/long protocol and offload/onload protocol tradeoffs, and demonstrate its use in predicting the potential impact of architectural and software changes on communication performance for many-core systems. We also discuss the limitations of the model and potential directions for enhancing its fidelity.