O. Tatebe, U. Nagashima, S. Sekiguchi, Hisayoshi Kitabayashi, Y. Hayashida
{"title":"Design and Implementation of FMPL, a Fast Message-Passing Library for Remote Memory Operations","authors":"O. Tatebe, U. Nagashima, S. Sekiguchi, Hisayoshi Kitabayashi, Y. Hayashida","doi":"10.1145/582034.582049","DOIUrl":null,"url":null,"abstract":"A fast message-passing library FMPL has been designed and developed to maximize communication performance by utilizing general architectural communication support such as remote memory operations, as well as to maximize total performance by eliminating dynamic communication overhead and overlapping communication and computation. FMPL provides a low-cost general-purpose point-to-point communication and collective communication such as broadcast, barrier synchronization and reduction. On a Hitachi SR8000, FMPL achieves an 8-byte latency of 12.8µsec., while MPI achieves 20µsec. FMPL is designed for building more highly functional message-passing libraries like BLACS as well as applications that need maximum performance.","PeriodicalId":325282,"journal":{"name":"ACM/IEEE SC 2001 Conference (SC'01)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM/IEEE SC 2001 Conference (SC'01)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/582034.582049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
A fast message-passing library FMPL has been designed and developed to maximize communication performance by utilizing general architectural communication support such as remote memory operations, as well as to maximize total performance by eliminating dynamic communication overhead and overlapping communication and computation. FMPL provides a low-cost general-purpose point-to-point communication and collective communication such as broadcast, barrier synchronization and reduction. On a Hitachi SR8000, FMPL achieves an 8-byte latency of 12.8µsec., while MPI achieves 20µsec. FMPL is designed for building more highly functional message-passing libraries like BLACS as well as applications that need maximum performance.