Design and Implementation of FMPL, a Fast Message-Passing Library for Remote Memory Operations

O. Tatebe, U. Nagashima, S. Sekiguchi, Hisayoshi Kitabayashi, Y. Hayashida
{"title":"Design and Implementation of FMPL, a Fast Message-Passing Library for Remote Memory Operations","authors":"O. Tatebe, U. Nagashima, S. Sekiguchi, Hisayoshi Kitabayashi, Y. Hayashida","doi":"10.1145/582034.582049","DOIUrl":null,"url":null,"abstract":"A fast message-passing library FMPL has been designed and developed to maximize communication performance by utilizing general architectural communication support such as remote memory operations, as well as to maximize total performance by eliminating dynamic communication overhead and overlapping communication and computation. FMPL provides a low-cost general-purpose point-to-point communication and collective communication such as broadcast, barrier synchronization and reduction. On a Hitachi SR8000, FMPL achieves an 8-byte latency of 12.8µsec., while MPI achieves 20µsec. FMPL is designed for building more highly functional message-passing libraries like BLACS as well as applications that need maximum performance.","PeriodicalId":325282,"journal":{"name":"ACM/IEEE SC 2001 Conference (SC'01)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM/IEEE SC 2001 Conference (SC'01)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/582034.582049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

A fast message-passing library FMPL has been designed and developed to maximize communication performance by utilizing general architectural communication support such as remote memory operations, as well as to maximize total performance by eliminating dynamic communication overhead and overlapping communication and computation. FMPL provides a low-cost general-purpose point-to-point communication and collective communication such as broadcast, barrier synchronization and reduction. On a Hitachi SR8000, FMPL achieves an 8-byte latency of 12.8µsec., while MPI achieves 20µsec. FMPL is designed for building more highly functional message-passing libraries like BLACS as well as applications that need maximum performance.
远程内存操作的快速消息传递库FMPL的设计与实现
设计和开发了一个快速消息传递库FMPL,通过利用诸如远程内存操作之类的通用架构通信支持来最大化通信性能,并通过消除动态通信开销和重叠通信和计算来最大化总性能。FMPL提供了低成本的通用点对点通信和集体通信,如广播、屏障同步和减少。在日立SR8000上,FMPL实现了12.8µs的8字节延迟。,而MPI达到20µs。FMPL是为构建功能更强大的消息传递库(如BLACS)以及需要最大性能的应用程序而设计的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信