O. Tatebe, U. Nagashima, S. Sekiguchi, Hisayoshi Kitabayashi, Y. Hayashida
{"title":"远程内存操作的快速消息传递库FMPL的设计与实现","authors":"O. Tatebe, U. Nagashima, S. Sekiguchi, Hisayoshi Kitabayashi, Y. Hayashida","doi":"10.1145/582034.582049","DOIUrl":null,"url":null,"abstract":"A fast message-passing library FMPL has been designed and developed to maximize communication performance by utilizing general architectural communication support such as remote memory operations, as well as to maximize total performance by eliminating dynamic communication overhead and overlapping communication and computation. FMPL provides a low-cost general-purpose point-to-point communication and collective communication such as broadcast, barrier synchronization and reduction. On a Hitachi SR8000, FMPL achieves an 8-byte latency of 12.8µsec., while MPI achieves 20µsec. FMPL is designed for building more highly functional message-passing libraries like BLACS as well as applications that need maximum performance.","PeriodicalId":325282,"journal":{"name":"ACM/IEEE SC 2001 Conference (SC'01)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Design and Implementation of FMPL, a Fast Message-Passing Library for Remote Memory Operations\",\"authors\":\"O. Tatebe, U. Nagashima, S. Sekiguchi, Hisayoshi Kitabayashi, Y. Hayashida\",\"doi\":\"10.1145/582034.582049\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A fast message-passing library FMPL has been designed and developed to maximize communication performance by utilizing general architectural communication support such as remote memory operations, as well as to maximize total performance by eliminating dynamic communication overhead and overlapping communication and computation. FMPL provides a low-cost general-purpose point-to-point communication and collective communication such as broadcast, barrier synchronization and reduction. On a Hitachi SR8000, FMPL achieves an 8-byte latency of 12.8µsec., while MPI achieves 20µsec. FMPL is designed for building more highly functional message-passing libraries like BLACS as well as applications that need maximum performance.\",\"PeriodicalId\":325282,\"journal\":{\"name\":\"ACM/IEEE SC 2001 Conference (SC'01)\",\"volume\":\"91 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM/IEEE SC 2001 Conference (SC'01)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/582034.582049\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM/IEEE SC 2001 Conference (SC'01)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/582034.582049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Design and Implementation of FMPL, a Fast Message-Passing Library for Remote Memory Operations
A fast message-passing library FMPL has been designed and developed to maximize communication performance by utilizing general architectural communication support such as remote memory operations, as well as to maximize total performance by eliminating dynamic communication overhead and overlapping communication and computation. FMPL provides a low-cost general-purpose point-to-point communication and collective communication such as broadcast, barrier synchronization and reduction. On a Hitachi SR8000, FMPL achieves an 8-byte latency of 12.8µsec., while MPI achieves 20µsec. FMPL is designed for building more highly functional message-passing libraries like BLACS as well as applications that need maximum performance.