{"title":"Efficient Multicast on Wormhole Switch-Based Nowp","authors":"Kuo-Pao Fan, C. King","doi":"10.1142/S0129053397000209","DOIUrl":null,"url":null,"abstract":"High bandwidth and low latency switches are commercially available. Using these switches, it becomes possible to build a system area network to interconnect workstations and processor clusters together to provide a cost-effective parallel computing platform. A processor cluster may be a shared-memory multiprocessor or a mesh-connected multicomputer, etc. The interconnection topology on this kind of platform, called switch-based NOWP, is usually irregular. On such systems, multicast is an important collective communication operation. Two steps are involved in a multicast: (1) the source node sends the multicast message to the destinations which are connected to a switch directly or are the leader of a processor cluster, and (2) the leader node of each cluster sends the message to other destinations in the same cluster. In this paper, we propose two unicast-based multicast algorithms. Algorithm Multicast_1 performs those two steps sequentially; while Algorithm Multicast_2 overlaps them. Performance of the two algorithms will be evaluated and compared.","PeriodicalId":270006,"journal":{"name":"Int. J. High Speed Comput.","volume":"94 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. High Speed Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/S0129053397000209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
High bandwidth and low latency switches are commercially available. Using these switches, it becomes possible to build a system area network to interconnect workstations and processor clusters together to provide a cost-effective parallel computing platform. A processor cluster may be a shared-memory multiprocessor or a mesh-connected multicomputer, etc. The interconnection topology on this kind of platform, called switch-based NOWP, is usually irregular. On such systems, multicast is an important collective communication operation. Two steps are involved in a multicast: (1) the source node sends the multicast message to the destinations which are connected to a switch directly or are the leader of a processor cluster, and (2) the leader node of each cluster sends the message to other destinations in the same cluster. In this paper, we propose two unicast-based multicast algorithms. Algorithm Multicast_1 performs those two steps sequentially; while Algorithm Multicast_2 overlaps them. Performance of the two algorithms will be evaluated and compared.