Implementing Multidestination Worms In Switch-based Parallel Systems: Architectural Alternatives And Their Impact

Conference Proceedings. The 24th Annual International Symposium on Computer Architecture Pub Date : 1997-06-01 DOI:10.1145/264107.264129

Rajeev Sivaram, C. Stunkel, D. Panda

{"title":"Implementing Multidestination Worms In Switch-based Parallel Systems: Architectural Alternatives And Their Impact","authors":"Rajeev Sivaram, C. Stunkel, D. Panda","doi":"10.1145/264107.264129","DOIUrl":null,"url":null,"abstract":"Multidestination message passing has been proposed as an attractive mechanism for efficiently implementing multicast and other collective operations on direct networks. However, applying this mechanism to switch-based parallel systems is non-trivial. In this paper we propose alternative switch architectures with differing buffer organizations to implement multidestination worms on switch-based parallel systems. First, we discuss issues related to such implementation (deadlock-freedom, replication mechanisms, header encoding, and routing). Next, we demonstrate how an existing central-buffer-based switch architecture supporting unicast message passing can be enhanced to accommodate multidestination message passing. Similarly, implementing multidestination worms on an input-buffer-based switch architecture is discussed. Both of these implementations are evaluated against each other as well as against a software-based scheme using the central buffer organization. Simulation experiments under a range of traffic (multiple multicast, bimodal, varying degree of multicast, and message length) and system size are used for evaluation. The study demonstrates the superiority of the central-buffer-based switch architecture. It also indicates that under bimodal traffic the central-buffer-based hardware multicast implementation affects background unicast traffic less adversely compared to a software-based multicast implementation. Thus, multidestination message passing can easily be applied to switch-based parallel systems to deliver good collective communication performance.","PeriodicalId":405506,"journal":{"name":"Conference Proceedings. The 24th Annual International Symposium on Computer Architecture","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"63","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Proceedings. The 24th Annual International Symposium on Computer Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/264107.264129","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 63

Abstract

Multidestination message passing has been proposed as an attractive mechanism for efficiently implementing multicast and other collective operations on direct networks. However, applying this mechanism to switch-based parallel systems is non-trivial. In this paper we propose alternative switch architectures with differing buffer organizations to implement multidestination worms on switch-based parallel systems. First, we discuss issues related to such implementation (deadlock-freedom, replication mechanisms, header encoding, and routing). Next, we demonstrate how an existing central-buffer-based switch architecture supporting unicast message passing can be enhanced to accommodate multidestination message passing. Similarly, implementing multidestination worms on an input-buffer-based switch architecture is discussed. Both of these implementations are evaluated against each other as well as against a software-based scheme using the central buffer organization. Simulation experiments under a range of traffic (multiple multicast, bimodal, varying degree of multicast, and message length) and system size are used for evaluation. The study demonstrates the superiority of the central-buffer-based switch architecture. It also indicates that under bimodal traffic the central-buffer-based hardware multicast implementation affects background unicast traffic less adversely compared to a software-based multicast implementation. Thus, multidestination message passing can easily be applied to switch-based parallel systems to deliver good collective communication performance.

查看原文本刊更多论文

在基于交换机的并行系统中实现多目标蠕虫:架构选择及其影响

多目标消息传递作为一种有效实现组播和其他在直连网络上的集体操作的有吸引力的机制被提出。然而，将这种机制应用于基于开关的并行系统并非易事。在本文中，我们提出了具有不同缓冲组织的替代交换机架构，以在基于交换机的并行系统上实现多目标蠕虫。首先，我们讨论与这种实现相关的问题(无死锁、复制机制、报头编码和路由)。接下来，我们将演示如何对支持单播消息传递的现有基于中央缓冲区的交换机架构进行增强，以适应多目的地消息传递。同样，我们也讨论了在基于输入缓冲的交换机架构上实现多目标蠕虫。这两种实现都是根据彼此以及使用中央缓冲区组织的基于软件的方案进行评估的。在各种流量(多组播、双峰、不同程度的组播和消息长度)和系统大小下的仿真实验用于评估。该研究证明了基于中央缓冲器的开关结构的优越性。它还表明，在双峰流量下，与基于软件的多播实现相比，基于中央缓冲区的硬件多播实现对后台单播流量的不利影响较小。因此，多目的地消息传递可以很容易地应用于基于交换机的并行系统，以提供良好的集体通信性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Conference Proceedings. The 24th Annual International Symposium on Computer Architecture

自引率

0.00%

发文量