Exploiting multiple heterogeneous networks to reduce communication costs in parallel programs

JunSeong Kim, D. Lilja
{"title":"Exploiting multiple heterogeneous networks to reduce communication costs in parallel programs","authors":"JunSeong Kim, D. Lilja","doi":"10.1109/HCW.1997.581412","DOIUrl":null,"url":null,"abstract":"The different types of messages used by a parallel application program executing in a distributed system can each have unique characteristics so that no single communication network can produce the lowest latency for all messages. For instance, short control messages may be sent with the lowest overhead on one type of network, such as Ethernet, while bulk data transfers may be better suited to a different type of network, such as Fibre Channel or HiPPI. In this paper, we investigate how to exploit multiple heterogeneous communication networks that interconnect the same set of processing nodes by dynamically selecting the best (lowest latency) network for each message based on the message size. We also show how to aggregate these multiple parallel networks into a single virtual network to further reduce the latency and increase the available bandwidth. We test this multiplexing and aggregation on a cluster of SGI multiprocessors interconnected with both Fibre Channel and Ethernet. We find that multiplexing between Ethernet and Fibre Channel can substantially reduce communication overhead in a synthetic benchmark compared to using either network alone. Aggregating these two networks into a single virtual network can further reduce communication delays for applications with many large messages. The best choice of either multiplexing or aggregation depends on the mix of message sizes in application program and the relative overheads of the two networks.","PeriodicalId":286909,"journal":{"name":"Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)","volume":"70 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HCW.1997.581412","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14

Abstract

The different types of messages used by a parallel application program executing in a distributed system can each have unique characteristics so that no single communication network can produce the lowest latency for all messages. For instance, short control messages may be sent with the lowest overhead on one type of network, such as Ethernet, while bulk data transfers may be better suited to a different type of network, such as Fibre Channel or HiPPI. In this paper, we investigate how to exploit multiple heterogeneous communication networks that interconnect the same set of processing nodes by dynamically selecting the best (lowest latency) network for each message based on the message size. We also show how to aggregate these multiple parallel networks into a single virtual network to further reduce the latency and increase the available bandwidth. We test this multiplexing and aggregation on a cluster of SGI multiprocessors interconnected with both Fibre Channel and Ethernet. We find that multiplexing between Ethernet and Fibre Channel can substantially reduce communication overhead in a synthetic benchmark compared to using either network alone. Aggregating these two networks into a single virtual network can further reduce communication delays for applications with many large messages. The best choice of either multiplexing or aggregation depends on the mix of message sizes in application program and the relative overheads of the two networks.
利用多个异构网络来降低并行程序中的通信成本
在分布式系统中执行的并行应用程序程序所使用的不同类型的消息都可以具有独特的特征,因此没有任何一个通信网络可以为所有消息产生最低的延迟。例如,在一种类型的网络(如以太网)上,短控制消息可能以最低的开销发送,而批量数据传输可能更适合于不同类型的网络(如光纤通道或HiPPI)。在本文中,我们研究了如何利用多个异构通信网络,这些网络通过基于消息大小动态选择每个消息的最佳(最低延迟)网络来互连同一组处理节点。我们还展示了如何将这些多个并行网络聚合为单个虚拟网络,以进一步减少延迟并增加可用带宽。我们在通过光纤通道和以太网相互连接的SGI多处理器集群上测试了这种多路复用和聚合。我们发现,与单独使用任何一种网络相比,在以太网和光纤通道之间进行多路复用可以大大减少综合基准测试中的通信开销。将这两个网络聚合到一个虚拟网络中可以进一步减少具有许多大消息的应用程序的通信延迟。多路复用或聚合的最佳选择取决于应用程序中消息大小的混合以及两个网络的相对开销。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信