Straightforward modeling of fully-connected dragonfly topologies in HPC-system simulators

P. Yébenes, P. García, F. Quiles, J. Escudero-Sahuquillo
{"title":"Straightforward modeling of fully-connected dragonfly topologies in HPC-system simulators","authors":"P. Yébenes, P. García, F. Quiles, J. Escudero-Sahuquillo","doi":"10.1109/HPCSim.2015.7237037","DOIUrl":null,"url":null,"abstract":"HPC systems are growing in number of components which have to be interconnected in an efficient way. For that reason, network design has become a key issue in the development of these systems, especially when they are made of thousands of elements. In order to maximize the performance achieved by the network with an affordable cost, new network topologies have been proposed in the last years. Among them, one of the most popular is the dragonfly topology which benefits from high radix switches. As it is not affordable to test these topologies in large real systems, simulation is widely used. In that sense, simulation frameworks are used for avoiding problems and costs derived from developing a simulator from scratch, as well as easing the design of new models. In that sense, OMNeT++ is one of the most prominent simulation frameworks, deeply accepted in modeling large networks. This paper focuses on the modeling of fully-connected dragonfly topologies and its implementation in generic HPC-system simulators. First, we explain in detail the modeling of the dragonfly interconnection pattern. Next, we also describe the modeling of the minimal-path routing algorithm which fits the proposed pattern, as well as the mechanism required for avoiding deadlocks. Besides, we describe the basics of the implementation of the proposed model in an OMNeT++-based simulator. Finally, by means of a set of experiments carried out under several dragonfly configurations, we show performance results obtained from the simulator that implements our dragonfly model, and we compare them with results shown in other papers for validation purposes. Although this evaluation has been made using an OMNeT++-based simulator, the modeled interconnection pattern and routing algorithm can be adapted to any simulation tool.","PeriodicalId":134009,"journal":{"name":"2015 International Conference on High Performance Computing & Simulation (HPCS)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCSim.2015.7237037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

HPC systems are growing in number of components which have to be interconnected in an efficient way. For that reason, network design has become a key issue in the development of these systems, especially when they are made of thousands of elements. In order to maximize the performance achieved by the network with an affordable cost, new network topologies have been proposed in the last years. Among them, one of the most popular is the dragonfly topology which benefits from high radix switches. As it is not affordable to test these topologies in large real systems, simulation is widely used. In that sense, simulation frameworks are used for avoiding problems and costs derived from developing a simulator from scratch, as well as easing the design of new models. In that sense, OMNeT++ is one of the most prominent simulation frameworks, deeply accepted in modeling large networks. This paper focuses on the modeling of fully-connected dragonfly topologies and its implementation in generic HPC-system simulators. First, we explain in detail the modeling of the dragonfly interconnection pattern. Next, we also describe the modeling of the minimal-path routing algorithm which fits the proposed pattern, as well as the mechanism required for avoiding deadlocks. Besides, we describe the basics of the implementation of the proposed model in an OMNeT++-based simulator. Finally, by means of a set of experiments carried out under several dragonfly configurations, we show performance results obtained from the simulator that implements our dragonfly model, and we compare them with results shown in other papers for validation purposes. Although this evaluation has been made using an OMNeT++-based simulator, the modeled interconnection pattern and routing algorithm can be adapted to any simulation tool.
hpc系统模拟器中全连接蜻蜓拓扑的直接建模
高性能计算系统的组件数量不断增加,这些组件必须以一种有效的方式相互连接。因此,网络设计已成为这些系统开发中的一个关键问题,特别是当它们由数千个元素组成时。为了以可承受的成本实现网络性能的最大化,近年来提出了新的网络拓扑。其中,最受欢迎的是蜻蜓拓扑,它得益于高基数开关。由于无法在大型真实系统中测试这些拓扑,因此仿真被广泛使用。从这个意义上说,模拟框架用于避免从头开发模拟器产生的问题和成本,以及简化新模型的设计。从这个意义上说,omnet++是最突出的仿真框架之一,在大型网络建模中被广泛接受。本文主要研究全连接蜻蜓拓扑的建模及其在通用hpc系统模拟器中的实现。首先,我们详细解释了蜻蜓互连模式的建模。接下来,我们还描述了适合所提出模式的最小路径路由算法的建模,以及避免死锁所需的机制。此外,我们还描述了在基于omnet++的模拟器中实现所提出模型的基本原理。最后,通过在几种蜻蜓配置下进行的一组实验,我们展示了从实现我们的蜻蜓模型的模拟器获得的性能结果,并将其与其他论文的结果进行了比较,以进行验证。虽然这种评估是使用基于omnet++的模拟器进行的,但建模的互连模式和路由算法可以适应任何仿真工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信