Evaluation of SDN-based Conflict Avoidance between Data Staging and Inter-Process Communication

Arata Endo, Ryoichi Jingai, S. Date, Y. Kido, S. Shimojo
{"title":"Evaluation of SDN-based Conflict Avoidance between Data Staging and Inter-Process Communication","authors":"Arata Endo, Ryoichi Jingai, S. Date, Y. Kido, S. Shimojo","doi":"10.1109/HPCS.2017.48","DOIUrl":null,"url":null,"abstract":"High-performance computing systems are currently used by many scientists and researchers with Job Management Systems (JMS) that are deployed over high-performance computing systems. To achieve high performance in high-performance computing, systems communication performance must be improved in the interconnect. JMS sometimes adopts the staging, witch is a method to transfer data used by an application program to a local high-speed storage of computing nodes so that the access time to the data is reduced. In this type of high-performance computing systems, the inter-process and the staging communication traffic coexist in an interconnect. In this paper, we investigate the mutual influence between the inter-process and the staging communication traffic and explore a way to exclude the influence of the staging and inter-process communications by targeting a cluster system with a fat-tree interconnect. For this purpose, we propose two types of conflict avoidance methods between the staging communication traffic and the inter-process communication traffic; namely, a link separation conflict avoidance method and a link sharing conflict avoidance method. The link separation conflict avoidance method separates interconnect links into the staging communication traffic and the inter-process communication traffic when the staging execution takes place. The latter proposed method allocates a route to each traffic on the shared interconnect while taking the bandwidth usage into account. Both proposed methods use Software Defined Networking to get the bandwidth usage and to control both types of traffic. We performed an experimental simulation on our deployed virtual cluster system built on a single machine to investigate how each proposed method behaves when the staging communication and the inter-process communication take place in a fat-tree interconnect. In the simulation, the staging time, application execution time and job execution time were measured for each proposed method and for the Equal-Cost Multi-Path method as the existing load distributed method was applied. The result shows that both of the proposed methods improve the performance of the staging and the inter-process communication.","PeriodicalId":115758,"journal":{"name":"2017 International Conference on High Performance Computing & Simulation (HPCS)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCS.2017.48","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

High-performance computing systems are currently used by many scientists and researchers with Job Management Systems (JMS) that are deployed over high-performance computing systems. To achieve high performance in high-performance computing, systems communication performance must be improved in the interconnect. JMS sometimes adopts the staging, witch is a method to transfer data used by an application program to a local high-speed storage of computing nodes so that the access time to the data is reduced. In this type of high-performance computing systems, the inter-process and the staging communication traffic coexist in an interconnect. In this paper, we investigate the mutual influence between the inter-process and the staging communication traffic and explore a way to exclude the influence of the staging and inter-process communications by targeting a cluster system with a fat-tree interconnect. For this purpose, we propose two types of conflict avoidance methods between the staging communication traffic and the inter-process communication traffic; namely, a link separation conflict avoidance method and a link sharing conflict avoidance method. The link separation conflict avoidance method separates interconnect links into the staging communication traffic and the inter-process communication traffic when the staging execution takes place. The latter proposed method allocates a route to each traffic on the shared interconnect while taking the bandwidth usage into account. Both proposed methods use Software Defined Networking to get the bandwidth usage and to control both types of traffic. We performed an experimental simulation on our deployed virtual cluster system built on a single machine to investigate how each proposed method behaves when the staging communication and the inter-process communication take place in a fat-tree interconnect. In the simulation, the staging time, application execution time and job execution time were measured for each proposed method and for the Equal-Cost Multi-Path method as the existing load distributed method was applied. The result shows that both of the proposed methods improve the performance of the staging and the inter-process communication.
基于sdn的数据分级与进程间通信冲突避免评估
目前,许多科学家和研究人员将高性能计算系统与部署在高性能计算系统上的作业管理系统(JMS)一起使用。为了在高性能计算中实现高性能,必须在互连中提高系统的通信性能。JMS有时采用分段,它是将应用程序使用的数据传输到计算节点的本地高速存储中,从而减少对数据的访问时间的一种方法。在这种类型的高性能计算系统中,进程间通信流量和分段通信流量共存于一个互连中。本文研究了进程间和分级通信流量之间的相互影响,并以具有胖树互连的集群系统为研究对象,探索了一种排除分级和分级通信影响的方法。为此,我们提出了两种暂存通信流量和进程间通信流量之间的冲突避免方法;即链路分离冲突避免方法和链路共享冲突避免方法。链路分离冲突避免方法在进行分段执行时将互连链路分离为分段通信流量和进程间通信流量。后一种方法在考虑带宽使用情况的同时,为共享互连上的每个流量分配一条路由。这两种方法都使用软件定义网络来获取带宽使用情况并控制两种类型的流量。我们对构建在单台机器上的已部署虚拟集群系统进行了实验模拟,以研究在胖树互连中进行暂存通信和进程间通信时所提出的每种方法的行为。在仿真中,测量了每种方法的暂存时间、应用程序执行时间和作业执行时间,并在采用现有负载分配方法时测量了等成本多路径方法的暂存时间、应用程序执行时间和作业执行时间。结果表明,两种方法都提高了分级和进程间通信的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信