Meta-Migration: Reducing Switch Migration Tail Latency Through Competition

Sepehr Abbasi Zadeh, Farid Zandi, Matthew Buckley, Y. Ganjali
{"title":"Meta-Migration: Reducing Switch Migration Tail Latency Through Competition","authors":"Sepehr Abbasi Zadeh, Farid Zandi, Matthew Buckley, Y. Ganjali","doi":"10.23919/IFIPNetworking57963.2023.10186446","DOIUrl":null,"url":null,"abstract":"Resource management in distributed network control planes plays a vital role in the performance of the data plane and therefore the performance of network applications. Overwhelmed controller instances or underutilized instances could reshape their workloads by exchanging their load, i.e., switches that they control. To safely implement this exchange procedure, switch migration protocols are being used. As the migration procedure pauses processing new flows for a few milliseconds, these protocols are designed to be as fast as possible. Faster protocols add to the agility of the network to rapidly cope with the changing demand. In this paper, we introduce a general framework, called Meta-Migration, which focuses on expediting the existing time-sensitive controller load migration protocols. Based on the observation that these protocols impose low overheads on the involved parties, we modify them in a way that they can run in parallel toward multiple candidate destinations. Unlike the usual Fixed protocols that have to decide their destinations before running the protocol, here we rely on the real-time probes that we obtain from multiple systems and commit to only one of them in the middle of the procedure. Typically, migrations can complete on sub-second timescales, but sudden traffic bursts or system-level glitches can significantly slow down these protocols. We observe that by using Meta-Migration, we can dramatically diminish these negative effects. We show theoretical justifications for why this approach improves the overall performance of the migration, namely, its mean finishing time, and the tail latency of the migration. In addition, by developing a distributed controller simulator over real physical devices, we thoroughly measure the effectiveness of this approach as well as its incurred overheads. Our testbed results show up to a 53% tail reduction in the migration time.","PeriodicalId":31737,"journal":{"name":"Edutech","volume":"114 1","pages":"1-9"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Edutech","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/IFIPNetworking57963.2023.10186446","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Resource management in distributed network control planes plays a vital role in the performance of the data plane and therefore the performance of network applications. Overwhelmed controller instances or underutilized instances could reshape their workloads by exchanging their load, i.e., switches that they control. To safely implement this exchange procedure, switch migration protocols are being used. As the migration procedure pauses processing new flows for a few milliseconds, these protocols are designed to be as fast as possible. Faster protocols add to the agility of the network to rapidly cope with the changing demand. In this paper, we introduce a general framework, called Meta-Migration, which focuses on expediting the existing time-sensitive controller load migration protocols. Based on the observation that these protocols impose low overheads on the involved parties, we modify them in a way that they can run in parallel toward multiple candidate destinations. Unlike the usual Fixed protocols that have to decide their destinations before running the protocol, here we rely on the real-time probes that we obtain from multiple systems and commit to only one of them in the middle of the procedure. Typically, migrations can complete on sub-second timescales, but sudden traffic bursts or system-level glitches can significantly slow down these protocols. We observe that by using Meta-Migration, we can dramatically diminish these negative effects. We show theoretical justifications for why this approach improves the overall performance of the migration, namely, its mean finishing time, and the tail latency of the migration. In addition, by developing a distributed controller simulator over real physical devices, we thoroughly measure the effectiveness of this approach as well as its incurred overheads. Our testbed results show up to a 53% tail reduction in the migration time.
元迁移:通过竞争减少交换机迁移尾部延迟
分布式网络控制平面的资源管理对数据平面的性能起着至关重要的作用,从而影响到网络应用程序的性能。超负荷的控制器实例或未充分利用的实例可以通过交换它们的负载(即它们控制的交换机)来重塑它们的工作负载。为了安全地实现这个交换过程,使用了交换机迁移协议。当迁移过程暂停处理新流几毫秒时,这些协议被设计得尽可能快。更快的协议增加了网络的敏捷性,以快速应对不断变化的需求。在本文中,我们介绍了一个通用框架,称为元迁移,它的重点是加快现有的时间敏感控制器负载迁移协议。基于观察到这些协议对相关方施加了较低的开销,我们以一种可以并行运行多个候选目的地的方式修改它们。与通常的Fixed协议在运行协议之前必须决定其目的地不同,这里我们依赖于从多个系统获得的实时探测,并在过程中间只提交其中一个。通常,迁移可以在亚秒级的时间尺度内完成,但是突然的通信突发或系统级故障会显著降低这些协议的速度。我们观察到,通过使用元迁移,我们可以显著减少这些负面影响。我们从理论上证明了为什么这种方法可以提高迁移的整体性能,即平均完成时间和迁移的尾部延迟。此外,通过在真实物理设备上开发分布式控制器模拟器,我们彻底测量了这种方法的有效性及其产生的开销。我们的测试结果显示,迁移时间减少了53%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
审稿时长
4 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信