Dingyu Yan , Yaping Liu , Shuo Zhang , Mingguang Xu , Zhikai Yang , Binxing Fang
{"title":"LRCC: Long-haul RDMA congestion control for cross-datacenter networks","authors":"Dingyu Yan , Yaping Liu , Shuo Zhang , Mingguang Xu , Zhikai Yang , Binxing Fang","doi":"10.1016/j.comnet.2025.111756","DOIUrl":null,"url":null,"abstract":"<div><div>With the widespread deployment of applications such as cloud storage and distributed model training, Remote Direct Memory Access (RDMA) is increasingly applied to cross-datacenter networks. These networks typically consist of multiple regional datacenters interconnected by dedicated long-haul optical fiber and Data Center Interconnect (DCI) switches. However, existing RDMA congestion control mechanisms face significant challenges in cross-datacenter networks. Firstly, the long control loops struggle to effectively suppress line-rate bursts of cross-domain RDMA traffic, leading to persistent queues that degrade overall network performance. Secondly, the heterogeneous Round-Trip Time (RTT) characteristics between cross-domain and intra-datacenter traffic disrupt the convergence and fairness guarantees of conventional methods, further exacerbating cross-domain congestion issues. In this paper, we propose a switch-driven Long-haul RDMA Congestion Control (LRCC). LRCC utilizes near-source switches to generate congestion notification packets, effectively shortening the long control loops. Furthermore, LRCC implements a precise fair-rate computation mechanism on the switches and an adaptive rate-increase strategy on the host. These mechanisms mitigate cross-domain congestion caused by hybrid traffic while ensuring high throughput for long-haul flows. We implemented a prototype system of LRCC on programmable switches and 400Gbps FPGA NICs. Testbed experiments show that, compared with the NVIDIA CX7, LRCC reduces tail latency by 11 %-16 % in short-distance congestion scenarios and by 45 %-49 % in a 640 km long-distance scenario. Large-scale simulations further demonstrate that in the cross-datacenter networks, LRCC outperforms existing solutions, reducing the average Flow Completion Time (FCT) by up to 67.2 %, 94 % and 48.4 %, respectively, compared to DCQCN, HPCC and BiCC.</div></div>","PeriodicalId":50637,"journal":{"name":"Computer Networks","volume":"273 ","pages":"Article 111756"},"PeriodicalIF":4.6000,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1389128625007224","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
With the widespread deployment of applications such as cloud storage and distributed model training, Remote Direct Memory Access (RDMA) is increasingly applied to cross-datacenter networks. These networks typically consist of multiple regional datacenters interconnected by dedicated long-haul optical fiber and Data Center Interconnect (DCI) switches. However, existing RDMA congestion control mechanisms face significant challenges in cross-datacenter networks. Firstly, the long control loops struggle to effectively suppress line-rate bursts of cross-domain RDMA traffic, leading to persistent queues that degrade overall network performance. Secondly, the heterogeneous Round-Trip Time (RTT) characteristics between cross-domain and intra-datacenter traffic disrupt the convergence and fairness guarantees of conventional methods, further exacerbating cross-domain congestion issues. In this paper, we propose a switch-driven Long-haul RDMA Congestion Control (LRCC). LRCC utilizes near-source switches to generate congestion notification packets, effectively shortening the long control loops. Furthermore, LRCC implements a precise fair-rate computation mechanism on the switches and an adaptive rate-increase strategy on the host. These mechanisms mitigate cross-domain congestion caused by hybrid traffic while ensuring high throughput for long-haul flows. We implemented a prototype system of LRCC on programmable switches and 400Gbps FPGA NICs. Testbed experiments show that, compared with the NVIDIA CX7, LRCC reduces tail latency by 11 %-16 % in short-distance congestion scenarios and by 45 %-49 % in a 640 km long-distance scenario. Large-scale simulations further demonstrate that in the cross-datacenter networks, LRCC outperforms existing solutions, reducing the average Flow Completion Time (FCT) by up to 67.2 %, 94 % and 48.4 %, respectively, compared to DCQCN, HPCC and BiCC.
期刊介绍:
Computer Networks is an international, archival journal providing a publication vehicle for complete coverage of all topics of interest to those involved in the computer communications networking area. The audience includes researchers, managers and operators of networks as well as designers and implementors. The Editorial Board will consider any material for publication that is of interest to those groups.