{"title":"采用基于fpga的重锤检测的云级单流量反压系统","authors":"Enge Song, Nianbing Yu, Tian Pan, Liang Xu, Yisong Qiao, Jianyuan Lu, Yilong Lv, Xiaoyu Zhang, Mingxu Xie, Jian Guo, Jun He, Jinkui Mao, Chenhao Jia, Shunmin Zhu","doi":"10.1145/3472716.3472855","DOIUrl":null,"url":null,"abstract":"Virtual private clouds provide sharing resources to a massive number of tenants for economics of scale. In such clouds, off-the-shelf x86 boxes are widely deployed as network intermediate nodes. However, due to rapid growth of cloud traffic and significant slowdown of CPU improvement in recent years, although horizontal scaling is still leveraged, CPU overload and packet losses caused by heavy hitters are occasionally observed in production environment, which seriously damage tenant's SLAs. To address this, we propose a cloud-scale per-flow backpressure system designed in Alibaba Cloud. The basic idea is to (1) trigger the heavy-hitter flow acquisition at the intermediate node in an on-demand manner only when the CPU utilization exceeds a predefined threshold and (2) backpressure the identified heavy-hitter flow to the traffic source via rate limiting at sender's NIC or hypervisor. To handle the extremely large traffic rate of cloud traffic, we leverage a high-speed FPGA for heavy hitter detection acceleration. To accommodate highly concurrent flows in the cloud, we design a hierarchical memory system for accurate heavy hitter counting during a large time window. Under the per-flow backpressure mechanism, the rate of the heavy-hitter flow is accurately throttled while the rate of mice flows is completely unaffected during the backpressure.","PeriodicalId":178725,"journal":{"name":"Proceedings of the SIGCOMM '21 Poster and Demo Sessions","volume":"28 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A cloud-scale per-flow backpressure system via FPGA-based heavy hitter detection\",\"authors\":\"Enge Song, Nianbing Yu, Tian Pan, Liang Xu, Yisong Qiao, Jianyuan Lu, Yilong Lv, Xiaoyu Zhang, Mingxu Xie, Jian Guo, Jun He, Jinkui Mao, Chenhao Jia, Shunmin Zhu\",\"doi\":\"10.1145/3472716.3472855\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Virtual private clouds provide sharing resources to a massive number of tenants for economics of scale. In such clouds, off-the-shelf x86 boxes are widely deployed as network intermediate nodes. However, due to rapid growth of cloud traffic and significant slowdown of CPU improvement in recent years, although horizontal scaling is still leveraged, CPU overload and packet losses caused by heavy hitters are occasionally observed in production environment, which seriously damage tenant's SLAs. To address this, we propose a cloud-scale per-flow backpressure system designed in Alibaba Cloud. The basic idea is to (1) trigger the heavy-hitter flow acquisition at the intermediate node in an on-demand manner only when the CPU utilization exceeds a predefined threshold and (2) backpressure the identified heavy-hitter flow to the traffic source via rate limiting at sender's NIC or hypervisor. To handle the extremely large traffic rate of cloud traffic, we leverage a high-speed FPGA for heavy hitter detection acceleration. To accommodate highly concurrent flows in the cloud, we design a hierarchical memory system for accurate heavy hitter counting during a large time window. Under the per-flow backpressure mechanism, the rate of the heavy-hitter flow is accurately throttled while the rate of mice flows is completely unaffected during the backpressure.\",\"PeriodicalId\":178725,\"journal\":{\"name\":\"Proceedings of the SIGCOMM '21 Poster and Demo Sessions\",\"volume\":\"28 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the SIGCOMM '21 Poster and Demo Sessions\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3472716.3472855\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the SIGCOMM '21 Poster and Demo Sessions","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3472716.3472855","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A cloud-scale per-flow backpressure system via FPGA-based heavy hitter detection
Virtual private clouds provide sharing resources to a massive number of tenants for economics of scale. In such clouds, off-the-shelf x86 boxes are widely deployed as network intermediate nodes. However, due to rapid growth of cloud traffic and significant slowdown of CPU improvement in recent years, although horizontal scaling is still leveraged, CPU overload and packet losses caused by heavy hitters are occasionally observed in production environment, which seriously damage tenant's SLAs. To address this, we propose a cloud-scale per-flow backpressure system designed in Alibaba Cloud. The basic idea is to (1) trigger the heavy-hitter flow acquisition at the intermediate node in an on-demand manner only when the CPU utilization exceeds a predefined threshold and (2) backpressure the identified heavy-hitter flow to the traffic source via rate limiting at sender's NIC or hypervisor. To handle the extremely large traffic rate of cloud traffic, we leverage a high-speed FPGA for heavy hitter detection acceleration. To accommodate highly concurrent flows in the cloud, we design a hierarchical memory system for accurate heavy hitter counting during a large time window. Under the per-flow backpressure mechanism, the rate of the heavy-hitter flow is accurately throttled while the rate of mice flows is completely unaffected during the backpressure.