FlowShader: gpu加速VNF流处理的通用框架

2019 IEEE 27th International Conference on Network Protocols (ICNP) Pub Date : 2019-10-08 DOI:10.1109/ICNP.2019.8888129

Xiaodong Yi, Junjie Wang, Jingpu Duan, Wei Bai, Chuan Wu, Y. Xiong, Dongsu Han

{"title":"FlowShader: gpu加速VNF流处理的通用框架","authors":"Xiaodong Yi, Junjie Wang, Jingpu Duan, Wei Bai, Chuan Wu, Y. Xiong, Dongsu Han","doi":"10.1109/ICNP.2019.8888129","DOIUrl":null,"url":null,"abstract":"GPU acceleration has been widely investigated for packet processing in virtual network functions (NFs), but not for L7 flow-processing NFs. In L7 NFs, reassembled TCP messages of the same flow should be processed in order in the same processing thread, and the uneven sizes among flows pose a major challenge for full realization of GPU’s parallel computation power.To exploit GPUs for L7 NF processing, this paper presents FlowShader, a GPU acceleration framework to achieve both high generality and throughput even under skewed flow size distributions. We carefully design an efficient scheduling algorithm that fully exploits available GPU and CPU capacities; in particular, we dispatch large flows which seriously break up the size balance to CPU and the rest of flows to GPU. Furthermore, FlowShader allows similar NF logic (as CPU-based NFs) to run on individual threads in a GPU, which is more generalized and easy to take on as compared to redesigning an NF for operation parallelism on GPU. We implemented a number of L7 flow processing NFs based on FlowShader. Evaluations are conducted under both synthetic and real-world traffic traces and results show that the throughput achieved by FlowShader is up to 6x that of the CPU-only baseline and 3x of the GPU-only design.","PeriodicalId":385397,"journal":{"name":"2019 IEEE 27th International Conference on Network Protocols (ICNP)","volume":"117 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"FlowShader: a Generalized Framework for GPU-accelerated VNF Flow Processing\",\"authors\":\"Xiaodong Yi, Junjie Wang, Jingpu Duan, Wei Bai, Chuan Wu, Y. Xiong, Dongsu Han\",\"doi\":\"10.1109/ICNP.2019.8888129\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"GPU acceleration has been widely investigated for packet processing in virtual network functions (NFs), but not for L7 flow-processing NFs. In L7 NFs, reassembled TCP messages of the same flow should be processed in order in the same processing thread, and the uneven sizes among flows pose a major challenge for full realization of GPU’s parallel computation power.To exploit GPUs for L7 NF processing, this paper presents FlowShader, a GPU acceleration framework to achieve both high generality and throughput even under skewed flow size distributions. We carefully design an efficient scheduling algorithm that fully exploits available GPU and CPU capacities; in particular, we dispatch large flows which seriously break up the size balance to CPU and the rest of flows to GPU. Furthermore, FlowShader allows similar NF logic (as CPU-based NFs) to run on individual threads in a GPU, which is more generalized and easy to take on as compared to redesigning an NF for operation parallelism on GPU. We implemented a number of L7 flow processing NFs based on FlowShader. Evaluations are conducted under both synthetic and real-world traffic traces and results show that the throughput achieved by FlowShader is up to 6x that of the CPU-only baseline and 3x of the GPU-only design.\",\"PeriodicalId\":385397,\"journal\":{\"name\":\"2019 IEEE 27th International Conference on Network Protocols (ICNP)\",\"volume\":\"117 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 27th International Conference on Network Protocols (ICNP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNP.2019.8888129\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 27th International Conference on Network Protocols (ICNP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNP.2019.8888129","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

摘要

GPU加速在虚拟网络函数(NFs)中的数据包处理方面已经得到了广泛的研究，但在L7流处理NFs方面还没有得到广泛的研究。在L7 NFs中，同一流的重组TCP消息需要在同一处理线程中依次处理，流之间的大小不均匀对GPU的并行计算能力的充分实现提出了重大挑战。为了利用GPU进行L7 NF处理，本文提出了FlowShader，一个GPU加速框架，即使在扭曲的流量大小分布下也能实现高通用性和吞吐量。我们精心设计了一个高效的调度算法，充分利用可用的GPU和CPU容量;特别是，我们调度的大流量严重破坏了CPU的大小平衡，而其余的流量则分配给GPU。此外，FlowShader允许类似的NF逻辑(作为基于cpu的NFs)在GPU的单个线程上运行，这比在GPU上为操作并行性重新设计NF更通用，更容易实现。我们基于FlowShader实现了许多L7流处理NFs。在合成和真实流量跟踪下进行评估，结果表明FlowShader实现的吞吐量高达仅cpu基线的6倍和仅gpu设计的3倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

FlowShader: a Generalized Framework for GPU-accelerated VNF Flow Processing

GPU acceleration has been widely investigated for packet processing in virtual network functions (NFs), but not for L7 flow-processing NFs. In L7 NFs, reassembled TCP messages of the same flow should be processed in order in the same processing thread, and the uneven sizes among flows pose a major challenge for full realization of GPU’s parallel computation power.To exploit GPUs for L7 NF processing, this paper presents FlowShader, a GPU acceleration framework to achieve both high generality and throughput even under skewed flow size distributions. We carefully design an efficient scheduling algorithm that fully exploits available GPU and CPU capacities; in particular, we dispatch large flows which seriously break up the size balance to CPU and the rest of flows to GPU. Furthermore, FlowShader allows similar NF logic (as CPU-based NFs) to run on individual threads in a GPU, which is more generalized and easy to take on as compared to redesigning an NF for operation parallelism on GPU. We implemented a number of L7 flow processing NFs based on FlowShader. Evaluations are conducted under both synthetic and real-world traffic traces and results show that the throughput achieved by FlowShader is up to 6x that of the CPU-only baseline and 3x of the GPU-only design.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE 27th International Conference on Network Protocols (ICNP)

自引率

0.00%

发文量