具有有界最坏情况性能的FPGA可更新分组分类

2022 IEEE Symposium on High-Performance Interconnects (HOTI) Pub Date : 2022-08-01 DOI:10.1109/HOTI55740.2022.00019

Yao Xin, Wen J. Li, Gaogang Xie, Yang Xu, Yi Wang

{"title":"具有有界最坏情况性能的FPGA可更新分组分类","authors":"Yao Xin, Wen J. Li, Gaogang Xie, Yang Xu, Yi Wang","doi":"10.1109/HOTI55740.2022.00019","DOIUrl":null,"url":null,"abstract":"FPGA has been recognized as an attractive acceler-ator for line-speed packet classification in SmartNIC due to its ability to reconfigure and provide massive parallelism. As a promising algorithmic approach that can fully exploit the FPGA characteristics, decision tree based packet classification on FPGA has been actively investigated in the past decade. However, most of them suffer from unbalanced tree structures with unpredictable depths under certain rule sets, so the potential of FPGA may not be brought into full play. Worse still, few of them can support efficient rule updates on-the-fly, which is highly required in virtualized data centers. To address these issues, we design and implement an efficient hardware ar-chitecture based on the recently proposed KickTree algorithm, which consists of multiple balanced trees with bounded depth. A strategy of multi-PE (processing element), parallel search, and serial update is adopted to decouple the search and update process. The parsing of multiple tree search results adopts a modular and hierarchical design, supporting architecture with various tree numbers. Additionally, incremental rule updates can be achieved simply by traversing all PEs in one pass, with little and bounded impact on rule searching. Experimental results on FPGA show that our design can achieve an average classification throughput of 182.6 MPPS and an average update throughput of 3.1 MUPS for various 100k-scale rule sets.","PeriodicalId":115402,"journal":{"name":"2022 IEEE Symposium on High-Performance Interconnects (HOTI)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Updatable Packet Classification on FPGA with Bounded Worst-Case Performance\",\"authors\":\"Yao Xin, Wen J. Li, Gaogang Xie, Yang Xu, Yi Wang\",\"doi\":\"10.1109/HOTI55740.2022.00019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"FPGA has been recognized as an attractive acceler-ator for line-speed packet classification in SmartNIC due to its ability to reconfigure and provide massive parallelism. As a promising algorithmic approach that can fully exploit the FPGA characteristics, decision tree based packet classification on FPGA has been actively investigated in the past decade. However, most of them suffer from unbalanced tree structures with unpredictable depths under certain rule sets, so the potential of FPGA may not be brought into full play. Worse still, few of them can support efficient rule updates on-the-fly, which is highly required in virtualized data centers. To address these issues, we design and implement an efficient hardware ar-chitecture based on the recently proposed KickTree algorithm, which consists of multiple balanced trees with bounded depth. A strategy of multi-PE (processing element), parallel search, and serial update is adopted to decouple the search and update process. The parsing of multiple tree search results adopts a modular and hierarchical design, supporting architecture with various tree numbers. Additionally, incremental rule updates can be achieved simply by traversing all PEs in one pass, with little and bounded impact on rule searching. Experimental results on FPGA show that our design can achieve an average classification throughput of 182.6 MPPS and an average update throughput of 3.1 MUPS for various 100k-scale rule sets.\",\"PeriodicalId\":115402,\"journal\":{\"name\":\"2022 IEEE Symposium on High-Performance Interconnects (HOTI)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE Symposium on High-Performance Interconnects (HOTI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HOTI55740.2022.00019\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Symposium on High-Performance Interconnects (HOTI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HOTI55740.2022.00019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

FPGA已被认为是一个有吸引力的加速器线速分组分类在SmartNIC由于它的能力重新配置和提供大量的并行性。基于决策树的分组分类作为一种很有前途的算法方法，能够充分利用FPGA的特点，在FPGA上得到了积极的研究。然而，在某些规则集下，它们大多存在不平衡的树结构和不可预测的深度，因此FPGA的潜力可能无法得到充分发挥。更糟糕的是，它们中很少能够支持高效的动态规则更新，而这在虚拟化数据中心中是非常需要的。为了解决这些问题，我们基于最近提出的KickTree算法设计并实现了一个高效的硬件架构，该算法由多个有界深度的平衡树组成。采用多pe(处理单元)、并行搜索和串行更新策略，实现了搜索和更新过程的解耦。多树搜索结果的解析采用模块化、层次化设计，支持多种树数的体系结构。此外，增量规则更新可以简单地通过一次遍历所有pe来实现，对规则搜索的影响很小且有限。在FPGA上的实验结果表明，我们的设计可以实现各种100k规模规则集的平均分类吞吐量为182.6 MPPS，平均更新吞吐量为3.1 MUPS。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Updatable Packet Classification on FPGA with Bounded Worst-Case Performance

FPGA has been recognized as an attractive acceler-ator for line-speed packet classification in SmartNIC due to its ability to reconfigure and provide massive parallelism. As a promising algorithmic approach that can fully exploit the FPGA characteristics, decision tree based packet classification on FPGA has been actively investigated in the past decade. However, most of them suffer from unbalanced tree structures with unpredictable depths under certain rule sets, so the potential of FPGA may not be brought into full play. Worse still, few of them can support efficient rule updates on-the-fly, which is highly required in virtualized data centers. To address these issues, we design and implement an efficient hardware ar-chitecture based on the recently proposed KickTree algorithm, which consists of multiple balanced trees with bounded depth. A strategy of multi-PE (processing element), parallel search, and serial update is adopted to decouple the search and update process. The parsing of multiple tree search results adopts a modular and hierarchical design, supporting architecture with various tree numbers. Additionally, incremental rule updates can be achieved simply by traversing all PEs in one pass, with little and bounded impact on rule searching. Experimental results on FPGA show that our design can achieve an average classification throughput of 182.6 MPPS and an average update throughput of 3.1 MUPS for various 100k-scale rule sets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE Symposium on High-Performance Interconnects (HOTI)

自引率

0.00%

发文量