Reverse Hashing for High-Speed Network Monitoring: Algorithms, Evaluation, and Applications

Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications Pub Date : 2006-04-23 DOI:10.1109/INFOCOM.2006.203

R. Schweller, Zhichun Li, Yan Chen, Yan Gao, A. Gupta, Yin Zhang, P. Dinda, M. Kao, G. Memik

{"title":"Reverse Hashing for High-Speed Network Monitoring: Algorithms, Evaluation, and Applications","authors":"R. Schweller, Zhichun Li, Yan Chen, Yan Gao, A. Gupta, Yin Zhang, P. Dinda, M. Kao, G. Memik","doi":"10.1109/INFOCOM.2006.203","DOIUrl":null,"url":null,"abstract":"A key function for network traffic monitoring and analysis is the ability to perform aggregate queries over multiple data streams. Change detection is an important primitive which can be extended to construct many aggregate queries. The recently proposed sketches (Krishnamurthy, 2003) are among the very few that can detect heavy changes online for high speed links, and thus support various aggregate queries in both temporal and spatial domains. However, it does not preserve the keys (e.g., source IP address) of flows, making it difficult to reconstruct the desired set of anomalous keys. In an earlier abstract we proposed a framework for a reversible sketch data structure that offers hope for efficient extraction of keys (Schweller, 2004). However, this scheme is only able to detect a single heavy change key and places restrictions on the statistical properties of the key space. To address these challenges, we propose an efficient reverse hashing scheme to infer the keys of culprit flows from reversible sketches. There are two phases. The first operates online, recording the packet stream in a compact representation with negligible extra memory and few extra memory accesses. Our prototype single FPGA board implementation can achieve a throughput of over 16 Gbps for 40-byte-packet streams (the worst case). The second phase identifies heavy changes and their keys from the representation in nearly real time. We evaluate our scheme using traces from large edge routers with OC-12 or higher links. Both the analytical and experimental results show that we are able to achieve online traffic monitoring and accurate change/intrusion detection over massive data streams on high speed links, all in a manner that scales to large key space size. To the best of our knowledge, our system is the first to achieve these properties simultaneously.","PeriodicalId":163725,"journal":{"name":"Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"38","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOCOM.2006.203","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 38

Abstract

A key function for network traffic monitoring and analysis is the ability to perform aggregate queries over multiple data streams. Change detection is an important primitive which can be extended to construct many aggregate queries. The recently proposed sketches (Krishnamurthy, 2003) are among the very few that can detect heavy changes online for high speed links, and thus support various aggregate queries in both temporal and spatial domains. However, it does not preserve the keys (e.g., source IP address) of flows, making it difficult to reconstruct the desired set of anomalous keys. In an earlier abstract we proposed a framework for a reversible sketch data structure that offers hope for efficient extraction of keys (Schweller, 2004). However, this scheme is only able to detect a single heavy change key and places restrictions on the statistical properties of the key space. To address these challenges, we propose an efficient reverse hashing scheme to infer the keys of culprit flows from reversible sketches. There are two phases. The first operates online, recording the packet stream in a compact representation with negligible extra memory and few extra memory accesses. Our prototype single FPGA board implementation can achieve a throughput of over 16 Gbps for 40-byte-packet streams (the worst case). The second phase identifies heavy changes and their keys from the representation in nearly real time. We evaluate our scheme using traces from large edge routers with OC-12 or higher links. Both the analytical and experimental results show that we are able to achieve online traffic monitoring and accurate change/intrusion detection over massive data streams on high speed links, all in a manner that scales to large key space size. To the best of our knowledge, our system is the first to achieve these properties simultaneously.

查看原文本刊更多论文

高速网络监控的反向哈希:算法，评估和应用

网络流量监控和分析的一个关键功能是对多个数据流执行聚合查询的能力。变更检测是一个重要的原语，可以扩展到构造许多聚合查询。最近提出的草图(Krishnamurthy, 2003)是为数不多的能够检测高速链接的在线重大变化的草图之一，因此支持时间和空间域的各种聚合查询。然而，它不保留流的密钥(例如，源IP地址)，因此很难重建所需的异常密钥集。在早期的摘要中，我们提出了一个可逆草图数据结构框架，为有效提取密钥提供了希望(Schweller, 2004)。然而，该方案只能检测单个重更改键，并对键空间的统计属性进行了限制。为了解决这些挑战，我们提出了一种有效的反向哈希方案，从可逆草图中推断出罪魁祸首流的密钥。有两个阶段。第一种是在线操作，以紧凑的形式记录数据包流，无需额外的内存和很少的额外内存访问。我们的原型单FPGA板实现对于40字节包流(最坏的情况)可以实现超过16 Gbps的吞吐量。第二阶段几乎实时地从表示中识别重大更改及其键。我们使用具有OC-12或更高链路的大型边缘路由器的跟踪来评估我们的方案。分析和实验结果都表明，我们能够在高速链路上的大量数据流上实现在线流量监控和准确的变化/入侵检测，所有这些都可以扩展到大密钥空间大小。据我们所知，我们的系统是第一个同时实现这些特性的系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications

自引率

0.00%

发文量