Identifying Frequent Items in P2P Systems

2008 The 28th International Conference on Distributed Computing Systems Pub Date : 2008-06-17 DOI:10.1109/ICDCS.2008.78

Mei Li, Wang-Chien Lee

引用次数: 15

Abstract

As peer-to-peer (P2P) systems receive growing acceptance, the need of identifying 'frequent items' in such systems appears in a variety of applications. In this paper, we define the problem of identifying frequent items (IFI) and propose an efficient in-network processing technique, called in-network filtering (netFilter), to address this important fundamental problem. netFilter operates in two phases: 1) candidate filtering: data items are grouped into item groups to obtain aggregates for pruning of infrequent items; and 2) candidate verification: the aggregates for the remaining candidate items are obtained to filter out false frequent items. We address various issues faced in realizing netFilter, including aggregate computation, candidate set optimization, and candidate set materialization. In addition, we analyze the performance of netFilter, derive the optimal setting analytically, and discuss how to achieve the optimal setting in practice. Finally, we validate the effectiveness of netFilter through extensive simulation.

查看原文本刊更多论文

识别P2P系统中的频繁项目

随着点对点(P2P)系统得到越来越多的认可，在这种系统中识别“频繁项”的需求出现在各种应用中。在本文中，我们定义了识别频繁项(IFI)的问题，并提出了一种有效的网络内处理技术，称为网络内过滤(netFilter)，以解决这一重要的基本问题。netFilter的工作分为两个阶段:1)候选过滤:将数据项分组到项组中，获得聚合，对不常见的项进行修剪;2)候选验证:得到剩余候选项的聚合，过滤掉虚假频繁项。我们解决了实现netFilter时面临的各种问题，包括聚合计算、候选集优化和候选集物化。此外，我们对netFilter的性能进行了分析，得出了最优设置，并讨论了在实践中如何实现最优设置。最后，我们通过大量的仿真验证了netFilter的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 The 28th International Conference on Distributed Computing Systems

自引率

0.00%

发文量