Indexing techniques for file sharing in scalable peer-to-peer networks

Proceedings. Eleventh International Conference on Computer Communications and Networks | Pub Date: 2002-12-10 | DOI: 10.1109/ICCCN.2002.1043038

Fred S. Annexstein, K. Berman, M. Jovanovic, Kovendhan Ponnavaikko

Citations: 9

Abstract

File sharing is a very popular service provided by peer-to-peer (P2P) networks. In a P2P file-sharing network, users share files and issue queries to the network to find the locations of files residing at other peer nodes. Recently, proxy-enabled peers, or supernodes, have been incorporated to enhance scalability by providing indexing services to nodes on slower network connections. Typically, supernodes build a vector or multi-index of shared files stored on other (slower) peer nodes connected to them. We consider a new model whereby the index tables of individual nodes are merged into a single data structure stored by the supernode. We analyze this model in relation to the standard vectorized data structure. We compare the performance of these supernode indexing algorithms and provide a theoretical analysis that is asymptotic and probabilistic in nature. However, there are several significant constant factors that the theory does not account for, and which are important for designing an optimal system solution. We report on a series of simulation experiments which provide verification of the asymptotic analysis of the formal framework and tools to determine the magnitude of the constant factors. Our general conclusion is that when the query rate exceeds the rate of data updates, the new merged model is preferable to the vector model. However, the details of our analysis allow us to consider combinations of several parameters, and thereby enable the design of optimal indexing schemes via the incorporation of measurements of the parameters of particular applications.
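The contrast between the two supernode index layouts described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the paper's actual data structures, so everything here is an assumption made for illustration: the class names `VectorIndex` and `MergedIndex`, the use of Python sets and dictionaries, and the per-peer bookkeeping in `shared` are all hypothetical. The sketch only shows the general trade-off: a vector of per-peer tables is cheap to update but must be probed table by table on a query, whereas a merged structure answers a query with one lookup but does more work when a peer's file list changes.

```python
from collections import defaultdict


class VectorIndex:
    """Hypothetical vector model: one index table per connected peer."""

    def __init__(self):
        self.tables = {}  # peer id -> set of file names shared by that peer

    def update(self, peer, files):
        # A peer's update simply replaces that peer's own table.
        self.tables[peer] = set(files)

    def query(self, name):
        # Every peer table is probed, so query cost grows with the peer count.
        return {peer for peer, files in self.tables.items() if name in files}


class MergedIndex:
    """Hypothetical merged model: all peer tables folded into one map."""

    def __init__(self):
        self.locations = defaultdict(set)  # file name -> ids of peers holding it
        self.shared = {}                   # peer id -> file names last reported

    def update(self, peer, files):
        new = set(files)
        old = self.shared.get(peer, set())
        # Withdraw entries the peer no longer shares.
        for name in old - new:
            self.locations[name].discard(peer)
            if not self.locations[name]:
                del self.locations[name]
        # Add entries the peer has started sharing.
        for name in new - old:
            self.locations[name].add(peer)
        self.shared[peer] = new

    def query(self, name):
        # A single lookup in the merged structure, independent of peer count.
        return set(self.locations.get(name, ()))


if __name__ == "__main__":
    for index in (VectorIndex(), MergedIndex()):
        index.update("peer_a", ["song.mp3", "paper.pdf"])
        index.update("peer_b", ["song.mp3"])
        index.update("peer_a", ["paper.pdf"])  # peer_a stops sharing song.mp3
        print(type(index).__name__, sorted(index.query("song.mp3")))
```

Both classes return the same answers; they differ only in where the work is done. In this sketch the merged layout shifts cost from `query` to `update`, which is consistent with the abstract's conclusion that the merged model is preferable when queries outnumber updates, although the actual constant factors would have to be measured for a particular application.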
