An MDP-based peer-to-peer search server network

Proceedings of the Third International Conference on Web Information Systems Engineering, 2002 (WISE 2002). Pub Date: 2002-12-12. DOI: 10.1109/WISE.2002.1181663

Y. Shen, Lee


Citations: 7

Abstract

A distributed search system consists of a large number of autonomous search servers logically connected in a peer-to-peer network. Each search server maintains a local index of a collection of documents available at the server or on other peer machines. When a query is received by any server in the network, a distributed search process determines the most relevant search servers and redirects the query to them for processing. We model the distributed search process as a set of Markov decision processes (MDPs), in which the estimated relevance of a server to a query is regarded as the reward. Once the MDP policies representing the global knowledge are obtained at each server through asynchronous value iteration, the servers most relevant to a given query can be identified efficiently, even though no autonomous server has centralized control or global knowledge. We discuss the implementation and complexity of the asynchronous value iteration, and how we extend the traditional MDP to handle a multiple-access policy (i.e., more than one optimal server is returned) and queries with multiple terms. Finally, experiments are conducted using the TREC collection. We show that the MDP-based distributed search can achieve results very close to those of a centralized search.
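The abstract does not give the paper's state, action, or reward definitions, so the following is only a minimal sketch of asynchronous value iteration under assumptions of our own: the state is the server currently holding the query, an action forwards the query to one neighbor, the reward is a hypothetical per-server relevance score for a single query term, and `gamma` is an assumed discount factor. The function name, the three-server topology, and all numbers are illustrative and not taken from the paper; the multiple-access policy and multi-term queries are not covered.

```python
import random


def async_value_iteration(neighbors, reward, gamma=0.5, sweeps=200, seed=0):
    """Asynchronous (in-place) value iteration on a deterministic MDP.

    neighbors: dict mapping each server to a non-empty list of peers.
    reward:    dict mapping each server to its assumed relevance score.
    """
    rng = random.Random(seed)
    servers = list(neighbors)
    value = {s: 0.0 for s in servers}
    for _ in range(sweeps):
        # Update one server at a time, in arbitrary order, using whatever
        # neighbor values are currently stored (no global synchronization).
        rng.shuffle(servers)
        for s in servers:
            value[s] = max(reward[n] + gamma * value[n] for n in neighbors[s])
    # Greedy policy: the neighbor each server should forward the query to.
    policy = {
        s: max(neighbors[s], key=lambda n: reward[n] + gamma * value[n])
        for s in servers
    }
    return value, policy


# Hypothetical line topology A - B - C with made-up relevance scores.
neighbors = {"A": ["B"], "B": ["A", "C"], "C": ["B"]}
reward = {"A": 0.1, "B": 0.2, "C": 0.9}
value, policy = async_value_iteration(neighbors, reward)
print(policy)
```

In this toy network, server B learns to forward toward C, the most relevant server, using only values exchanged with its direct neighbors. Each update reads only a server's own neighbor list, which is the property that lets such a computation run without centralized control.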
