{"title":"An MDP-based peer-to-peer search server network","authors":"Y. Shen, Lee","doi":"10.1109/WISE.2002.1181663","DOIUrl":null,"url":null,"abstract":"A distributed search system consists of a large number of autonomous search servers logically connected in a peer-to-peer network. Each search server maintains a local index of a collection of documents available at the server or on other peer machines. When a query is received by any server in the network, a distributed search process determines the most relevant search servers and redirects the query to them for processing. We model the distributed search process as Markov decision processes (MDPs). The estimated relevance of a server to a query is regarded as the reward in the MDP model. Once the MDP policies representing the global knowledge are obtained at each server through asynchronous value iteration, the most relevant servers to a given query can be efficiently identified despite the lack of centralized control and global knowledge at each autonomous server. We discuss the implementation and complexity of the asynchronous value iteration and how we extend the traditional MDP to handle the multiple-access policy (i.e., more than one optimal server is returned) and queries with multiple terms. Finally, experiments are conducted using the TREC collection. We show that the MDP-based distributed search can achieve results very close to that of a centralized search.","PeriodicalId":392999,"journal":{"name":"Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002.","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISE.2002.1181663","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
A distributed search system consists of a large number of autonomous search servers logically connected in a peer-to-peer network. Each search server maintains a local index of a collection of documents available at the server or on other peer machines. When a query is received by any server in the network, a distributed search process determines the most relevant search servers and redirects the query to them for processing. We model the distributed search process as Markov decision processes (MDPs). The estimated relevance of a server to a query is regarded as the reward in the MDP model. Once the MDP policies representing the global knowledge are obtained at each server through asynchronous value iteration, the most relevant servers to a given query can be efficiently identified despite the lack of centralized control and global knowledge at each autonomous server. We discuss the implementation and complexity of the asynchronous value iteration and how we extend the traditional MDP to handle the multiple-access policy (i.e., more than one optimal server is returned) and queries with multiple terms. Finally, experiments are conducted using the TREC collection. We show that the MDP-based distributed search can achieve results very close to that of a centralized search.