Distributed query processing using partitioned inverted files

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI:10.1109/SPIRE.2001.989733

C. Badue, Ricardo Baeza-Yates, B. Ribeiro-Neto, N. Ziviani

引用次数: 115

Abstract

In this paper; we study query processing in a distributed text database. The novelty is a real distributed architecture implementation that oflers concurrent query service. The distributed system adopts a network of workstations model and the client-server paradigm. The document collection is indexed with an imerted$le. We adopt two distinct strategies of index partitioning in the distributed system, namely local index partitioning and global indexpartitioning. In both strategies, documents are ranked using the vector space model along with a documentfiltering technique for fast ranking. We evaluate and compare the impact of the two index partitioning strategies on query processing per$ormance. Experimental results on retrieval eficiency show that, within our framework, the global index partitioning outpe~orms the local index partitioning.

查看原文本刊更多论文

使用分区倒置文件的分布式查询处理

在本文中;我们研究了分布式文本数据库中的查询处理。新颖之处是提供并发查询服务的真正分布式体系结构实现。分布式系统采用工作站网络模型和客户端-服务器模式。文档集合使用插入的$le进行索引。在分布式系统中，我们采用了两种不同的索引分区策略，即本地索引分区和全局索引分区。在这两种策略中，使用向量空间模型和文档过滤技术对文档进行快速排序。我们评估和比较了两种索引分区策略对查询处理性能的影响。在检索效率方面的实验结果表明，在我们的框架内，全局索引分区优于局部索引分区。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings Eighth Symposium on String Processing and Information Retrieval

自引率

0.00%

发文量