Web搜索引擎动态倒排索引的多线程处理

C. Bonacic, Danilo Bustos, V. Gil-Costa, Mauricio Marín, Victor Sepulveda
{"title":"Web搜索引擎动态倒排索引的多线程处理","authors":"C. Bonacic, Danilo Bustos, V. Gil-Costa, Mauricio Marín, Victor Sepulveda","doi":"10.1145/2809948.2809952","DOIUrl":null,"url":null,"abstract":"Processing queries in Web search engines demands the efficient use of hardware resources to cope with the scale and dynamics of user traffic. This paper focuses on the multithreaded processing of queries that requires (1) accessing a large inverted index data structure to obtain a set of documents, (2) rank them by executing the WAND operator in order to obtain the top K most pertinent documents for the query, and (3) resolve the insertion of new documents on the inverted index concurrently with the execution of queries. We propose an efficient strategy to assign threads to queries and index update operations which is suitable to support updates on the index concurrently with query processing. The core of our proposal is a simple classification technique devised to quickly assign threads to query operations.","PeriodicalId":142249,"journal":{"name":"Proceedings of the 2015 Workshop on Large-Scale and Distributed System for Information Retrieval","volume":"31 9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Multithreaded Processing in Dynamic Inverted Indexes for Web Search Engines\",\"authors\":\"C. Bonacic, Danilo Bustos, V. Gil-Costa, Mauricio Marín, Victor Sepulveda\",\"doi\":\"10.1145/2809948.2809952\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Processing queries in Web search engines demands the efficient use of hardware resources to cope with the scale and dynamics of user traffic. This paper focuses on the multithreaded processing of queries that requires (1) accessing a large inverted index data structure to obtain a set of documents, (2) rank them by executing the WAND operator in order to obtain the top K most pertinent documents for the query, and (3) resolve the insertion of new documents on the inverted index concurrently with the execution of queries. We propose an efficient strategy to assign threads to queries and index update operations which is suitable to support updates on the index concurrently with query processing. The core of our proposal is a simple classification technique devised to quickly assign threads to query operations.\",\"PeriodicalId\":142249,\"journal\":{\"name\":\"Proceedings of the 2015 Workshop on Large-Scale and Distributed System for Information Retrieval\",\"volume\":\"31 9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-10-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2015 Workshop on Large-Scale and Distributed System for Information Retrieval\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2809948.2809952\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2015 Workshop on Large-Scale and Distributed System for Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2809948.2809952","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

摘要

在Web搜索引擎中处理查询需要有效地利用硬件资源来应对用户流量的规模和动态。本文主要研究查询的多线程处理,它需要(1)访问大型倒排索引数据结构以获取一组文档,(2)通过执行WAND运算符对这些文档进行排序,以获得与查询最相关的前K个文档,以及(3)在执行查询的同时解决在倒排索引上插入新文档的问题。我们提出了一种有效的策略,将线程分配给查询和索引更新操作,该策略适合于在查询处理的同时支持索引更新。我们建议的核心是一种简单的分类技术,用于快速为查询操作分配线程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multithreaded Processing in Dynamic Inverted Indexes for Web Search Engines
Processing queries in Web search engines demands the efficient use of hardware resources to cope with the scale and dynamics of user traffic. This paper focuses on the multithreaded processing of queries that requires (1) accessing a large inverted index data structure to obtain a set of documents, (2) rank them by executing the WAND operator in order to obtain the top K most pertinent documents for the query, and (3) resolve the insertion of new documents on the inverted index concurrently with the execution of queries. We propose an efficient strategy to assign threads to queries and index update operations which is suitable to support updates on the index concurrently with query processing. The core of our proposal is a simple classification technique devised to quickly assign threads to query operations.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信