Query Expansion in Information Retrieval for Urdu Language

2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP) Pub Date : 2018-03-01 DOI:10.1109/INFRKM.2018.8464762

Imran Rasheed, H. Banka

{"title":"Query Expansion in Information Retrieval for Urdu Language","authors":"Imran Rasheed, H. Banka","doi":"10.1109/INFRKM.2018.8464762","DOIUrl":null,"url":null,"abstract":"The information retrieval system need to be upgraded constantly to meet the challenges posed by the advanced user queries as the search system becoming more sophisticated with time. These problems have been addressed extensively in recent times in several research communities to achieve quick and relevant outcome. One such approach is to augment the query where the automatic query expansion increases the precision in information retrieval even if it can cut down the results for some queries. Here, the above approach was tested with the present Urdu data collection obtained via different expansion models such as KL, Bo1 and Bo2. The current collection is quite large in size compared to other existing Urdu datasets. It comprises of 85,304 documents in a TRECschemes and 52 topics with their relevance assessment. In this paper we emphasize to enhance the retrieval model using the query expansion which is never done before on Urdu text. However, we show that a deep analysis of initial and expanded queries brings fascinating insights that could avail future research in the domain.","PeriodicalId":196731,"journal":{"name":"2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFRKM.2018.8464762","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

Abstract

The information retrieval system need to be upgraded constantly to meet the challenges posed by the advanced user queries as the search system becoming more sophisticated with time. These problems have been addressed extensively in recent times in several research communities to achieve quick and relevant outcome. One such approach is to augment the query where the automatic query expansion increases the precision in information retrieval even if it can cut down the results for some queries. Here, the above approach was tested with the present Urdu data collection obtained via different expansion models such as KL, Bo1 and Bo2. The current collection is quite large in size compared to other existing Urdu datasets. It comprises of 85,304 documents in a TRECschemes and 52 topics with their relevance assessment. In this paper we emphasize to enhance the retrieval model using the query expansion which is never done before on Urdu text. However, we show that a deep analysis of initial and expanded queries brings fascinating insights that could avail future research in the domain.

查看原文本刊更多论文

乌尔都语信息检索中的查询扩展

随着时代的发展，信息检索系统越来越复杂，需要不断地对信息检索系统进行升级，以满足用户的高级查询需求。这些问题在最近的几个研究团体中得到了广泛的解决，以获得快速和相关的结果。其中一种方法是扩展查询，其中自动查询扩展增加了信息检索的精度，即使它可能会减少某些查询的结果。在这里，用不同的扩展模型(如KL, Bo1和Bo2)获得的现有乌尔都语数据收集对上述方法进行了测试。与其他现有的乌尔都语数据集相比，当前的集合规模相当大。它包括一个trecs计划中的85,304份文件和52个主题及其相关性评估。本文着重对乌尔都语文本的检索模型进行了前所未有的查询扩展。然而，我们表明，对初始查询和扩展查询的深入分析带来了迷人的见解，可以利用该领域的未来研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP)

自引率

0.00%

发文量