{"title":"Query Expansion in Information Retrieval for Urdu Language","authors":"Imran Rasheed, H. Banka","doi":"10.1109/INFRKM.2018.8464762","DOIUrl":null,"url":null,"abstract":"The information retrieval system need to be upgraded constantly to meet the challenges posed by the advanced user queries as the search system becoming more sophisticated with time. These problems have been addressed extensively in recent times in several research communities to achieve quick and relevant outcome. One such approach is to augment the query where the automatic query expansion increases the precision in information retrieval even if it can cut down the results for some queries. Here, the above approach was tested with the present Urdu data collection obtained via different expansion models such as KL, Bo1 and Bo2. The current collection is quite large in size compared to other existing Urdu datasets. It comprises of 85,304 documents in a TRECschemes and 52 topics with their relevance assessment. In this paper we emphasize to enhance the retrieval model using the query expansion which is never done before on Urdu text. However, we show that a deep analysis of initial and expanded queries brings fascinating insights that could avail future research in the domain.","PeriodicalId":196731,"journal":{"name":"2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFRKM.2018.8464762","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
The information retrieval system need to be upgraded constantly to meet the challenges posed by the advanced user queries as the search system becoming more sophisticated with time. These problems have been addressed extensively in recent times in several research communities to achieve quick and relevant outcome. One such approach is to augment the query where the automatic query expansion increases the precision in information retrieval even if it can cut down the results for some queries. Here, the above approach was tested with the present Urdu data collection obtained via different expansion models such as KL, Bo1 and Bo2. The current collection is quite large in size compared to other existing Urdu datasets. It comprises of 85,304 documents in a TRECschemes and 52 topics with their relevance assessment. In this paper we emphasize to enhance the retrieval model using the query expansion which is never done before on Urdu text. However, we show that a deep analysis of initial and expanded queries brings fascinating insights that could avail future research in the domain.