使用信息气味进行查询扩展

S. Chawla, Punam Bedi
{"title":"使用信息气味进行查询扩展","authors":"S. Chawla, Punam Bedi","doi":"10.1109/ITSIM.2008.4632051","DOIUrl":null,"url":null,"abstract":"Web has grown to a huge mass of information resource and is diverse in content. To search such rich source of information one has to be very precise in using keywords in queries to retrieve the relevant documents. Most of the queries issued to search engines are short and have ambiguous context. One way to produce effective queries is by automatic query expansion. Work has been done in this field to use the local and global techniques. The global techniques examine word occurrences and relationships in the corpus as a whole and use this information to expand a particular query. Local context analysis examines the concept occurrences and relationship in top ranked documents retrieved by the original input query to expand the same query. Query log of search engines is used by researchers to expand the input queries using the clicked documents related to any of the terms of input query in query session of query log. In this paper a new local analysis technique is proposed which make use of information need of query sessions modeled using Information Scent and content of clicked documents to select the clicked documents for query expansion. Information scent is the subjective sense of value and cost of accessing a page based on perceptual cues with respect to the information need of the user. The input query issued in a particular domain is used to select the set of documents associated with the information need of the query sessions in the same domain and used as local corpora to provide related set of terms to be added to the input query. The resulting expanded query is used to retrieve the relevant documents from the same retrieval system. This approach is unique as it is using those documents in local corpora which belong to the information need associated with the domain in which input query is issued using Information Scent and content of clicked pages in the query sessions and direct the search in a fruitful direction by expanding initial input query using set of related terms. Experimental study of the proposed approach is done on the data set extracted from Web history of ldquoGooglerdquo search engine and improvement in the information retrieval precision with low computation complexity during online processing of input queries confirms the effectiveness of the proposed approach.","PeriodicalId":314159,"journal":{"name":"2008 International Symposium on Information Technology","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Query expansion using information scent\",\"authors\":\"S. Chawla, Punam Bedi\",\"doi\":\"10.1109/ITSIM.2008.4632051\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Web has grown to a huge mass of information resource and is diverse in content. To search such rich source of information one has to be very precise in using keywords in queries to retrieve the relevant documents. Most of the queries issued to search engines are short and have ambiguous context. One way to produce effective queries is by automatic query expansion. Work has been done in this field to use the local and global techniques. The global techniques examine word occurrences and relationships in the corpus as a whole and use this information to expand a particular query. Local context analysis examines the concept occurrences and relationship in top ranked documents retrieved by the original input query to expand the same query. Query log of search engines is used by researchers to expand the input queries using the clicked documents related to any of the terms of input query in query session of query log. In this paper a new local analysis technique is proposed which make use of information need of query sessions modeled using Information Scent and content of clicked documents to select the clicked documents for query expansion. Information scent is the subjective sense of value and cost of accessing a page based on perceptual cues with respect to the information need of the user. The input query issued in a particular domain is used to select the set of documents associated with the information need of the query sessions in the same domain and used as local corpora to provide related set of terms to be added to the input query. The resulting expanded query is used to retrieve the relevant documents from the same retrieval system. This approach is unique as it is using those documents in local corpora which belong to the information need associated with the domain in which input query is issued using Information Scent and content of clicked pages in the query sessions and direct the search in a fruitful direction by expanding initial input query using set of related terms. Experimental study of the proposed approach is done on the data set extracted from Web history of ldquoGooglerdquo search engine and improvement in the information retrieval precision with low computation complexity during online processing of input queries confirms the effectiveness of the proposed approach.\",\"PeriodicalId\":314159,\"journal\":{\"name\":\"2008 International Symposium on Information Technology\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 International Symposium on Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITSIM.2008.4632051\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Symposium on Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSIM.2008.4632051","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

网络已经发展成为一个庞大的信息资源,其内容是多样化的。要搜索如此丰富的信息源,必须非常精确地在查询中使用关键字来检索相关文档。大多数发送给搜索引擎的查询都很短,并且具有模糊的上下文。生成有效查询的一种方法是通过自动查询扩展。在这一领域已开展工作,利用当地和全球技术。全局技术从整体上检查语料库中的单词出现和关系,并使用这些信息来扩展特定的查询。本地上下文分析检查原始输入查询检索的排名靠前的文档中的概念出现和关系,以扩展相同的查询。搜索引擎的查询日志是研究人员在查询日志的查询会话中,利用与输入查询的任一项相关的点击文档,对输入查询进行扩展。本文提出了一种新的局部分析技术,利用信息气味和被点击文档的内容建模查询会话的信息需求,选择被点击的文档进行查询扩展。信息气味是基于用户对信息需求的感知线索对访问页面的价值和成本的主观感觉。在特定域中发出的输入查询用于选择与同一域中查询会话的信息需求相关联的文档集,并用作本地语料库,以提供要添加到输入查询中的相关术语集。得到的扩展查询用于从同一检索系统检索相关文档。这种方法的独特之处在于,它使用本地语料库中的文档,这些文档属于与输入查询相关的领域的信息需求,使用信息气味和查询会话中单击页面的内容,并通过使用一组相关术语扩展初始输入查询,将搜索引向富有成效的方向。在ldquoGooglerdquo搜索引擎的Web历史数据集上进行了实验研究,在在线处理输入查询时提高了信息检索精度和较低的计算复杂度,验证了所提方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Query expansion using information scent
Web has grown to a huge mass of information resource and is diverse in content. To search such rich source of information one has to be very precise in using keywords in queries to retrieve the relevant documents. Most of the queries issued to search engines are short and have ambiguous context. One way to produce effective queries is by automatic query expansion. Work has been done in this field to use the local and global techniques. The global techniques examine word occurrences and relationships in the corpus as a whole and use this information to expand a particular query. Local context analysis examines the concept occurrences and relationship in top ranked documents retrieved by the original input query to expand the same query. Query log of search engines is used by researchers to expand the input queries using the clicked documents related to any of the terms of input query in query session of query log. In this paper a new local analysis technique is proposed which make use of information need of query sessions modeled using Information Scent and content of clicked documents to select the clicked documents for query expansion. Information scent is the subjective sense of value and cost of accessing a page based on perceptual cues with respect to the information need of the user. The input query issued in a particular domain is used to select the set of documents associated with the information need of the query sessions in the same domain and used as local corpora to provide related set of terms to be added to the input query. The resulting expanded query is used to retrieve the relevant documents from the same retrieval system. This approach is unique as it is using those documents in local corpora which belong to the information need associated with the domain in which input query is issued using Information Scent and content of clicked pages in the query sessions and direct the search in a fruitful direction by expanding initial input query using set of related terms. Experimental study of the proposed approach is done on the data set extracted from Web history of ldquoGooglerdquo search engine and improvement in the information retrieval precision with low computation complexity during online processing of input queries confirms the effectiveness of the proposed approach.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信