A Boolean Model in Information Retrieval for Search Engines

Arash Habibi Lashkari, Fereshteh Mahdavi, Vahid Ghomi
Published in: 2009 International Conference on Information Management and Engineering
Publication date: 2009-04-03
DOI: 10.1109/ICIME.2009.101 (https://doi.org/10.1109/ICIME.2009.101)
Citations: 53

Abstract

An information retrieval (IR) process begins when a user enters a query into the system. Queries are formal statements of information needs, for example search strings in web search engines. In IR, a query does not uniquely identify a single object in the collection. Instead, several objects may match the query, perhaps with different degrees of relevance.

An object is an entity that stores information in a database. User queries are matched against the objects stored in the database. Depending on the application, the data objects may be, for example, text documents, images, or videos. The documents themselves are not stored directly in the IR system but are instead represented in the system by document surrogates.

Most IR systems compute a numeric score for how well each object in the database matches the query and rank the objects by this value. The top-ranking objects are then shown to the user. The process may then be iterated if the user wishes to refine the query.

In this paper we explain IR methods and assess them from two viewpoints. Finally, we propose a simple method for ranking terms and documents in IR, implement the method, and check the results.
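
The match-then-rank process described in the abstract can be illustrated with a small sketch. This is not the ranking method the paper proposes, which the abstract does not specify; it is a generic illustration assuming a whitespace tokenizer, a toy three-document collection, and a score equal to the number of distinct query terms a document contains. The names `build_index`, `boolean_and`, and `rank_by_overlap` are hypothetical.

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercase term to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def boolean_and(index, terms):
    """Return ids of documents containing every query term (Boolean AND)."""
    postings = [index.get(t.lower(), set()) for t in terms]
    return set.intersection(*postings) if postings else set()


def rank_by_overlap(index, terms):
    """Score each document by how many distinct query terms it contains."""
    scores = defaultdict(int)
    for term in set(t.lower() for t in terms):
        for doc_id in index.get(term, set()):
            scores[doc_id] += 1
    # Highest score first; ties broken by document id for a stable order.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Toy collection (assumed for illustration only).
docs = {
    "d1": "boolean model for information retrieval",
    "d2": "ranking documents in web search engines",
    "d3": "information retrieval and ranking in search engines",
}
index = build_index(docs)

# Strict Boolean match: documents containing both terms.
exact = boolean_and(index, ["information", "retrieval"])

# Graded match: documents ordered by how many query terms they contain.
ranked = rank_by_overlap(index, ["information", "ranking", "search"])
```

The strict Boolean query returns an unordered set of matching documents, while the overlap score orders documents that satisfy the query to different degrees, which is the distinction the abstract draws between matching and ranking.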