Construction of discriminant model of web documents suitability as search results

Hikari Suganuma, Takamitsu Shioi, K. Hatano
{"title":"Construction of discriminant model of web documents suitability as search results","authors":"Hikari Suganuma, Takamitsu Shioi, K. Hatano","doi":"10.1145/3011141.3011204","DOIUrl":null,"url":null,"abstract":"In the research field of Web search engine development, the most important challenge is to extract more information from queries issued to Web search engines. However, the number of words in these queries tends to be small, so that it is difficult to extract information from them. Therefore, some researchers have focused on developing techniques, such as Web spam detection methods, that discriminate Web documents that do not constitute satisfactory search results. In this paper, we propose a method for constructing a discriminant model for determining whether Web documents constitute suitable or unsuitable search results of Web search engines. In contrast to current Web spam detection techniques, our method analyzes the characteristics of the Web documents quantitatively and eliminates the documents that are estimated to be unsuitable search results. Our experimental results show that our discriminant model can help to improve the effectiveness of Web search engines and the efficiency of Web document discriminators as compared to current Web spam detection techniques.","PeriodicalId":247823,"journal":{"name":"Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3011141.3011204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the research field of Web search engine development, the most important challenge is to extract more information from queries issued to Web search engines. However, the number of words in these queries tends to be small, so that it is difficult to extract information from them. Therefore, some researchers have focused on developing techniques, such as Web spam detection methods, that discriminate Web documents that do not constitute satisfactory search results. In this paper, we propose a method for constructing a discriminant model for determining whether Web documents constitute suitable or unsuitable search results of Web search engines. In contrast to current Web spam detection techniques, our method analyzes the characteristics of the Web documents quantitatively and eliminates the documents that are estimated to be unsuitable search results. Our experimental results show that our discriminant model can help to improve the effectiveness of Web search engines and the efficiency of Web document discriminators as compared to current Web spam detection techniques.
网络文档搜索结果适用性判别模型的构建
在Web搜索引擎开发的研究领域中,最大的挑战是如何从Web搜索引擎发出的查询中提取更多的信息。然而,这些查询中的单词数量往往很少,因此很难从中提取信息。因此,一些研究人员专注于开发技术,例如Web垃圾邮件检测方法,以区分不构成令人满意的搜索结果的Web文档。本文提出了一种构建判别模型的方法,以确定Web文档是否构成Web搜索引擎的合适或不合适的搜索结果。与当前的Web垃圾邮件检测技术相比,我们的方法定量地分析了Web文档的特征,并消除了估计为不合适的搜索结果的文档。实验结果表明,与当前的Web垃圾邮件检测技术相比,我们的判别模型可以帮助提高Web搜索引擎的有效性和Web文档判别器的效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信