How do metrics of link analysis correlate to quality, relevance and popularity in wikipedia?

Raíza Hanada, Marco Cristo, M. G. Pimentel
{"title":"How do metrics of link analysis correlate to quality, relevance and popularity in wikipedia?","authors":"Raíza Hanada, Marco Cristo, M. G. Pimentel","doi":"10.1145/2526188.2526198","DOIUrl":null,"url":null,"abstract":"Many links between Web pages can be viewed as indicative of the quality and importance of the pages they pointed to. Accordingly, several studies have proposed metrics based on links to infer web page content quality. However, as far as we know, the only work that has examined the correlation between such metrics and content quality consisted of a limited study that left many open questions. In spite of these metrics having been shown successful in the task of ranking pages which were provided as answers to queries submitted to search engines, it is not possible to determine the specific contribution of factors such as quality, popularity, and importance to the results. This difficulty is partially due to the fact that such information is hard to obtain for Web pages in general. Unlike ordinary Web pages, the quality, importance and popularity of Wikipedia articles are evaluated by human experts or might be easily estimated. Thus, it is feasible to verify the relation between link analysis metrics and such factors in Wikipedia articles, our goal in this work. To accomplish that, we implemented several link analysis algorithms and compared their resulting rankings with the ones created by human evaluators regarding factors such as quality, popularity and importance. We found that the metrics are more correlated to quality and popularity than to importance, and the correlation is moderate.","PeriodicalId":114454,"journal":{"name":"Brazilian Symposium on Multimedia and the Web","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brazilian Symposium on Multimedia and the Web","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2526188.2526198","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

Abstract

Many links between Web pages can be viewed as indicative of the quality and importance of the pages they pointed to. Accordingly, several studies have proposed metrics based on links to infer web page content quality. However, as far as we know, the only work that has examined the correlation between such metrics and content quality consisted of a limited study that left many open questions. In spite of these metrics having been shown successful in the task of ranking pages which were provided as answers to queries submitted to search engines, it is not possible to determine the specific contribution of factors such as quality, popularity, and importance to the results. This difficulty is partially due to the fact that such information is hard to obtain for Web pages in general. Unlike ordinary Web pages, the quality, importance and popularity of Wikipedia articles are evaluated by human experts or might be easily estimated. Thus, it is feasible to verify the relation between link analysis metrics and such factors in Wikipedia articles, our goal in this work. To accomplish that, we implemented several link analysis algorithms and compared their resulting rankings with the ones created by human evaluators regarding factors such as quality, popularity and importance. We found that the metrics are more correlated to quality and popularity than to importance, and the correlation is moderate.
链接分析的指标如何与维基百科的质量、相关性和受欢迎程度相关联?
Web页面之间的许多链接可以被视为它们所指向的页面的质量和重要性的指示。因此,一些研究提出了基于链接的指标来推断网页内容的质量。然而,据我们所知,唯一检验这些指标和内容质量之间相关性的工作是一项有限的研究,留下了许多悬而未决的问题。尽管这些指标已被证明能够成功地对提交给搜索引擎的查询提供答案的页面进行排名,但不可能确定诸如质量、受欢迎程度和对结果的重要性等因素的具体贡献。造成这种困难的部分原因是,一般来说,很难从Web页面获取此类信息。与普通网页不同,维基百科文章的质量、重要性和受欢迎程度是由人类专家评估的,或者很容易估计。因此,验证维基百科文章中链接分析指标与这些因素之间的关系是可行的,这是我们在这项工作中的目标。为了实现这一点,我们实施了几种链接分析算法,并将它们的结果排名与人类评估者根据质量、受欢迎程度和重要性等因素创建的排名进行比较。我们发现这些指标与质量和受欢迎程度的相关性大于与重要性的相关性,而且相关性是中等的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信