Cosine similarity-based PageRank calculation

Int. J. Web Sci. Pub Date : 2011-12-05 DOI:10.1504/IJWS.2011.044085

S. Poomagal, T. Hamsapriya

引用次数: 3

Abstract

This paper introduces a new method for calculating the rank of a web page based on the content similarity and the link structure. There are different ranking algorithms available in the literature to calculate the importance score of web pages. The basis of all ranking algorithms is the link structure of the web. Since links from similar documents are more important than the links from other dissimilar documents, combining content similarity with link structure assigns higher ranks to more relevant documents. Cosine similarity measure is used in this paper for calculating similarity among the documents. The proposed technique is compared with existing ranking algorithms using precision, recall and F-measure.

查看原文本刊更多论文

基于余弦相似度的PageRank计算

本文介绍了一种基于网页内容相似度和链接结构计算网页排名的新方法。在文献中有不同的排序算法可用来计算网页的重要性得分。所有排名算法的基础是网络的链接结构。由于来自相似文档的链接比来自其他不相似文档的链接更重要，因此将内容相似度与链接结构相结合可以为更相关的文档分配更高的排名。本文采用余弦相似度度量来计算文档之间的相似度。将该方法与现有的排序算法进行了精度、召回率和f度量的比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Int. J. Web Sci.

自引率

0.00%

发文量