Cosine similarity-based PageRank calculation
S. Poomagal, T. Hamsapriya
Int. J. Web Sci., published 2011-12-05
DOI: 10.1504/IJWS.2011.044085 (https://doi.org/10.1504/IJWS.2011.044085)
Citation count: 3
Abstract
This paper introduces a new method for ranking web pages that combines content similarity with link structure. Several ranking algorithms in the literature compute an importance score for web pages, and all of them are based on the link structure of the web. Because links from similar documents are more important than links from dissimilar documents, combining content similarity with link structure assigns higher ranks to more relevant documents. The cosine similarity measure is used in this paper to compute the similarity between documents. The proposed technique is compared with existing ranking algorithms using precision, recall and F-measure.
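
The abstract does not give the paper's ranking formula, so the following is only a minimal sketch of one way content similarity could be folded into a PageRank-style iteration. It assumes that each page splits its rank among its outlinks in proportion to the cosine similarity between the two pages' term-frequency vectors. The function names, the whitespace tokenization, the damping factor of 0.85, the small `eps` smoothing term and the toy pages are all illustrative assumptions, not details taken from the paper.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_weighted_pagerank(links, texts, damping=0.85,
                                 iterations=50, eps=1e-6):
    """PageRank-style scores where each outlink is weighted by content similarity.

    links: dict mapping a page to the list of pages it links to
           (every target is assumed to be a key of `texts`).
    texts: dict mapping a page to its text content.
    This weighting scheme is an assumption made for illustration.
    """
    pages = sorted(texts)
    n = len(pages)
    # Naive bag-of-words vectors; a real system would likely use TF-IDF.
    vectors = {p: Counter(texts[p].lower().split()) for p in pages}

    # weights[src][dst] is the share of src's rank passed on to dst.
    weights = {}
    for src in pages:
        raw = {dst: cosine_similarity(vectors[src], vectors[dst]) + eps
               for dst in links.get(src, [])}
        total = sum(raw.values())
        weights[src] = {dst: w / total for dst, w in raw.items()}

    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without outlinks is spread uniformly.
        dangling = sum(rank[p] for p in pages if not weights[p])
        new_rank = {p: (1 - damping) / n + damping * dangling / n
                    for p in pages}
        for src, out in weights.items():
            for dst, w in out.items():
                new_rank[dst] += damping * rank[src] * w
        rank = new_rank
    return rank


# Toy example: page "a" links to a similar page "b" and a dissimilar page "c".
texts = {
    "a": "web page ranking links",
    "b": "web page ranking search",
    "c": "cooking pasta recipes",
}
links = {"a": ["b", "c"], "b": ["a"], "c": ["a"]}
ranks = similarity_weighted_pagerank(links, texts)
```

In this toy graph, "b" and "c" each receive a single inlink from "a", so plain PageRank would score them equally. Under the assumed weighting, "b" ends up ranked above "c" because its content overlaps with that of the linking page.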