Shaojie Qiao, Tianrui Li, Hong Li, Yan Zhu, Jing Peng, Jiangtao Qiu
{"title":"simmrank:基于相似性度量的页面排名方法","authors":"Shaojie Qiao, Tianrui Li, Hong Li, Yan Zhu, Jing Peng, Jiangtao Qiu","doi":"10.1109/ISKE.2010.5680842","DOIUrl":null,"url":null,"abstract":"As the Web contains rich and convenient information, Web search engine is increasingly becoming the dominant information retrieving approach. In order to rank the query results of web pages in an effective and efficient fashion, we propose a new page rank algorithm based on similarity measure from the vector space model, called SimRank, to score web pages. Firstly, we propose a new similarity measure to compute the similarity of pages and apply it to partition a web database into several web social networks (WSNs). Secondly, we improve the traditional PageRank algorithm by taking into account the relevance of page to a given query. Thirdly, we design an efficient web crawler to download the web data. And finally, we perform experimental studies to evaluate the time efficiency and scoring accuracy of SimRank with other approaches.","PeriodicalId":6417,"journal":{"name":"2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering","volume":"47 1","pages":"390-395"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"39","resultStr":"{\"title\":\"SimRank: A Page Rank approach based on similarity measure\",\"authors\":\"Shaojie Qiao, Tianrui Li, Hong Li, Yan Zhu, Jing Peng, Jiangtao Qiu\",\"doi\":\"10.1109/ISKE.2010.5680842\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the Web contains rich and convenient information, Web search engine is increasingly becoming the dominant information retrieving approach. In order to rank the query results of web pages in an effective and efficient fashion, we propose a new page rank algorithm based on similarity measure from the vector space model, called SimRank, to score web pages. Firstly, we propose a new similarity measure to compute the similarity of pages and apply it to partition a web database into several web social networks (WSNs). Secondly, we improve the traditional PageRank algorithm by taking into account the relevance of page to a given query. Thirdly, we design an efficient web crawler to download the web data. And finally, we perform experimental studies to evaluate the time efficiency and scoring accuracy of SimRank with other approaches.\",\"PeriodicalId\":6417,\"journal\":{\"name\":\"2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering\",\"volume\":\"47 1\",\"pages\":\"390-395\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"39\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISKE.2010.5680842\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISKE.2010.5680842","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
SimRank: A Page Rank approach based on similarity measure
As the Web contains rich and convenient information, Web search engine is increasingly becoming the dominant information retrieving approach. In order to rank the query results of web pages in an effective and efficient fashion, we propose a new page rank algorithm based on similarity measure from the vector space model, called SimRank, to score web pages. Firstly, we propose a new similarity measure to compute the similarity of pages and apply it to partition a web database into several web social networks (WSNs). Secondly, we improve the traditional PageRank algorithm by taking into account the relevance of page to a given query. Thirdly, we design an efficient web crawler to download the web data. And finally, we perform experimental studies to evaluate the time efficiency and scoring accuracy of SimRank with other approaches.