{"title":"超链接分类:提高PageRank的新方法","authors":"Li Cun-he, Lv Ke-qiang","doi":"10.1109/DEXA.2007.14","DOIUrl":null,"url":null,"abstract":"Hyperlink structure is widely used in the hypertext classification, but it has not been paid enough attention. We propose a hyperlink classification approach to improve PageRank algorithm which is widely used in the link analysis of search engine. The cause of the topic drift problem is analyzed and the hyperlinks are classified according to their creating motivations and effects. The improved PageRank algorithm is implemented on the open source search engine NUTCH in Chinese Internet. The experimental results show that the improved PageRank algorithm performs better than the standard PageRank.","PeriodicalId":314834,"journal":{"name":"18th International Workshop on Database and Expert Systems Applications (DEXA 2007)","volume":"110 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":"{\"title\":\"Hyperlink Classification: A New Approach to Improve PageRank\",\"authors\":\"Li Cun-he, Lv Ke-qiang\",\"doi\":\"10.1109/DEXA.2007.14\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hyperlink structure is widely used in the hypertext classification, but it has not been paid enough attention. We propose a hyperlink classification approach to improve PageRank algorithm which is widely used in the link analysis of search engine. The cause of the topic drift problem is analyzed and the hyperlinks are classified according to their creating motivations and effects. The improved PageRank algorithm is implemented on the open source search engine NUTCH in Chinese Internet. The experimental results show that the improved PageRank algorithm performs better than the standard PageRank.\",\"PeriodicalId\":314834,\"journal\":{\"name\":\"18th International Workshop on Database and Expert Systems Applications (DEXA 2007)\",\"volume\":\"110 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"17\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"18th International Workshop on Database and Expert Systems Applications (DEXA 2007)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DEXA.2007.14\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"18th International Workshop on Database and Expert Systems Applications (DEXA 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEXA.2007.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hyperlink Classification: A New Approach to Improve PageRank
Hyperlink structure is widely used in the hypertext classification, but it has not been paid enough attention. We propose a hyperlink classification approach to improve PageRank algorithm which is widely used in the link analysis of search engine. The cause of the topic drift problem is analyzed and the hyperlinks are classified according to their creating motivations and effects. The improved PageRank algorithm is implemented on the open source search engine NUTCH in Chinese Internet. The experimental results show that the improved PageRank algorithm performs better than the standard PageRank.