{"title":"基于用户行为的Web缓存自动搜索引擎替换算法研究","authors":"Feng Zhang, Xia-long Li","doi":"10.1109/WISA.2010.25","DOIUrl":null,"url":null,"abstract":"To improve the retrieval efficiency and performance of the large scale information retrieval systems, analyzed existing replacement algorithm for WEB caching, due to the diversity of the WEB traffic pattern, the traditional algorithms for cache updating can not be used in WEB environment effectively. In this paper, with click-through data analysis, a inverted file replacement algorithm for WEB caching is proposed. The analytic result shows that the click-through data for the cache updating algorithms is how the algorithm suits the WEB traffic pattern properly. Based on the poisson arrival model, a new cache policy, inverted file replacement algorithm, is proposed. The trace driven simulation shows that the retrieval algorithm under the new organization of the inverted file can decrease its execution time significantly and the performance of the inverted file replacement algorithms is better than that of the existing algorithms proposed in the literature.","PeriodicalId":122827,"journal":{"name":"2010 Seventh Web Information Systems and Applications Conference","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Research in Automatic Search Engine Replacement Algorithm for Web Caching Based on User Behavior\",\"authors\":\"Feng Zhang, Xia-long Li\",\"doi\":\"10.1109/WISA.2010.25\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To improve the retrieval efficiency and performance of the large scale information retrieval systems, analyzed existing replacement algorithm for WEB caching, due to the diversity of the WEB traffic pattern, the traditional algorithms for cache updating can not be used in WEB environment effectively. In this paper, with click-through data analysis, a inverted file replacement algorithm for WEB caching is proposed. The analytic result shows that the click-through data for the cache updating algorithms is how the algorithm suits the WEB traffic pattern properly. Based on the poisson arrival model, a new cache policy, inverted file replacement algorithm, is proposed. The trace driven simulation shows that the retrieval algorithm under the new organization of the inverted file can decrease its execution time significantly and the performance of the inverted file replacement algorithms is better than that of the existing algorithms proposed in the literature.\",\"PeriodicalId\":122827,\"journal\":{\"name\":\"2010 Seventh Web Information Systems and Applications Conference\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 Seventh Web Information Systems and Applications Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WISA.2010.25\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Seventh Web Information Systems and Applications Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISA.2010.25","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research in Automatic Search Engine Replacement Algorithm for Web Caching Based on User Behavior
To improve the retrieval efficiency and performance of the large scale information retrieval systems, analyzed existing replacement algorithm for WEB caching, due to the diversity of the WEB traffic pattern, the traditional algorithms for cache updating can not be used in WEB environment effectively. In this paper, with click-through data analysis, a inverted file replacement algorithm for WEB caching is proposed. The analytic result shows that the click-through data for the cache updating algorithms is how the algorithm suits the WEB traffic pattern properly. Based on the poisson arrival model, a new cache policy, inverted file replacement algorithm, is proposed. The trace driven simulation shows that the retrieval algorithm under the new organization of the inverted file can decrease its execution time significantly and the performance of the inverted file replacement algorithms is better than that of the existing algorithms proposed in the literature.