Neural nets based predictive prefetching to tolerate WWW latency

Proceedings 20th IEEE International Conference on Distributed Computing Systems Pub Date : 2000-04-10 DOI:10.1109/ICDCS.2000.840980

Tamer I. Ibrahim, Chengzhong Xu

{"title":"Neural nets based predictive prefetching to tolerate WWW latency","authors":"Tamer I. Ibrahim, Chengzhong Xu","doi":"10.1109/ICDCS.2000.840980","DOIUrl":null,"url":null,"abstract":"With the explosive growth of WWW applications on the Internet, users are experiencing access delays more often than ever. Recent studies showed that prefetching could alleviate the WWW latency to a larger extent than caching. Existing prefetching methods are mostly based on URL graphs. They use the graphical nature of hypertext links to determine the possible paths through a hypertext system. While they have been demonstrated effective in prefetching of documents that are often accessed, they are incapable of pre-retrieving documents whose URLs had never been accessed. We propose a context-specific prefetching technique to overcome the limitation. It relies on keywords in anchor texts of URLs to characterize user access patterns and on neural networks over the keyword set to predict future requests. It features a self-learning capability and good adaptivity to the change of user surfing interest. The technique was implemented in a SmartNewsReader system and cross-examined in a daily browsing of MSNBC and CNN news sites. The experimental results showed an achievement of approximately 60% hit ratio due to prefetching. Of the prefetched documents, less than 30% was undesired.","PeriodicalId":284992,"journal":{"name":"Proceedings 20th IEEE International Conference on Distributed Computing Systems","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"57","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 20th IEEE International Conference on Distributed Computing Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDCS.2000.840980","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 57

Abstract

With the explosive growth of WWW applications on the Internet, users are experiencing access delays more often than ever. Recent studies showed that prefetching could alleviate the WWW latency to a larger extent than caching. Existing prefetching methods are mostly based on URL graphs. They use the graphical nature of hypertext links to determine the possible paths through a hypertext system. While they have been demonstrated effective in prefetching of documents that are often accessed, they are incapable of pre-retrieving documents whose URLs had never been accessed. We propose a context-specific prefetching technique to overcome the limitation. It relies on keywords in anchor texts of URLs to characterize user access patterns and on neural networks over the keyword set to predict future requests. It features a self-learning capability and good adaptivity to the change of user surfing interest. The technique was implemented in a SmartNewsReader system and cross-examined in a daily browsing of MSNBC and CNN news sites. The experimental results showed an achievement of approximately 60% hit ratio due to prefetching. Of the prefetched documents, less than 30% was undesired.

查看原文本刊更多论文

基于神经网络的预测预取容忍WWW延迟

随着Internet上WWW应用程序的爆炸性增长，用户比以往任何时候都更频繁地遇到访问延迟。最近的研究表明，预取比缓存更能缓解WWW延迟。现有的预取方法大多基于URL图。它们利用超文本链接的图形特性来确定通过超文本系统的可能路径。虽然它们在预取经常被访问的文档方面已被证明是有效的，但它们无法预检索url从未被访问过的文档。我们提出了一种特定于上下文的预取技术来克服这一限制。它依赖于url锚文本中的关键字来描述用户访问模式，并依赖于关键字集上的神经网络来预测未来的请求。它具有自学习能力和对用户上网兴趣变化的良好适应能力。该技术在SmartNewsReader系统中实现，并在每日浏览MSNBC和CNN新闻网站时进行了交叉检验。实验结果表明，通过预取可以达到约60%的命中率。在预取的文档中，不到30%是不需要的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings 20th IEEE International Conference on Distributed Computing Systems

自引率

0.00%

发文量