{"title":"Utilizing group prediction by users' interests to improve the performance of web proxy servers","authors":"Tsozen Yeh, Liang-fan Chang","doi":"10.1109/APNOMS.2014.6996559","DOIUrl":null,"url":null,"abstract":"Companies and institutions often use Web proxy servers to service the multiple requests of the same Web pages (or Web objects) from users therein to save the network bandwidth and reduce the Internet latency. Web proxy servers usually are geographically close to their clients (users). If Web proxy serves have cached valid copies of requested Web objects, they can be directly delivered to users. Otherwise, users need to spend a long time on getting Web objects from their hosting Web servers. Both cases will require users to wait for their requests. This period of latency can be largely reduced if Web proxy servers could predict what Web objects users may need in the near future, and send those predicted Web objects to the client sites before their actual usage. Various predicting algorithms, such as those based on temporal locality and data mining, had been proposed to enable proxy servers to make prediction of what Web objects users may access. However, they often need to constantly update and maintain their models in realtime to make their schemes effective. As a result, for proxy servers servicing a large number of users, schemes demanding for complicated and realtime calculation will not be as useful as expected in practice. Based on Web sites and Web pages commonly visited by users, we proposed a new model with an offline learning algorithm to help Web proxy servers make prediction about upcoming requests of Web objects from users. Compared with the hit ratio achieved by the original environment without prediction, our model can improve the caching performance of users' Web browsers by up to 51.37%.","PeriodicalId":269952,"journal":{"name":"The 16th Asia-Pacific Network Operations and Management Symposium","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 16th Asia-Pacific Network Operations and Management Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APNOMS.2014.6996559","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Companies and institutions often use Web proxy servers to service the multiple requests of the same Web pages (or Web objects) from users therein to save the network bandwidth and reduce the Internet latency. Web proxy servers usually are geographically close to their clients (users). If Web proxy serves have cached valid copies of requested Web objects, they can be directly delivered to users. Otherwise, users need to spend a long time on getting Web objects from their hosting Web servers. Both cases will require users to wait for their requests. This period of latency can be largely reduced if Web proxy servers could predict what Web objects users may need in the near future, and send those predicted Web objects to the client sites before their actual usage. Various predicting algorithms, such as those based on temporal locality and data mining, had been proposed to enable proxy servers to make prediction of what Web objects users may access. However, they often need to constantly update and maintain their models in realtime to make their schemes effective. As a result, for proxy servers servicing a large number of users, schemes demanding for complicated and realtime calculation will not be as useful as expected in practice. Based on Web sites and Web pages commonly visited by users, we proposed a new model with an offline learning algorithm to help Web proxy servers make prediction about upcoming requests of Web objects from users. Compared with the hit ratio achieved by the original environment without prediction, our model can improve the caching performance of users' Web browsers by up to 51.37%.