Using semantic information to improve transparent query caching for dynamic content Web sites

G. Soundararajan, C. Amza
Published in: International Workshop on Data Engineering Issues in E-Commerce
Publication date: 2005-04-09
DOI: 10.1109/DEEC.2005.25 (https://doi.org/10.1109/DEEC.2005.25)
Citations: 14

Abstract

In this paper, we study the use of semantic information to improve the performance of transparent query caching for dynamic content Web sites. We observe that in dynamic content Web applications, the most recently inserted items are also the ones that register the highest activity. For example, the newest books in a bookstore are also the ones most frequently browsed and bought. Hence, assuming repeatable queries, a particular read-only query response is likely to change incrementally as new rows are added to the query's tables. We avoid the cached query response invalidations that would otherwise occur due to the addition of new items by keeping the newly inserted rows in small temporary tables. This allows us to reuse cached responses for partial coverage of query results. A query result is then obtained by merging an existing cached response with one or more lightweight residual query results that involve the temporary tables. In addition, we enhance our cache with other partial coverage techniques based on per-query semantic information, such as sub-range queries for all queries that match a specific template. We implement semantic query caching on top of an existing template-based cache with column-based invalidations. Our evaluation is based on a dynamic content site using the Apache Web server with Tomcat Java servlets and the MySQL relational database. We use the industry-standard TPC-W e-commerce benchmark as our benchmark application. We conclude that augmenting transparent query caching with the ability to retrieve partial results from the cache improves performance substantially in terms of latency and, to a lesser extent, in terms of hit rate and throughput.
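
The partial-coverage idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the class `PartialCoverageCache`, its method names, and the use of in-memory lists in place of MySQL tables are all assumptions made for illustration. It handles inserts and range queries on a single column only. New rows go into a small delta list (standing in for the temporary table) instead of invalidating cached responses, and a query is answered by merging a cached response with a residual scan of the delta. A cached wider range is also reused to answer a narrower sub-range.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional, Tuple

Row = Dict[str, Any]
CacheKey = Tuple[str, float, float]  # (column, low bound, high bound)


@dataclass
class PartialCoverageCache:
    """Hypothetical sketch: cached range-query results plus a delta of new rows."""

    base_rows: List[Row]                                   # stands in for the main table
    delta_rows: List[Row] = field(default_factory=list)    # stands in for the temporary table
    cached: Dict[CacheKey, List[Row]] = field(default_factory=dict)
    base_scans: int = 0                                    # counts expensive base-table queries

    def insert(self, row: Row) -> None:
        # New rows are kept apart, so existing cached responses stay valid.
        self.delta_rows.append(row)

    @staticmethod
    def _scan(rows: List[Row], column: str, lo: float, hi: float) -> List[Row]:
        return [r for r in rows if lo <= r[column] <= hi]

    def _lookup(self, column: str, lo: float, hi: float) -> Optional[List[Row]]:
        # A cached response covers the request if its range contains [lo, hi].
        for (c, clo, chi), rows in self.cached.items():
            if c == column and clo <= lo and hi <= chi:
                return rows
        return None

    def query(self, column: str, lo: float, hi: float) -> List[Row]:
        covering = self._lookup(column, lo, hi)
        if covering is None:
            # Cache miss: run the query against the base table and cache it.
            self.base_scans += 1
            covered = self._scan(self.base_rows, column, lo, hi)
            self.cached[(column, lo, hi)] = covered
        else:
            # Hit, possibly on a wider range: filter down to the sub-range.
            covered = self._scan(covering, column, lo, hi)
        # Lightweight residual query over the small delta only.
        residual = self._scan(self.delta_rows, column, lo, hi)
        return sorted(covered + residual, key=lambda r: r[column])


books = [
    {"title": "A", "price": 12.0},
    {"title": "B", "price": 40.0},
    {"title": "C", "price": 75.0},
]
cache = PartialCoverageCache(base_rows=books)
first = cache.query("price", 10, 50)        # miss: scans the base table once
cache.insert({"title": "D", "price": 20.0}) # no invalidation
second = cache.query("price", 10, 50)       # cached response merged with delta
narrow = cache.query("price", 15, 30)       # sub-range served from the wider entry
```

In this sketch only the first query touches the base rows; the later two are served from the cached response plus the delta. A real system would also need to handle updates and deletes, which the sketch leaves out.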