现代数据存储的成本/性能:数据缓存系统如何成功

2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW) Pub Date : 2018-06-11 DOI:10.1145/3211922.3211927

D. Lomet

{"title":"现代数据存储的成本/性能:数据缓存系统如何成功","authors":"D. Lomet","doi":"10.1145/3211922.3211927","DOIUrl":null,"url":null,"abstract":"Summary form only given, as follows. The complete presentation was not made available for publication as part of the conference proceedings. Data in traditional \"caching\" data systems resides on secondary storage, and is read into main memory only when operated on. This limits system performance. Main memory data stores with data always in main memory are much faster. But this performance comes at a cost. In this paper, we analyze the costs of both in-memory operations and secondary storage operations where data is not \"in cache\". We study the performance impact of cache misses on caching system performance. The analysis considers both execution and storage costs. Based on our analysis, we derive cost/performance results for a data caching system [Deuteronomy and its Bw-tree] and a main memory system [MassTree] to understand where each demonstrates the best cost per operation, what is driving the cost differences, and the scale of the differences. This analysis (1) provides insight into why data caching systems continue to dominate the market; (2) points to higher performance that does not rely on simply increasing main memory cache size; and (3) suggests a path to lower costs and hence better cost/performance.","PeriodicalId":186190,"journal":{"name":"2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":"{\"title\":\"Cost/Performance in Modern Data Stores: How Data Caching Systems Succeed\",\"authors\":\"D. Lomet\",\"doi\":\"10.1145/3211922.3211927\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Summary form only given, as follows. The complete presentation was not made available for publication as part of the conference proceedings. Data in traditional \\\"caching\\\" data systems resides on secondary storage, and is read into main memory only when operated on. This limits system performance. Main memory data stores with data always in main memory are much faster. But this performance comes at a cost. In this paper, we analyze the costs of both in-memory operations and secondary storage operations where data is not \\\"in cache\\\". We study the performance impact of cache misses on caching system performance. The analysis considers both execution and storage costs. Based on our analysis, we derive cost/performance results for a data caching system [Deuteronomy and its Bw-tree] and a main memory system [MassTree] to understand where each demonstrates the best cost per operation, what is driving the cost differences, and the scale of the differences. This analysis (1) provides insight into why data caching systems continue to dominate the market; (2) points to higher performance that does not rely on simply increasing main memory cache size; and (3) suggests a path to lower costs and hence better cost/performance.\",\"PeriodicalId\":186190,\"journal\":{\"name\":\"2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-06-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"28\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3211922.3211927\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3211922.3211927","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 28

摘要

仅给出摘要形式，如下。完整的报告没有作为会议记录的一部分提供出版。传统的“缓存”数据系统中的数据驻留在二级存储器上，只有在对其进行操作时才读入主存。这限制了系统性能。数据总是在主存中的主存数据存储要快得多。但这种表现是有代价的。在本文中，我们分析了内存操作和二级存储操作的成本，其中数据不在“缓存中”。我们研究了缓存缺失对缓存系统性能的影响。该分析同时考虑了执行和存储成本。根据我们的分析，我们得出了数据缓存系统(Deuteronomy及其Bw-tree)和主内存系统(masstreet)的成本/性能结果，以了解每个操作在哪些方面表现出最佳成本，是什么导致了成本差异，以及差异的规模。本分析(1)提供了数据缓存系统继续主导市场的原因;(2)指向更高的性能，而不是简单地依赖于增加主内存缓存大小;(3)提出了降低成本从而提高性价比的途径。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Cost/Performance in Modern Data Stores: How Data Caching Systems Succeed

Summary form only given, as follows. The complete presentation was not made available for publication as part of the conference proceedings. Data in traditional "caching" data systems resides on secondary storage, and is read into main memory only when operated on. This limits system performance. Main memory data stores with data always in main memory are much faster. But this performance comes at a cost. In this paper, we analyze the costs of both in-memory operations and secondary storage operations where data is not "in cache". We study the performance impact of cache misses on caching system performance. The analysis considers both execution and storage costs. Based on our analysis, we derive cost/performance results for a data caching system [Deuteronomy and its Bw-tree] and a main memory system [MassTree] to understand where each demonstrates the best cost per operation, what is driving the cost differences, and the scale of the differences. This analysis (1) provides insight into why data caching systems continue to dominate the market; (2) points to higher performance that does not rely on simply increasing main memory cache size; and (3) suggests a path to lower costs and hence better cost/performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW)

自引率

0.00%

发文量