Performance Evaluation of NoSQL Databases: A Case Study

Proceedings of the 1st Workshop on Performance Analysis of Big Data Systems. Pub Date: 2015-02-01. DOI: 10.1145/2694730.2694731

John Klein, I. Gorton, Neil A. Ernst, P. Donohoe, Kim Pham, Chrisjan Matser


Citations: 94

Abstract



The choice of a particular NoSQL database imposes a specific distributed software architecture and data model, and is a major determinant of the overall system throughput. NoSQL database performance is in turn strongly influenced by how well the data model and query capabilities fit the application use cases, and so system-specific testing and characterization is required. This paper presents a method and the results of a study that selected among three NoSQL databases for a large, distributed healthcare organization. While the method and study considered consistency, availability, and partition tolerance (CAP) tradeoffs, and other quality attributes that influence the selection decision, this paper reports on the performance evaluation method and results. In our testing, a typical workload and configuration produced throughput that varied from 225 to 3200 operations per second between database products, while read operation latency varied by a factor of 5 and write latency by a factor of 4 (with the highest throughput product delivering the highest latency). We also found that achieving strong consistency reduced throughput by 10-25% compared to eventual consistency.
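To make the scale of the reported figures concrete, the arithmetic can be sketched in a few lines of Python. This is only an illustration built on the numbers quoted in the abstract: the function names are hypothetical, and the 1000 operations-per-second baseline used in the demo is an assumed input, not a measurement from the study.

```python
from __future__ import annotations

# Figures quoted in the abstract (operations per second).
MIN_THROUGHPUT_OPS = 225
MAX_THROUGHPUT_OPS = 3200

# Reported throughput cost of strong vs. eventual consistency, as fractions.
STRONG_CONSISTENCY_PENALTY = (0.10, 0.25)


def throughput_spread(low: float, high: float) -> float:
    """Ratio between the highest and lowest measured throughput."""
    if low <= 0:
        raise ValueError("low must be positive")
    return high / low


def strong_consistency_range(eventual_ops: float) -> tuple[float, float]:
    """Apply the reported 10-25% reduction to an eventual-consistency throughput.

    Returns (worst_case, best_case) in operations per second.
    """
    small, large = STRONG_CONSISTENCY_PENALTY
    return eventual_ops * (1 - large), eventual_ops * (1 - small)


if __name__ == "__main__":
    spread = throughput_spread(MIN_THROUGHPUT_OPS, MAX_THROUGHPUT_OPS)
    print(f"throughput spread between products: {spread:.1f}x")
    # 1000 ops/s is a hypothetical baseline, not a figure from the paper.
    worst, best = strong_consistency_range(1000.0)
    print(f"strong consistency band: {worst:.0f} to {best:.0f} ops/s")
```

The spread works out to roughly 14x between the slowest and fastest product, which is far larger than the 10-25% cost attributed to strong consistency. Under these figures, product choice matters much more to throughput than the consistency setting does.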


Source publication

Proceedings of the 1st Workshop on Performance Analysis of Big Data Systems

Self-citation rate

0.00%

Number of papers published