Quantitative Analysis of Scalable NoSQL Databases
Authors: Surya Narayanan Swaminathan, R. Elmasri
Venue: 2016 IEEE International Congress on Big Data (BigData Congress)
Publication date: 2016-06-01
DOI: 10.1109/BigDataCongress.2016.49 (https://doi.org/10.1109/BigDataCongress.2016.49)
Citations: 22
Abstract
NoSQL databases are rapidly becoming the customary data platform for big data applications. These databases are emerging as an alternative to traditional relational databases and are characterized by efficient horizontal scalability, a schema-less approach to data modeling, high-performance data access, and limited querying capabilities. Because NoSQL databases generally lack transactional semantics, the choice of a particular consistency model depends on the application. It is therefore essential to examine methodically, and in detail, the performance of various databases under diverse workload conditions. Three of the most commonly used NoSQL databases (MongoDB, Cassandra, and HBase) are evaluated using the Yahoo! Cloud Serving Benchmark (YCSB), a popular benchmarking tool. The horizontal scalability of the three systems is measured under different workload conditions and varying dataset sizes. A benchmark suite that summarizes the results of the evaluation is presented.