{"title":"Energy efficiency analysis of query optimizations on MongoDB and Cassandra","authors":"Divya Mahajan, Ziliang Zong","doi":"10.1109/IGCC.2017.8323581","DOIUrl":null,"url":null,"abstract":"As big data emerges, the complexity of database workloads and database systems has increased significantly. It is no longer possible for one type of database to efficiently handle all big data applications. NoSQL databases are widely used to complement conventional SQL databases. In addition to traditional metrics such as response time and throughput, large scale NoSQL database systems pose higher requirements on energy efficiency due to the incredible volume of data (and the associated cost) that need to be stored and processed. Unfortunately, research on optimizations for energy efficiency in database systems has been historically overlooked. In this paper, we investigate numerous optimizations for two NoSQL databases (MongoDB and Cassandra) and conduct a comprehensive study on the impact of these optimizations on performance and energy efficiency. Our experimental results derived from 100GB of Twitter data reveal that 1) energy efficiency can be improved significantly for both MongoDB and Cassandra via query optimizations without degrading performance; and 2) energy efficiency does not always scale linearly with performance improvement.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IGCC.2017.8323581","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
As big data emerges, the complexity of database workloads and database systems has increased significantly. It is no longer possible for one type of database to efficiently handle all big data applications. NoSQL databases are widely used to complement conventional SQL databases. In addition to traditional metrics such as response time and throughput, large scale NoSQL database systems pose higher requirements on energy efficiency due to the incredible volume of data (and the associated cost) that need to be stored and processed. Unfortunately, research on optimizations for energy efficiency in database systems has been historically overlooked. In this paper, we investigate numerous optimizations for two NoSQL databases (MongoDB and Cassandra) and conduct a comprehensive study on the impact of these optimizations on performance and energy efficiency. Our experimental results derived from 100GB of Twitter data reveal that 1) energy efficiency can be improved significantly for both MongoDB and Cassandra via query optimizations without degrading performance; and 2) energy efficiency does not always scale linearly with performance improvement.