{"title":"Saving Money for Analytical Workloads in the Cloud","authors":"Tapan Srivastava, Raul Castro Fernandez","doi":"arxiv-2408.00253","DOIUrl":null,"url":null,"abstract":"As users migrate their analytical workloads to cloud databases, it is\nbecoming just as important to reduce monetary costs as it is to optimize query\nruntime. In the cloud, a query is billed based on either its compute time or\nthe amount of data it processes. We observe that analytical queries are either\ncompute- or IO-bound and each query type executes cheaper in a different\npricing model. We exploit this opportunity and propose methods to build cheaper\nexecution plans across pricing models that complete within user-defined runtime\nconstraints. We implement these methods and produce execution plans spanning\nmultiple pricing models that reduce the monetary cost for workloads by as much\nas 56%. We reduce individual query costs by as much as 90%. The prices chosen\nby cloud vendors for cloud services also impact savings opportunities. To study\nthis effect, we simulate our proposed methods with different cloud prices and\nobserve that multi-cloud savings are robust to changes in cloud vendor prices.\nThese results indicate the massive opportunity to save money by executing\nworkloads across multiple pricing models.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.00253","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
As users migrate their analytical workloads to cloud databases, it is
becoming just as important to reduce monetary costs as it is to optimize query
runtime. In the cloud, a query is billed based on either its compute time or
the amount of data it processes. We observe that analytical queries are either
compute- or IO-bound and each query type executes cheaper in a different
pricing model. We exploit this opportunity and propose methods to build cheaper
execution plans across pricing models that complete within user-defined runtime
constraints. We implement these methods and produce execution plans spanning
multiple pricing models that reduce the monetary cost for workloads by as much
as 56%. We reduce individual query costs by as much as 90%. The prices chosen
by cloud vendors for cloud services also impact savings opportunities. To study
this effect, we simulate our proposed methods with different cloud prices and
observe that multi-cloud savings are robust to changes in cloud vendor prices.
These results indicate the massive opportunity to save money by executing
workloads across multiple pricing models.