{"title":"为云分析工作负载节省资金","authors":"Tapan Srivastava, Raul Castro Fernandez","doi":"arxiv-2408.00253","DOIUrl":null,"url":null,"abstract":"As users migrate their analytical workloads to cloud databases, it is\nbecoming just as important to reduce monetary costs as it is to optimize query\nruntime. In the cloud, a query is billed based on either its compute time or\nthe amount of data it processes. We observe that analytical queries are either\ncompute- or IO-bound and each query type executes cheaper in a different\npricing model. We exploit this opportunity and propose methods to build cheaper\nexecution plans across pricing models that complete within user-defined runtime\nconstraints. We implement these methods and produce execution plans spanning\nmultiple pricing models that reduce the monetary cost for workloads by as much\nas 56%. We reduce individual query costs by as much as 90%. The prices chosen\nby cloud vendors for cloud services also impact savings opportunities. To study\nthis effect, we simulate our proposed methods with different cloud prices and\nobserve that multi-cloud savings are robust to changes in cloud vendor prices.\nThese results indicate the massive opportunity to save money by executing\nworkloads across multiple pricing models.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Saving Money for Analytical Workloads in the Cloud\",\"authors\":\"Tapan Srivastava, Raul Castro Fernandez\",\"doi\":\"arxiv-2408.00253\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As users migrate their analytical workloads to cloud databases, it is\\nbecoming just as important to reduce monetary costs as it is to optimize query\\nruntime. In the cloud, a query is billed based on either its compute time or\\nthe amount of data it processes. We observe that analytical queries are either\\ncompute- or IO-bound and each query type executes cheaper in a different\\npricing model. We exploit this opportunity and propose methods to build cheaper\\nexecution plans across pricing models that complete within user-defined runtime\\nconstraints. We implement these methods and produce execution plans spanning\\nmultiple pricing models that reduce the monetary cost for workloads by as much\\nas 56%. We reduce individual query costs by as much as 90%. The prices chosen\\nby cloud vendors for cloud services also impact savings opportunities. To study\\nthis effect, we simulate our proposed methods with different cloud prices and\\nobserve that multi-cloud savings are robust to changes in cloud vendor prices.\\nThese results indicate the massive opportunity to save money by executing\\nworkloads across multiple pricing models.\",\"PeriodicalId\":501123,\"journal\":{\"name\":\"arXiv - CS - Databases\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Databases\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.00253\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.00253","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Saving Money for Analytical Workloads in the Cloud
As users migrate their analytical workloads to cloud databases, it is
becoming just as important to reduce monetary costs as it is to optimize query
runtime. In the cloud, a query is billed based on either its compute time or
the amount of data it processes. We observe that analytical queries are either
compute- or IO-bound and each query type executes cheaper in a different
pricing model. We exploit this opportunity and propose methods to build cheaper
execution plans across pricing models that complete within user-defined runtime
constraints. We implement these methods and produce execution plans spanning
multiple pricing models that reduce the monetary cost for workloads by as much
as 56%. We reduce individual query costs by as much as 90%. The prices chosen
by cloud vendors for cloud services also impact savings opportunities. To study
this effect, we simulate our proposed methods with different cloud prices and
observe that multi-cloud savings are robust to changes in cloud vendor prices.
These results indicate the massive opportunity to save money by executing
workloads across multiple pricing models.