Robust Query Driven Cardinality Estimation under Changing Workloads

Proc. VLDB Endow. Pub Date : 2023-02-01 DOI:10.14778/3583140.3583164

Parimarjan Negi, Ziniu Wu, Andreas Kipf, Nesime Tatbul, Ryan Marcus, S. Madden, Tim Kraska, Mohammad Alizadeh

{"title":"Robust Query Driven Cardinality Estimation under Changing Workloads","authors":"Parimarjan Negi, Ziniu Wu, Andreas Kipf, Nesime Tatbul, Ryan Marcus, S. Madden, Tim Kraska, Mohammad Alizadeh","doi":"10.14778/3583140.3583164","DOIUrl":null,"url":null,"abstract":"Query driven cardinality estimation models learn from a historical log of queries. They are lightweight, having low storage requirements, fast inference and training, and are easily adaptable for any kind of query. Unfortunately, such models can suffer unpredictably bad performance under workload drift, i.e., if the query pattern or data changes. This makes them unreliable and hard to deploy. We analyze the reasons why models become unpredictable due to workload drift, and introduce modifications to the query representation and neural network training techniques to make query-driven models robust to the effects of workload drift. First, we emulate workload drift in queries involving some unseen tables or columns by randomly masking out some table or column features during training. This forces the model to make predictions with missing query information, relying more on robust features based on up-to-date DBMS statistics that are useful even when query or data drift happens. Second, we introduce join bitmaps, which extends sampling-based features to be consistent across joins using ideas from sideways information passing. Finally, we show how both of these ideas can be adapted to handle data updates.\n \n We show significantly greater generalization than past works across different workloads and databases. For instance, a model trained with our techniques on a simple workload (JOBLight-train), with 40\n k\n synthetically generated queries of at most 3 tables each, is able to generalize to the much more complex Join Order Benchmark, which include queries with up to 16 tables, and improve query runtimes by 2× over PostgreSQL. We show similar robustness results with data updates, and across other workloads. We discuss the situations where we expect, and see, improvements, as well as more challenging workload drift scenarios where these techniques do not improve much over PostgreSQL. However, even in the most challenging scenarios, our models never perform worse than PostgreSQL, while standard query driven models can get much worse than PostgreSQL.\n","PeriodicalId":20467,"journal":{"name":"Proc. VLDB Endow.","volume":"16 1","pages":"1520-1533"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proc. VLDB Endow.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14778/3583140.3583164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Query driven cardinality estimation models learn from a historical log of queries. They are lightweight, having low storage requirements, fast inference and training, and are easily adaptable for any kind of query. Unfortunately, such models can suffer unpredictably bad performance under workload drift, i.e., if the query pattern or data changes. This makes them unreliable and hard to deploy. We analyze the reasons why models become unpredictable due to workload drift, and introduce modifications to the query representation and neural network training techniques to make query-driven models robust to the effects of workload drift. First, we emulate workload drift in queries involving some unseen tables or columns by randomly masking out some table or column features during training. This forces the model to make predictions with missing query information, relying more on robust features based on up-to-date DBMS statistics that are useful even when query or data drift happens. Second, we introduce join bitmaps, which extends sampling-based features to be consistent across joins using ideas from sideways information passing. Finally, we show how both of these ideas can be adapted to handle data updates. We show significantly greater generalization than past works across different workloads and databases. For instance, a model trained with our techniques on a simple workload (JOBLight-train), with 40 k synthetically generated queries of at most 3 tables each, is able to generalize to the much more complex Join Order Benchmark, which include queries with up to 16 tables, and improve query runtimes by 2× over PostgreSQL. We show similar robustness results with data updates, and across other workloads. We discuss the situations where we expect, and see, improvements, as well as more challenging workload drift scenarios where these techniques do not improve much over PostgreSQL. However, even in the most challenging scenarios, our models never perform worse than PostgreSQL, while standard query driven models can get much worse than PostgreSQL.

查看原文本刊更多论文

工作负载变化下的鲁棒查询驱动基数估计

查询驱动的基数估计模型从查询的历史日志中学习。它们是轻量级的，具有低存储需求，快速推理和训练，并且很容易适应任何类型的查询。不幸的是，在工作负载漂移的情况下，例如，如果查询模式或数据发生变化，这种模型的性能可能会出现不可预测的下降。这使得它们不可靠且难以部署。我们分析了由于工作负载漂移导致模型变得不可预测的原因，并引入了对查询表示和神经网络训练技术的修改，以使查询驱动模型对工作负载漂移的影响具有鲁棒性。首先，我们通过在训练期间随机屏蔽一些表或列的特征来模拟涉及一些看不见的表或列的查询中的工作负载漂移。这迫使模型使用缺失的查询信息进行预测，更多地依赖于基于最新DBMS统计数据的健壮特性，这些特性即使在发生查询或数据漂移时也很有用。其次，我们引入了连接位图，它扩展了基于采样的特征，使用横向信息传递的思想在连接之间保持一致。最后，我们将展示如何将这两种思想应用于处理数据更新。在不同的工作负载和数据库中，我们展示了比过去更大的泛化。例如，在一个简单的工作负载(JOBLight-train)上使用我们的技术进行训练的模型，每个查询最多生成3个表的40 k综合查询，能够推广到更复杂的Join Order基准，其中包括最多16个表的查询，并且查询运行时间比PostgreSQL提高了2倍。我们在数据更新和其他工作负载中显示了类似的鲁棒性结果。我们讨论了我们期望和看到的改进情况，以及更具挑战性的工作负载漂移场景，在这些场景中，这些技术与PostgreSQL相比没有太大的改进。然而，即使在最具挑战性的场景中，我们的模型也不会比PostgreSQL表现得差，而标准的查询驱动模型可能会比PostgreSQL差得多。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proc. VLDB Endow.

自引率

0.00%

发文量