Revisiting Runtime Dynamic Optimization for Join Queries in Big Data Management Systems

ACM SIGMOD Record Pub Date : 2023-06-08 DOI:https://dl.acm.org/doi/10.1145/3604437.3604460

Christina Pavlopoulou, Michael J. Carey, Vassilis J. Tsotras

引用次数: 0

Abstract

Effective query optimization remains an open problem for Big Data Management Systems. In this work, we revisit an old idea, runtime dynamic optimization, and adapt it to a big data management system, AsterixDB. The approach runs in stages (re-optimization points), starting by first executing all predicates local to a single dataset. The intermediate result created by a stage is then used to re-optimize the remaining query. This re-optimization approach avoids inaccurate intermediate result cardinality estimates, thus leading to much better execution plans. While it introduces overhead for materializing intermediate results, experiments show that this overhead is relatively small and is an acceptable price to pay given the optimization benefits.

查看原文本刊更多论文

大数据管理系统中Join查询的运行时动态优化研究

有效的查询优化仍然是大数据管理系统的一个开放性问题。在这项工作中，我们重新审视了一个古老的思想，运行时动态优化，并将其应用于大数据管理系统AsterixDB。该方法分阶段运行(重新优化点)，首先执行单个数据集本地的所有谓词。然后使用阶段创建的中间结果重新优化剩余的查询。这种重新优化方法避免了不准确的中间结果基数估计，从而导致更好的执行计划。虽然它引入了实现中间结果的开销，但实验表明，这种开销相对较小，并且考虑到优化的好处，这是可以接受的代价。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM SIGMOD Record

自引率

0.00%

发文量