IMAX: incremental maintenance of schema-based XML statistics

21st International Conference on Data Engineering (ICDE'05) Pub Date : 2005-04-05 DOI:10.1109/ICDE.2005.75

Maya Ramanath, L. Zhang, J. Freire, J. Haritsa

引用次数: 10

Abstract

Current approaches for estimating the cardinality of XML queries are applicable to a static scenario wherein the underlying XML data does not change subsequent to the collection of statistics on the repository. However, in practice, many XML-based applications are dynamic and involve frequent updates to the data. In this paper, we investigate efficient strategies for incrementally maintaining statistical summaries as and when updates are applied to the data. Specifically, we propose algorithms that handle both the addition of new documents as well as random insertions in the existing document trees. We also show, through a detailed performance evaluation, that our incremental techniques are significantly faster than the naive recomputation approach; and that estimation accuracy can be maintained even with a fixed memory budget.

查看原文本刊更多论文

IMAX:基于模式的XML统计信息的增量维护

目前估计XML查询基数的方法适用于静态场景，其中底层XML数据在存储库的统计信息收集之后不会发生变化。然而，在实践中，许多基于xml的应用程序是动态的，并且涉及对数据的频繁更新。在本文中，我们研究了当更新应用于数据时增量式维护统计摘要的有效策略。具体来说，我们提出的算法既可以处理新文档的添加，也可以处理现有文档树中的随机插入。我们还通过详细的性能评估表明，我们的增量技术比单纯的重新计算方法要快得多;即使在内存预算固定的情况下，也可以保持估计的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

21st International Conference on Data Engineering (ICDE'05)

自引率

0.00%

发文量