基于成本的非平衡r树

Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001 Pub Date : 2001-07-18 DOI:10.1109/SSDM.2001.938552

K. A. Ross, I. Sitzmann, Peter James Stuckey

{"title":"基于成本的非平衡r树","authors":"K. A. Ross, I. Sitzmann, Peter James Stuckey","doi":"10.1109/SSDM.2001.938552","DOIUrl":null,"url":null,"abstract":"Cost-based unbalanced R-trees (CUR-trees) are a cost-function-based data structure for spatial data. CUR-trees are constructed specifically to improve the evaluation of intersection queries, the most basic selection query in an R-tree. A CUR-tree is built taking into account a given query distribution for the queries and a cost model for their execution. Depending on the expected frequency of access, objects or subtrees are stored higher up in the tree. After each insertion in the tree, local reorganizations of a node and its children have their expected query cost evaluated, and a reorganization is performed if this is beneficial. No strict balancing of the trees applies, allowing the tree to unfold solely based on the result of the cost evaluation. We present our cost-based approach and describe the evaluation and reorganization operations based on the cost function. We present a cost model for in-memory access costs and we present three different query models. In our experiments, we compare the performance of the CUR-tree to the R-tree and the R*-tree. The CUR-tree is able to significantly improve intersection query performance, without unacceptably increasing the cost of building the tree. The use of R-trees for in-memory data reflects the high (and growing) cost of bringing data from RAM into the CPU cache relative to the cost of other computations.","PeriodicalId":129323,"journal":{"name":"Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Cost-based unbalanced R-trees\",\"authors\":\"K. A. Ross, I. Sitzmann, Peter James Stuckey\",\"doi\":\"10.1109/SSDM.2001.938552\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cost-based unbalanced R-trees (CUR-trees) are a cost-function-based data structure for spatial data. CUR-trees are constructed specifically to improve the evaluation of intersection queries, the most basic selection query in an R-tree. A CUR-tree is built taking into account a given query distribution for the queries and a cost model for their execution. Depending on the expected frequency of access, objects or subtrees are stored higher up in the tree. After each insertion in the tree, local reorganizations of a node and its children have their expected query cost evaluated, and a reorganization is performed if this is beneficial. No strict balancing of the trees applies, allowing the tree to unfold solely based on the result of the cost evaluation. We present our cost-based approach and describe the evaluation and reorganization operations based on the cost function. We present a cost model for in-memory access costs and we present three different query models. In our experiments, we compare the performance of the CUR-tree to the R-tree and the R*-tree. The CUR-tree is able to significantly improve intersection query performance, without unacceptably increasing the cost of building the tree. The use of R-trees for in-memory data reflects the high (and growing) cost of bringing data from RAM into the CPU cache relative to the cost of other computations.\",\"PeriodicalId\":129323,\"journal\":{\"name\":\"Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SSDM.2001.938552\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSDM.2001.938552","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

摘要

基于成本的非平衡r树是一种基于成本函数的空间数据结构。curc树是专门为改进交集查询的求值而构造的，交集查询是r树中最基本的选择查询。考虑到查询的给定查询分布及其执行的成本模型，构建了cur树。根据预期的访问频率，对象或子树存储在树的更高位置。在树中的每次插入之后，对节点及其子节点的局部重组评估其预期查询成本，如果这样做有益，则执行重组。没有严格的树的平衡应用，允许树展开完全基于成本评估的结果。我们提出了基于成本的方法，并描述了基于成本函数的评估和重组操作。我们提出了内存访问成本的成本模型，并提出了三种不同的查询模型。在实验中，我们将CUR-tree的性能与R-tree和R*-tree进行了比较。curc -tree能够显著提高交集查询性能，而不会不可接受地增加构建树的成本。对内存中的数据使用r树反映了相对于其他计算的成本，将数据从RAM带入CPU缓存的成本很高(而且还在不断增长)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Cost-based unbalanced R-trees

Cost-based unbalanced R-trees (CUR-trees) are a cost-function-based data structure for spatial data. CUR-trees are constructed specifically to improve the evaluation of intersection queries, the most basic selection query in an R-tree. A CUR-tree is built taking into account a given query distribution for the queries and a cost model for their execution. Depending on the expected frequency of access, objects or subtrees are stored higher up in the tree. After each insertion in the tree, local reorganizations of a node and its children have their expected query cost evaluated, and a reorganization is performed if this is beneficial. No strict balancing of the trees applies, allowing the tree to unfold solely based on the result of the cost evaluation. We present our cost-based approach and describe the evaluation and reorganization operations based on the cost function. We present a cost model for in-memory access costs and we present three different query models. In our experiments, we compare the performance of the CUR-tree to the R-tree and the R*-tree. The CUR-tree is able to significantly improve intersection query performance, without unacceptably increasing the cost of building the tree. The use of R-trees for in-memory data reflects the high (and growing) cost of bringing data from RAM into the CPU cache relative to the cost of other computations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001

自引率

0.00%

发文量