Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)最新文献_第3页

A cost model for estimating the performance of spatial joins using R-trees 使用r树估计空间连接性能的代价模型

Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150) Pub Date : 1997-08-11 DOI: 10.1109/SSDM.1997.621148

Yun-Wu Huang, N. Jing, Elke A. Rundensteiner

{"title":"A cost model for estimating the performance of spatial joins using R-trees","authors":"Yun-Wu Huang, N. Jing, Elke A. Rundensteiner","doi":"10.1109/SSDM.1997.621148","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621148","url":null,"abstract":"The development of a cost model for predicting the performance of spatial joins has been identified in the literature as an important and difficult problem. The authors present the first cost model that can predict the performance of spatial joins using R-trees. Based on two existing R-trees (join targets), the model first estimates the number of expected I/Os for the join process by assuming a zero buffer size. The method for this estimation extends the cost model for R-tree window queries (developed by Kamel and Faloutsos (1993) and by Pagel et al. (1993)) to also handle spatial joins (which are more complex). In the context of spatial join processing, this number of zero-buffer expected I/Os is not practical for performance prediction in a buffered environment. To model the buffer impact, they use an (exponential) distribution function to measure the probability that a bufferless I/O would cause a page fault in a buffered environment. Based on this probability and the zero-buffer expected I/O cost, the estimated number of I/Os for an R-tree join can then be computed. The comparisons between the predictions from the cost model and the actual results from the experiments based on real GIS maps show that the average relative error ratio is about 10% with a maximum of about 20% for a wide range of buffer sizes. Therefore, our model is a useful tool for the query optimization of spatial join queries.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121830820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 53

VANILLA: a dynamic data schema for a generic scientific database VANILLA:通用科学数据库的动态数据模式

Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150) Pub Date : 1997-08-11 DOI: 10.1109/SSDM.1997.621163

Karla Massey, L. Kerschberg, George Michaels

引用次数: 4

A prototype metadata database for online analytical processing of environmental data 一种用于环境数据在线分析处理的元数据原型数据库

Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150) Pub Date : 1997-08-11 DOI: 10.1109/SSDM.1997.621157

H. Geller, Sarah Conger, John Ertlschweiger, August J. Ryberg

引用次数: 2

LOGOS: a computational framework for neuroinformatics research LOGOS:神经信息学研究的计算框架

Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150) Pub Date : 1997-08-11 DOI: 10.1109/SSDM.1997.621190

Michael Stiber, G. Jacobs, D. Swanberg

引用次数: 6

Query pre-execution and batching in Paradise: a two-pronged approach to the efficient processing of queries on tape-resident raster images Paradise中的查询预执行和批处理:一种有效处理驻留在磁带上的光栅图像的查询的双管齐下的方法

Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150) Pub Date : 1997-08-11 DOI: 10.1109/SSDM.1997.621153

Jie-Bing Yu, D. DeWitt

{"title":"Query pre-execution and batching in Paradise: a two-pronged approach to the efficient processing of queries on tape-resident raster images","authors":"Jie-Bing Yu, D. DeWitt","doi":"10.1109/SSDM.1997.621153","DOIUrl":"https://doi.org/10.1109/SSDM.1997.621153","url":null,"abstract":"The focus of the Paradise project (D. DeWitt et al., 194; J. Patel et al., 1997) is to design and implement a scalable database system capable of storing and processing massive data sets such as those produced by NASA's EOSDIS project. The paper describes extensions to Paradise to handle the execution of queries involving collections of satellite images stored on tertiary storage. Several modifications were made to Paradise in order to make the execution of such queries both transparent to the user and efficient. First, the Paradise storage engine (the SHORE storage manager) was extended to support tertiary storage using a log structured organization for tape volumes. Second, the Paradise query processing engine was modified to incorporate a number of novel mechanisms including query pre execution, object abstraction, cache conscious tape scheduling, and query batching. A performance evaluation on a working prototype demonstrates that, together, these techniques can provide a dramatic improvement over more traditional approaches to the management of data stored on tape.","PeriodicalId":159935,"journal":{"name":"Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125671508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 27

Metadata: a case study from the environmental sciences 元数据:一个来自环境科学的案例研究

Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150) Pub Date : 1997-08-11 DOI: 10.1109/SSDM.1997.621182

F. Bretherton, W. Hibbard

引用次数: 5

Scientific Databases: A Challenge in Interdisciplinary Education 科学数据库:跨学科教育的挑战

Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150) Pub Date : 1997-08-11 DOI: 10.1109/SSDM.1997.621193

L. Kerschberg, M. Kafatos, George Michaels, John Cherniasky

引用次数: 0

ESMDIS: Earth System Model Data Information System 地球系统模型数据信息系统

Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150) Pub Date : 1997-08-11 DOI: 10.1109/SSDM.1997.621169

Y. Chi, C. Mechoso, M. Stonebraker, K. Sklower, R. Troy, R. Muntz, E. Mesrobian

引用次数: 3

Knowledge discovery in an earthquake text database: correlation between significant earthquakes and the time of day 地震文本数据库中的知识发现:重大地震与时间之间的相关性

Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150) Pub Date : 1997-08-11 DOI: 10.1109/SSDM.1997.621144

J. Goldman, D. S. Parker, W. Chu

引用次数: 4

A spatial data cube concept to support data analysis in environmental epidemiology 支持环境流行病学数据分析的空间数据立方体概念

Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150) Pub Date : 1997-08-11 DOI: 10.1109/SSDM.1997.621161

V. Kamp, L. Sitzmann, Frank Wietek

引用次数: 4