Query pre-execution and batching in Paradise: a two-pronged approach to the efficient processing of queries on tape-resident raster images
Authors: Jie-Bing Yu, D. DeWitt
Published in: Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150), 1997-08-11
DOI: 10.1109/SSDM.1997.621153 (https://doi.org/10.1109/SSDM.1997.621153)
Citations: 27
Abstract
The focus of the Paradise project (D. DeWitt et al., 1994; J. Patel et al., 1997) is to design and implement a scalable database system capable of storing and processing massive data sets such as those produced by NASA's EOSDIS project. The paper describes extensions to Paradise that handle the execution of queries over collections of satellite images stored on tertiary storage. Several modifications were made to Paradise to make the execution of such queries both transparent to the user and efficient. First, the Paradise storage engine (the SHORE storage manager) was extended to support tertiary storage using a log-structured organization for tape volumes. Second, the Paradise query processing engine was modified to incorporate a number of novel mechanisms, including query pre-execution, object abstraction, cache-conscious tape scheduling, and query batching. A performance evaluation on a working prototype demonstrates that, together, these techniques can provide a dramatic improvement over more traditional approaches to managing data stored on tape.
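The abstract names query batching and tape scheduling without describing how they work. The sketch below illustrates only the general idea of merging the tape reads of several queries into one ordered pass per volume. It is not the algorithm used in Paradise, and the names `TapeRead`, `schedule_batch`, the volume labels, and the offsets are all hypothetical. It also assumes the list of needed tape blocks is already known for each query.

```python
from __future__ import annotations

from collections import defaultdict
from typing import NamedTuple


class TapeRead(NamedTuple):
    """One tape block that a query needs (all fields are hypothetical)."""
    query_id: int  # which query asked for the block
    volume: str    # label of the tape volume holding the block
    offset: int    # position of the block on that volume


def schedule_batch(reads: list[TapeRead]) -> list[tuple[str, int, list[int]]]:
    """Merge the reads of a batch of queries into one pass per tape volume.

    Returns (volume, offset, query_ids) tuples in visiting order: reads on
    the same volume are adjacent, offsets ascend within a volume, and a
    block wanted by several queries is listed (and so fetched) only once.
    """
    wanted: dict[tuple[str, int], list[int]] = defaultdict(list)
    for r in reads:
        wanted[(r.volume, r.offset)].append(r.query_id)
    # Sorting the (volume, offset) keys groups by volume, then by position.
    return [(vol, off, sorted(qids)) for (vol, off), qids in sorted(wanted.items())]


if __name__ == "__main__":
    batch = [
        TapeRead(1, "T2", 900),
        TapeRead(2, "T1", 40),
        TapeRead(1, "T1", 500),
        TapeRead(2, "T2", 900),  # same block as query 1: fetched once
    ]
    for step in schedule_batch(batch):
        print(step)
```

In this toy model the drive mounts each volume once and moves forward only, which is the usual motivation for ordering tape requests. A real scheduler would also have to account for what is already in the disk cache, which this sketch ignores.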