{"title":"Cardinality estimation in ETL processes","authors":"Maik Thiele, Tim Kiefer, Wolfgang Lehner","doi":"10.1145/1651291.1651302","DOIUrl":"https://doi.org/10.1145/1651291.1651302","url":null,"abstract":"The cardinality estimation in ETL processes is particularly difficult. Aside from the well-known SQL operators, which are also used in ETL processes, there are a variety of operators without exact counterparts in the relational world. In addition to those, we find operators that support very specific data integration aspects. For such operators, there are no well-examined statistic approaches for cardinality estimations. Therefore, we propose a black-box approach and estimate the cardinality using a set of statistic models for each operator. We discuss different model granularities and develop an adaptive cardinality estimation framework for ETL processes. We map the abstract model operators to specific statistic learning approaches (regression, decision trees, support vector machines, etc.) and evaluate our cardinality estimations in an extensive experimental study.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124453872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Consistency-aware evaluation of OLAP queries in replicated data warehouses","authors":"Javier García-García, C. Ordonez","doi":"10.1145/1651291.1651305","DOIUrl":"https://doi.org/10.1145/1651291.1651305","url":null,"abstract":"OLAP tools for distributed data warehouses generally assume underlying replicated tables are up to date. Unfortunately, maintaining updated replicas is difficult due to the inherent tradeoff between consistency and availability. In this paper, we propose techniques to evaluate OLAP queries in distributed data warehouses assuming a lazy replication model. Considering that it may be admissible to evaluate OLAP queries with slightly outdated replicated tables, our technique first efficiently computes the degree of obsolescence of replicated local tables and when such result is acceptable, given an error threshold, then the query is evaluated locally, avoiding the transmission of large tables over the network. Otherwise, the query can be remotely evaluated less efficiently with the master copy of tables, provided they are stored at a single site. Inconsistency measurement is computed by adapting distributed set reconciliation algorithms to efficiently compute the symmetric difference between the master and replicated tables. Our improved distributed database algorithm has linear communication complexity and cubic time complexity in the size of the symmetric difference, which is expected to be small in a replicated data warehouse. Our technique is independent of the method employed to propagate data warehouse insertions, deletions and updates. We present experiments simulating distributed databases, with different CPU and transmission speeds, showing our method is effective to decide if the query should be evaluated either locally or remotely.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127691472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"View usability and safety for the answering of top-k queries via materialized views","authors":"Eftychia Baikousi, Panos Vassiliadis","doi":"10.1145/1651291.1651308","DOIUrl":"https://doi.org/10.1145/1651291.1651308","url":null,"abstract":"In this paper, we investigate the problem of answering top-k queries via materialized views. We provide theoretical guarantees for the adequacy of a view to answer a top-k query, along with algorithmic techniques to compute the query via a view when this is possible. We explore the problem of answering a query via a combination of more than one view and show that it is impossible to improve our theoretical guarantees for the answering of a query via a combination of views. Finally, we experimentally assess our approach for its effectiveness and efficiency.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125881987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Generating data quality rules and integration into ETL process","authors":"J. Rodic, M. Baranović","doi":"10.1145/1651291.1651303","DOIUrl":"https://doi.org/10.1145/1651291.1651303","url":null,"abstract":"Many data quality projects are integrated into data warehouse projects without enough time allocated for the data quality part, which leads to a need for a quicker data quality process implementation that can be easily adopted as the first stage of data warehouse implementation. We will see that many data quality rules can be implemented in a similar way, and thus generated based on metadata tables that store information about the rules. These generated rules are then used to check data in designated tables and mark erroneous records, or to do certain updates of invalid data. We will also store information about the rules violations in order to provide analysis of such data. This could give a significant insight into our source systems. Entire data quality process will be integrated into ETL process in order to achieve load of data warehouse that is as automated, as correct and as quick as possible. Only small number of records would be left for manual inspection and reprocessing.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125915090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Defining ETL worfklows using BPMN and BPEL","authors":"Z. E. Akkaoui, E. Zimányi","doi":"10.1145/1651291.1651299","DOIUrl":"https://doi.org/10.1145/1651291.1651299","url":null,"abstract":"Decisional systems are crucial for enterprise improvement. They allow the consolidation of heterogeneous data from distributed enterprise data stores into strategic indicators. An essential component of this data consolidation is the Extract, Transform, and Load (ETL) process. In the research literature there has been very few work defining conceptual models for ETL processes. At the same time, there are currently many tools that manage such processes. However, each tool uses its own model, which is not necessarily able to communicate with the models of other tools. In this paper, we propose a platform-independent conceptual model of ETL processes based on the Business Process Model Notation (BPMN) standard. We also show how such a conceptual model can be implemented using Business Process Execution Language (BPEL), a standard executable language for specifying interactions with web services.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133346817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A comprehensive approach to data warehouse testing","authors":"M. Golfarelli, S. Rizzi","doi":"10.1145/1651291.1651295","DOIUrl":"https://doi.org/10.1145/1651291.1651295","url":null,"abstract":"Testing is an essential part of the design life-cycle of any software product. Nevertheless, while most phases of data warehouse design have received considerable attention in the literature, not much has been said about data warehouse testing. In this paper we introduce a number of data mart-specific testing activities, we classify them in terms of what is tested and how it is tested, and we discuss how they can be framed within a reference design methodology.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130853626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Discovering functional dependencies for multidimensional design","authors":"Oscar Romero, Diego Calvanese, A. Abelló, M. Rodriguez-Muro","doi":"10.1145/1651291.1651293","DOIUrl":"https://doi.org/10.1145/1651291.1651293","url":null,"abstract":"Nowadays, it is widely accepted that the data warehouse design task should be largely automated. Furthermore, the data warehouse conceptual schema must be structured according to the multidimensional model and as a consequence, the most common way to automatically look for subjects and dimensions of analysis is by discovering functional dependencies (as dimensions functionally depend on the fact) over the data sources. Most advanced methods for automating the design of the data warehouse carry out this process from relational OLTP systems, assuming that a RDBMS is the most common kind of data source we may find, and taking as starting point a relational schema. In contrast, in our approach we propose to rely instead on a conceptual representation of the domain of interest formalized through a domain ontology expressed in the DL-Lite Description Logic. We propose an algorithm to discover functional dependencies from the domain ontology that exploits the inference capabilities of DL-Lite, thus fully taking into account the semantics of the domain. We also provide an evaluation of our approach in a real-world scenario.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"339 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122543079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic generation of ETL processes from conceptual models","authors":"Lilia Muñoz, J. Mazón, J. Trujillo","doi":"10.1145/1651291.1651298","DOIUrl":"https://doi.org/10.1145/1651291.1651298","url":null,"abstract":"Data warehouses (DW) integrate different data sources in order to give a multidimensional view of them to the decision-maker. To this aim, the ETL (Extraction, Transformation and Load) processes are responsible for extracting data from heterogeneous operational data sources, their transformation (conversion, cleaning, standardization, etc.), and its load in the DW. In recent years, several conceptual modeling approaches have been proposed for designing ETL processes. Although these approaches are very useful for documenting ETL processes and supporting the designer tasks, these proposals fail to give mechanisms to carry out an automatic code generation stage. Such a stage should be required to both avoid fails and save development time in the implementation of complex ETL process. Therefore, in this paper we define an approach for the automatic code generation of ETL processes. To this aim, we align the modeling of ETL processes in DW with MDA (Model Driven Architecture) by formally defining a set of QVT (Query, View, Transformation) transformations.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"148 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128442363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A set of aggregation functions for spatial measures","authors":"J. Silva, V. Times, A. Salgado, Clenúbio Souza, R. Fidalgo, A. Oliveira","doi":"10.1145/1458432.1458438","DOIUrl":"https://doi.org/10.1145/1458432.1458438","url":null,"abstract":"A number of studies have been developed in recent years aimed at integrating pertinent concepts and technologies for analytical multidimensional (OLAP) and geographic (GIS) processing environments. This type of integrated environment has been identified as SOLAP (Spatial OLAP). However, due to the fact that these two technologies were conceived with different purposes in mind, the interaction of the two environments is not an easy task and even with so much research being developed, there remain unresolved issues that merit exploration. One such issue refers to aggregation functions for measures. These functions are currently used in the definition of multidimensional and geographic data cubes. The aim of this paper is to present a set of aggregation functions for geographic measures. We also show these functions in practice, by taking into account their use with a SOLAP architecture prototype. This SOLAP prototype is based on a model for Geographic Data Warehouse (GDW), a data cube model and a geographic multidimensional query language.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124479926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient OLAP with UDFs","authors":"Zhibo Chen, C. Ordonez","doi":"10.1145/1458432.1458440","DOIUrl":"https://doi.org/10.1145/1458432.1458440","url":null,"abstract":"Since the early 1990s, On-Line Analytical Processing (OLAP) has been a well studied research topic that has focused on implementation outside the database, either with OLAP servers or entirely within the client computers. Our approach involves the computation and storage of OLAP cubes using User-Defined Functions (UDF) with a database management system. UDFs offer users a chance to write their own code that can then called like any other standard SQL function. By generating OLAP cubes within a UDF, we are able to create the entire lattice in main memory. The UDF also allows the user to assert more control over the actual generation process than when using standard OLAP functions such as the CUBE operator. We introduce a data structure that can not only efficiently create an OLAP lattice in main memory, but also be adapted to generate association rule itemsets with minimal change. We experimentally show that the UDF approach is more efficient than SQL using one real dataset and a synthetic dataset. Also, we present several experiments showing that generating association rule itemsets using the UDF approach is comparable to a SQL approach. In this paper, we show that techniques such as OLAP and association rules can be efficiently pushed into the UDF, and has better performance, in most cases, compared to standard SQL functions.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116879903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}