Bijoux: Data Generator for Evaluating ETL Process Quality
Emona Nakuçi, V. Theodorou, P. Jovanovic, A. Abelló
International Workshop on Data Warehousing and OLAP, 2014-11-07. DOI: 10.1145/2666158.2666183

Abstract: Obtaining the right set of data for evaluating the fulfillment of different quality standards in the extract-transform-load (ETL) process design is rather challenging. First, the real data might be out of reach due to privacy constraints, while providing a synthetic set of data is known to be a labor-intensive task that needs to take various combinations of process parameters into account. Additionally, a single dataset usually does not represent the evolution of data throughout the complete process lifespan, hence missing the plethora of possible test cases. To facilitate this demanding task, in this paper we propose an automatic data generator (i.e., Bijoux). Starting from a given ETL process model, Bijoux extracts the semantics of data transformations, analyzes the constraints they imply over data, and automatically generates testing datasets. At the same time, it considers different dataset and transformation characteristics (e.g., size, distribution, selectivity) in order to cover a variety of test scenarios. We report experimental findings showing the effectiveness and scalability of our approach.
An Advanced Data Warehouse for Integrating Large Sets of GPS Data
O. Andersen, Benjamin B. Krogh, Christian Thomsen, K. Torp
International Workshop on Data Warehousing and OLAP, 2014-11-07. DOI: 10.1145/2666158.2666172

Abstract: GPS data recorded from driving vehicles is available from many sources and is a very good data foundation for answering traffic-related queries. However, most approaches so far have not considered combining GPS data from many sources into a single data warehouse. Further, the integration of GPS data with fuel-consumption data (from the so-called CAN bus in the vehicles) and weather data has not been done. In this paper, we propose a data warehouse design for handling GPS data, fuel-consumption data, and weather data. The design is fully implemented in a running system using the PostgreSQL DBMS. The system has been in production since March 2011, and the main fact table today contains approximately 3.4 billion rows from 16 different data sources. We show that the system can be used for a number of novel traffic-related analyses, such as relating the fuel consumption of vehicles to the road network and road congestion.
{"title":"Big Graph Analytics: The State of the Art and Future Research Agenda","authors":"A. Cuzzocrea, I. Song","doi":"10.1145/2666158.2668454","DOIUrl":"https://doi.org/10.1145/2666158.2668454","url":null,"abstract":"Analytics over big graphs is becoming a first-class challenge in database research, with fast-growing interest from both the academia and the industrial community. This problem arises in several application scenarios, ranging from social networks to large-scale network systems, from knowledge discovery to cybersecurity, and so forth. Following this major trend, this paper explores actual state-of-the-art results in the area of analytics over big graphs and discusses open research issues and actual trends in such area.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129407081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GOLAM: A Framework for Analyzing Genomic Data
Lorenzo Baldacci, M. Golfarelli, Simone Graziani, S. Rizzi
International Workshop on Data Warehousing and OLAP, 2014-11-07. DOI: 10.1145/2666158.2666175

Abstract: Emerging medical models aim to leverage high-throughput genome-sequencing technologies to better target drugs to patients' personal profiles and so increase their effectiveness. However, the huge amount of data made available by these technologies calls for sophisticated and automated analysis techniques. In this direction we present GOLAM, a framework for OLAP analysis and mining of matches between genomic regions extracted from ENCODE, a worldwide available collection of shared genomic data. The goal of GOLAM is to overcome the current limitations of genome-analysis methods, which are normally based on browsing, by partially automating and speeding up the analysis process on the one hand, and by making it more flexible and introducing a multi-resolution view of the data on the other. The framework has so far been partially implemented; in this paper we focus on conveying its potential and on describing its functional architecture and the underlying data models.
{"title":"Can we analyze big data inside a DBMS?","authors":"C. Ordonez","doi":"10.1145/2513190.2513198","DOIUrl":"https://doi.org/10.1145/2513190.2513198","url":null,"abstract":"Relational DBMSs remain the main data management technology, despite the big data analytics and no-SQL waves. On the other hand, for data analytics in a broad sense, there are plenty of non-DBMS tools including statistical languages, matrix packages, generic data mining programs and large-scale parallel systems, being the main technology for big data analytics. Such large-scale systems are mostly based on the Hadoop distributed file system and MapReduce. Thus it would seem a DBMS is not a good technology to analyze big data, going beyond SQL queries, acting just as a reliable and fast data repository. In this survey, we argue that is not the case, explaining important research that has enabled analytics on large databases inside a DBMS. However, we also argue DBMSs cannot compete with parallel systems like MapReduce to analyze web-scale text data. Therefore, each technology will keep influencing each other. We conclude with a proposal of long-term research issues, considering the \"big data analytics\" trend.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130100681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using REO on ETL conceptual modelling: a first approach","authors":"Bruno Oliveira, O. Belo","doi":"10.1145/2513190.2513202","DOIUrl":"https://doi.org/10.1145/2513190.2513202","url":null,"abstract":"The formalization of software patterns has proven to be very useful in software developing, improving systems communication, data interchange across platforms, and simplifying the integration of processes and data flows. Populating a data warehouse (ETL) is often a very complex task demanding significant computational resources. It faces many drawbacks during its design and implementation, involving not only large volumes of data that must be processed but also undesirable change of business requirements. All of this leads frequently to reuse significant parts of other ETL implementations, adapting data structures and processes to comply with new requirements. Additionally, we believe that it's necessary a more simply and reliable approach for ETL conceptual modelling covering the \"lack of mature\" of this important part of ETL development. In this paper we explored a new approach to ETL conceptual modelling using the Reo coordination language, trying to evaluate its adequacy and expressiveness on the coordination of ETL tasks. A pattern-based approach was designed to map typical operations used in real world ETL scenarios from an initial Reo specification. For demonstration purposes, we present and discuss as two case studies, a slowly changing dimension and a surrogated key pipelining processes.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127808087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"INDREX: in-database distributional relation extraction","authors":"T. Kilias, Alexander Löser, Periklis Andritsos","doi":"10.1145/2513190.2513196","DOIUrl":"https://doi.org/10.1145/2513190.2513196","url":null,"abstract":"Relation extraction transforms the textual representation of a relationship into the relational model of a data warehouse. Early systems, such as SystemT by IBM or the open source system GATE solve this task with handcrafted rule sets that the system executes document-by-document. Thereby the user must execute a highly interactive and iterative process of reading a document, of expressing rules, of testing these rules on the next document and of refining rules. Until now, these systems do neither leverage the full potential of built-in declarative query languages nor the indexing and query optimization techniques of a modern RDBMS that would enable a user interactive rule refinement across documents and on the entire corpus. We propose the INDREX system that enables a user for the first time to describe corpus-wide extraction tasks in a declarative language and permits the user to run interactive rule refinement queries. For enabling this powerful functionality we extend a standard PostgreSQL with a set of white-box user-defined functions that enable corpus-wide transformations from sentences into relationships. We store the text corpus and rules in the same RDBMS that already holds domain specific structured data. As a result, (1) the user can leverage this data to further adapt rules to the target domain, (2) the user does not need an additional system for rule extraction and (3) the INDREX system can leverage the full power of built-in indexing and query optimization techniques of the underlaying RDBMS. In a preliminary study we report on the feasibility of this disruptive approach and show multiple queries in INDREX on the Reuters Corpus, Volume 1.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"215 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116823655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Meta-stars: multidimensional modeling for social business intelligence","authors":"E. Gallinucci, M. Golfarelli, S. Rizzi","doi":"10.1145/2513190.2513195","DOIUrl":"https://doi.org/10.1145/2513190.2513195","url":null,"abstract":"Social business intelligence is the discipline of combining corporate data with user-generated content (UGC) to let decision-makers improve their business based on the trends perceived from the environment. A key role in the analysis of textual UGC is played by topics, meant as specific concepts of interest within a subject area. To enable aggregations of topics at different levels, a topic hierarchy is to be defined. Some attempts have been made to address some of the peculiarities of topic hierarchies, but no comprehensive solution has been found so far. The approach we propose to model topic hierarchies in ROLAP systems is called meta-stars. Its basic idea is to use meta-modeling coupled with navigation tables and with traditional dimension tables: navigation tables support hierarchy instances with different lengths and with non-leaf facts, and allow different roll-up semantics to be explicitly annotated; meta-modeling enables hierarchy heterogeneity and dynamics to be accommodated; dimension tables are easily integrated with standard business hierarchies. After outlining a reference architecture for social business intelligence and describing the meta-star approach, we discuss its effectiveness and efficiency by showing its querying expressiveness and by presenting some experimental results for query performances.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"2011 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131871642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
CXT-cube: contextual text cube model and aggregation operator for text OLAP
Lamia Oukid, Ounas Asfari, F. Bentayeb, N. Benblidia, Omar Boussaïd
International Workshop on Data Warehousing and OLAP, 2013-10-28. DOI: 10.1145/2513190.2513201

Abstract: Traditional data warehousing technologies and On-Line Analytical Processing (OLAP) are unable to analyze textual data. Moreover, as the OLAP queries of a decision-maker are generally related to a context, contextual information must be taken into account during the exploitation of data warehouses. We therefore propose a contextual text cube model, denoted CXT-Cube, which considers several contextual factors during OLAP analysis in order to better account for the contextual information associated with textual data. CXT-Cube is characterized by several contextual dimensions, each one related to a contextual factor. In addition, we extend our OLAP aggregation operator for textual data, ORank (OLAP-Rank), to consider all the contextual factors defined in the CXT-Cube model. To validate the model, we perform an experimental study; the preliminary results show the value of our approach for integrating textual data into a data warehouse and improving decision-making.
{"title":"Lazy data structure maintenance for main-memory analytics over sliding windows","authors":"Chang Ge, Lukasz Golab","doi":"10.1145/2513190.2513203","DOIUrl":"https://doi.org/10.1145/2513190.2513203","url":null,"abstract":"We address the problem of maintaining data structures used by memory-resident data warehouses that store sliding windows. We propose a framework that eagerly expires data from the sliding window to save space and/or satisfy data retention policies, but lazily maintains the associated data structures to reduce maintenance overhead. Using a dictionary as an example, we show that our framework enables maintenance algorithms that outperform existing approaches in terms of space overhead, maintenance overhead, and dictionary lookup overhead during query execution.","PeriodicalId":335396,"journal":{"name":"International Workshop on Data Warehousing and OLAP","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129961469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}