2014 IEEE 30th International Conference on Data Engineering Workshops最新文献_第4页

Mapping abstract queries to big data web resources for on-the-fly data integration and information retrieval 将抽象查询映射到大数据网络资源，实现实时数据集成和信息检索

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818304

H. Jamil

{"title":"Mapping abstract queries to big data web resources for on-the-fly data integration and information retrieval","authors":"H. Jamil","doi":"10.1109/ICDEW.2014.6818304","DOIUrl":"https://doi.org/10.1109/ICDEW.2014.6818304","url":null,"abstract":"The emergence of technologies such as XML, web services and cloud computing have helped, the proliferation of databases and their diversity pose serious barriers to meaningful information extraction from these “big databases”. Research in intention recognition has also progressed substantially, yet very little has been done to recognize query intents to search, select, map and extract responses from such enormous pools of candidate databases. Query mapping becomes truly complicated particularly in scientific databases where tools and functions are needed to interpret the database contents, semantics of which are usually hidden inside the functions. In this paper, we present a declarative meta-language, called BioVis, using which biologists potentially are able to express their “intentional queries” with the expectation that a mapping function μ is able to accurately understand the meaning of the queries and map them to the underlying resources appropriately. We show that such a function is technically feasible if we can design a schema mapping function that can tailor itself according to a knowledgebase and recognize entities in schema graphs. We offer this idea as a possible research problem for the community to address.","PeriodicalId":302600,"journal":{"name":"2014 IEEE 30th International Conference on Data Engineering Workshops","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126798963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

In schema matching, even experts are human: Towards expert sourcing in schema matching 在模式匹配中，专家也是人:走向模式匹配中的专家溯源

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818301

Tomer Sagi, A. Gal

引用次数: 8

Interactive data exploration based on user relevance feedback 基于用户相关性反馈的交互式数据探索

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-03-01 DOI: 10.1109/ICDEW.2014.6818343

Kyriaki Dimitriadou, Olga Papaemmanouil, Y. Diao

引用次数: 3

Aggregation of similarity measures in schema matching based on generalized mean 基于广义均值的模式匹配中相似测度的聚合

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-03-01 DOI: 10.1109/ICDEW.2014.6818306

Faten A. Elshwimy, Alsayed Algergawy, A. Sarhan, E. Sallam

{"title":"Aggregation of similarity measures in schema matching based on generalized mean","authors":"Faten A. Elshwimy, Alsayed Algergawy, A. Sarhan, E. Sallam","doi":"10.1109/ICDEW.2014.6818306","DOIUrl":"https://doi.org/10.1109/ICDEW.2014.6818306","url":null,"abstract":"Schema matching represents a critical step to integrate heterogeneous e-Business and shared-data applications. Most existing schema matching approaches rely heavily on similarity-based techniques, which attempt to discover correspondences based on various element similarity measures, each computed by an individual base matcher. It has been accepted that aggregating results of multiple base matchers is a promising technique to obtain more accurate matching correspondences. A number of current matching systems use experimental weights for aggregation of similarities among different element matchers while others use machine learning approaches to find optimal weights that should be assigned to different matchers. However, both approaches have their own deficiencies. To overcome the limitations of existing aggregation strategies and to achieve better performance, in this paper, we propose a new aggregation strategy, called the AHGM strategy, which aggregates multiple element matchers based on the concept of generalized mean. In particular, we first develop a practical way to obtain optimal weights that will be assigned to each associated matcher for the given aggregation task. We then use these weights in our aggregation method to improve the performance of matcher combining. To validate the performance of the proposed strategy, we conducted a set of experiments, and the obtained results are encouraging.","PeriodicalId":302600,"journal":{"name":"2014 IEEE 30th International Conference on Data Engineering Workshops","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132637514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Leveraging in-memory technology for interactive analyses of point-of-sales data 利用内存技术对销售点数据进行交互式分析

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-03-01 DOI: 10.1109/ICDEW.2014.6818311

David Schwalb, Martin Faust, Jens Krüger, H. Plattner

引用次数: 9

Predictive query processing on moving objects 移动对象的预测查询处理

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-03-01 DOI: 10.1109/ICDEW.2014.6818352

Abdeltawab M. Hendawi

引用次数: 4

2B or not 2B and everything in between — novel evaluation methods for matching problems 2B或非2B以及介于两者之间的一切——匹配问题的新评估方法

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-03-01 DOI: 10.1109/ICDEW.2014.6818349

Tomer Sagi

引用次数: 1

Data stream partitioning re-optimization based on runtime dependency mining 基于运行时依赖挖掘的数据流分区重新优化

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-03-01 DOI: 10.1109/ICDEW.2014.6818327

Emeric Viel, Haruyasu Ueda

引用次数: 7

Curracurrong cloud: Stream processing in the cloud Curracurrong云:云中的流处理

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-03-01 DOI: 10.1109/ICDEW.2014.6818328

Vasvi Kakkad, Akon Dey, A. Fekete, Bernhard Scholz

引用次数: 4

RQ-RDF-3X: Going beyond triplestores RQ-RDF-3X:超越三重存储

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-03-01 DOI: 10.1109/ICDEW.2014.6818337

Jyoti Leeka, Srikanta J. Bedathur

{"title":"RQ-RDF-3X: Going beyond triplestores","authors":"Jyoti Leeka, Srikanta J. Bedathur","doi":"10.1109/ICDEW.2014.6818337","DOIUrl":"https://doi.org/10.1109/ICDEW.2014.6818337","url":null,"abstract":"Efficient storage and querying of large repositories of RDF content is important due to the widespread growth of Semantic Web and Linked Open Data initiatives. Many novel database systems that store RDF in its native form or within traditional relational storage have demonstrated their ability to scale to large volumes of RDF content. However, it is increasingly becoming obvious that the simple dyadic relationship captured through traditional triples alone is not sufficient for modelling multi-entity relationships, provenance of facts, etc. Such richer models are supported in RDF through two techniques - first, called reification which retains the triple nature of RDF and the second, a non-standard extension called N-Quads. In this paper, we explore the challenges of supporting such richer semantic data by extending the state-of-the-art RDF-3X system. We describe our implementation of RQ-RDF-3X, a reification and quad enhanced RDF-3X, which involved a significant re-engineering ranging from the set of indexes and their compression schemes to the query processing pipeline for queries over reified content. Using large RDF repositories such as YAGO2S and DBpedia, and a set of SPARQL queries that utilize reification model, we demonstrate that RQ-RDF-3X is significantly faster than RDF-3X.","PeriodicalId":302600,"journal":{"name":"2014 IEEE 30th International Conference on Data Engineering Workshops","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127799882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12