On-line Versioned Schema Inference for Large Semantic Web Data Sources

Kenza Kellou-Menouer, Zoubida Kedad
{"title":"On-line Versioned Schema Inference for Large Semantic Web Data Sources","authors":"Kenza Kellou-Menouer, Zoubida Kedad","doi":"10.1145/3085504.3085513","DOIUrl":null,"url":null,"abstract":"A growing number of data sources expressed in RDF(S)/OWL are available on the Web. They are increasingly used in big data and real-time applications. These data sources may be created without formally defining their schema, which is implicit in the stored data. The instances of a source do not have to conform to the schema when it is defined. This offers more flexibility and eases data evolution. However, it comes at the cost of losing the description of the data, which can be useful in many contexts. In this paper, we present SchemaDecrypt, a novel approach for discovering a versioned schema for a remote data source. SchemaDecrypt enables the discovery of the different structures of the existing classes. Our approach discovers the versions on-line, without uploading or browsing the data source. It enables to overcome the source querying restrictions and the combinatorial explosion of the candidate versions. We present some experimental evaluations on DBpedia to demonstrate the performances of our approach.","PeriodicalId":431308,"journal":{"name":"Proceedings of the 29th International Conference on Scientific and Statistical Database Management","volume":"59 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 29th International Conference on Scientific and Statistical Database Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3085504.3085513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

A growing number of data sources expressed in RDF(S)/OWL are available on the Web. They are increasingly used in big data and real-time applications. These data sources may be created without formally defining their schema, which is implicit in the stored data. The instances of a source do not have to conform to the schema when it is defined. This offers more flexibility and eases data evolution. However, it comes at the cost of losing the description of the data, which can be useful in many contexts. In this paper, we present SchemaDecrypt, a novel approach for discovering a versioned schema for a remote data source. SchemaDecrypt enables the discovery of the different structures of the existing classes. Our approach discovers the versions on-line, without uploading or browsing the data source. It enables to overcome the source querying restrictions and the combinatorial explosion of the candidate versions. We present some experimental evaluations on DBpedia to demonstrate the performances of our approach.
大型语义Web数据源的在线版本模式推断
Web上有越来越多的以RDF(S)/OWL表示的数据源。它们越来越多地用于大数据和实时应用。可以在没有正式定义模式的情况下创建这些数据源,这在存储的数据中是隐式的。源的实例在定义模式时不必遵循模式。这提供了更大的灵活性,并简化了数据演变。然而,这样做的代价是丢失数据的描述,这在许多上下文中都是有用的。在本文中,我们介绍了SchemaDecrypt,这是一种用于发现远程数据源的版本化模式的新方法。SchemaDecrypt支持发现现有类的不同结构。我们的方法在线发现版本,不需要上传或浏览数据源。它能够克服源查询限制和候选版本的组合爆炸。我们在DBpedia上进行了一些实验评估,以证明我们的方法的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信