Schema inference for multi-model data

Proceedings of the 25th International Conference on Model Driven Engineering Languages and Systems Pub Date : 2022-10-23 DOI:10.1145/3550355.3552400

P. Koupil, Sebastián Hricko, I. Holubová

引用次数: 1

Abstract

The knowledge of a structural schema of data is a crucial aspect of most data management tasks. Unfortunately, in many real-world scenarios, the data is not accompanied by it, and schema-inference approaches need to be utilised. In this paper, we focus on a specific and complex use case of multi-model data where several often contradictory features of the combined models must be considered. Hence, single-model approaches cannot be applied straightforwardly. In addition, the data often reach the scale of Big Data, and thus a scalable solution is inevitable. In our approach, we reflect all these challenges. In addition, we can also infer local integrity constraints as well as intra- and inter-model references. Last but not least, we can cope with cross-model data redundancy. Using a set of experiments, we prove the advantages of the proposed approach and we compare it with related work.

查看原文本刊更多论文

多模型数据的模式推断

了解数据的结构模式是大多数数据管理任务的一个关键方面。不幸的是，在许多现实场景中，数据并没有伴随着它，因此需要使用模式推理方法。在本文中，我们关注多模型数据的一个特定和复杂的用例，其中必须考虑组合模型的几个经常相互矛盾的特征。因此，单模型方法不能直接应用。此外，数据往往达到大数据的规模，因此可扩展的解决方案是不可避免的。在我们的做法中，我们反映了所有这些挑战。此外，我们还可以推断局部完整性约束以及模型内和模型间的引用。最后但并非最不重要的是，我们可以处理跨模型数据冗余。通过一组实验，证明了该方法的优越性，并与相关工作进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 25th International Conference on Model Driven Engineering Languages and Systems

自引率

0.00%

发文量