Multidimensional Contexts for Data Quality Assessment

Aida Malaki, L. Bertossi, Flavio Rizzolo
{"title":"Multidimensional Contexts for Data Quality Assessment","authors":"Aida Malaki, L. Bertossi, Flavio Rizzolo","doi":"10.22215/etd/2013-07045","DOIUrl":null,"url":null,"abstract":"The notion of data quality cannot be separated from the context in which the data is produced or used. Recently, a conceptual framework for capturing context-dependent data quality assessment has been proposed. According to it, a database D is assessed wrt. a context which is modeled as an external system containing additional data, metadata, and definitions of quality predicates. The instance D is “put in context” via schema mappings; and after contextual processing of the data, a collection of alternative clean versions D′ of D is produced. The quality of D is measured in terms of its distance to this class. In this work we extend contexts for data quality assessment by including multidimensional data, which allows to analyze data from multiple perspectives and different degrees of granularity. It is possible to navigate through dimensional hierarchies in order to go for the data that is needed for quality assessment. More precisely, we introduce contextual hierarchies as components of contexts for data quality assessment. The resulting contexts are later represented as ontologies written in description logic.","PeriodicalId":282387,"journal":{"name":"Alberto Mendelzon Workshop on Foundations of Data Management","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Alberto Mendelzon Workshop on Foundations of Data Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22215/etd/2013-07045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

Abstract

The notion of data quality cannot be separated from the context in which the data is produced or used. Recently, a conceptual framework for capturing context-dependent data quality assessment has been proposed. According to it, a database D is assessed wrt. a context which is modeled as an external system containing additional data, metadata, and definitions of quality predicates. The instance D is “put in context” via schema mappings; and after contextual processing of the data, a collection of alternative clean versions D′ of D is produced. The quality of D is measured in terms of its distance to this class. In this work we extend contexts for data quality assessment by including multidimensional data, which allows to analyze data from multiple perspectives and different degrees of granularity. It is possible to navigate through dimensional hierarchies in order to go for the data that is needed for quality assessment. More precisely, we introduce contextual hierarchies as components of contexts for data quality assessment. The resulting contexts are later represented as ontologies written in description logic.
数据质量评估的多维上下文
数据质量的概念不能与产生或使用数据的上下文分开。最近,人们提出了一个用于获取上下文相关数据质量评估的概念框架。根据它,对数据库D进行wrt评估。一个被建模为包含附加数据、元数据和质量谓词定义的外部系统的上下文。实例D通过模式映射“置于上下文中”;在对数据进行上下文处理后,生成D的备选干净版本D '的集合。D的质量是用它到这个类的距离来衡量的。在这项工作中,我们通过包含多维数据来扩展数据质量评估的上下文,这允许从多个角度和不同粒度程度分析数据。为了获得质量评估所需的数据,可以浏览维度层次结构。更准确地说,我们引入上下文层次结构作为数据质量评估上下文的组成部分。结果上下文随后被表示为用描述逻辑编写的本体。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信