在数据集成中引入上下文和上下文感知:识别问题和知情同意的初步案例研究

C. Debruyne
{"title":"在数据集成中引入上下文和上下文感知:识别问题和知情同意的初步案例研究","authors":"C. Debruyne","doi":"10.1145/3428757.3429116","DOIUrl":null,"url":null,"abstract":"Data integration is the process of selecting, preprocessing, and transforming data from heterogeneous sources in data-driven projects. This process also requires the most time, effort, resources. Data integration is such an involved process due to the many informed decisions one has to make. These decisions are influenced by the complex context of a data-driven project. We argue that using said context could facilitate the decision-making processes and even automate some integration steps. However, the problem we identify in this paper is that the context of a data-driven project is tacit and, therefore, not easily accessible by humans and certainly not by software agents. From the SotA, however, we observe that current models represent the context in crude and simplistic terms. These context models are furthermore built for specific tasks or application domains such as query optimization or a smart home. The current state of affairs is thus is not fit for intelligent data integration. Next to identifying the problem, we postulate that solving this problem requires two steps: formalizing context and using that context for building context-aware agents. We illustrate this notion of \"context-aware data integration\" with preliminary results obtained with a use case in the domain of GDPR, more specifically the generation of datasets that takes into account informed consent.","PeriodicalId":212557,"journal":{"name":"Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Introducing Context and Context-awareness in Data Integration: Identifying the Problem and a Preliminary Case Study on Informed Consent\",\"authors\":\"C. Debruyne\",\"doi\":\"10.1145/3428757.3429116\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data integration is the process of selecting, preprocessing, and transforming data from heterogeneous sources in data-driven projects. This process also requires the most time, effort, resources. Data integration is such an involved process due to the many informed decisions one has to make. These decisions are influenced by the complex context of a data-driven project. We argue that using said context could facilitate the decision-making processes and even automate some integration steps. However, the problem we identify in this paper is that the context of a data-driven project is tacit and, therefore, not easily accessible by humans and certainly not by software agents. From the SotA, however, we observe that current models represent the context in crude and simplistic terms. These context models are furthermore built for specific tasks or application domains such as query optimization or a smart home. The current state of affairs is thus is not fit for intelligent data integration. Next to identifying the problem, we postulate that solving this problem requires two steps: formalizing context and using that context for building context-aware agents. We illustrate this notion of \\\"context-aware data integration\\\" with preliminary results obtained with a use case in the domain of GDPR, more specifically the generation of datasets that takes into account informed consent.\",\"PeriodicalId\":212557,\"journal\":{\"name\":\"Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3428757.3429116\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3428757.3429116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

数据集成是在数据驱动的项目中从异构源选择、预处理和转换数据的过程。这个过程也需要最多的时间、精力和资源。由于必须做出许多明智的决策,数据集成是一个非常复杂的过程。这些决策受到数据驱动项目的复杂环境的影响。我们认为,使用上述上下文可以促进决策过程,甚至自动化一些集成步骤。然而,我们在本文中确定的问题是,数据驱动项目的上下文是隐性的,因此,不容易被人类访问,当然也不容易被软件代理访问。然而,从SotA中,我们观察到当前的模型以粗糙和简单的术语表示上下文。这些上下文模型是为特定任务或应用程序领域(如查询优化或智能家居)进一步构建的。因此,目前的现状并不适合智能数据集成。在确定问题之后,我们假设解决这个问题需要两个步骤:形式化上下文并使用该上下文构建上下文感知代理。我们通过GDPR领域的一个用例获得的初步结果来说明“上下文感知数据集成”的概念,更具体地说,是考虑到知情同意的数据集的生成。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Introducing Context and Context-awareness in Data Integration: Identifying the Problem and a Preliminary Case Study on Informed Consent
Data integration is the process of selecting, preprocessing, and transforming data from heterogeneous sources in data-driven projects. This process also requires the most time, effort, resources. Data integration is such an involved process due to the many informed decisions one has to make. These decisions are influenced by the complex context of a data-driven project. We argue that using said context could facilitate the decision-making processes and even automate some integration steps. However, the problem we identify in this paper is that the context of a data-driven project is tacit and, therefore, not easily accessible by humans and certainly not by software agents. From the SotA, however, we observe that current models represent the context in crude and simplistic terms. These context models are furthermore built for specific tasks or application domains such as query optimization or a smart home. The current state of affairs is thus is not fit for intelligent data integration. Next to identifying the problem, we postulate that solving this problem requires two steps: formalizing context and using that context for building context-aware agents. We illustrate this notion of "context-aware data integration" with preliminary results obtained with a use case in the domain of GDPR, more specifically the generation of datasets that takes into account informed consent.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信