Integration of Data from Heterogeneous Sources using ETL Technology

Marek Macura
DOI: 10.7494/csci.2014.15.2.109
URL: https://doi.org/10.7494/csci.2014.15.2.109
Journal: Theor. Comput. Sci.
Volume: 10 1
Pages: 109-132
Publication date: 2014-03-14
Publication type: Journal Article
Source: Semantic Scholar
Citations: 12

Abstract

Data integration is a crucial issue in environments with heterogeneous data sources, and such heterogeneity is becoming widespread. To gain useful information and knowledge from various data sources, we must first solve the data integration problem, so that appropriate analytical methods can be applied to comprehensive and uniform data. This activity is known as the process of knowledge discovery from data. Approaches to the data integration problem are therefore of considerable interest and bring us closer to the "age of information". This paper presents an architecture that implements the process of knowledge discovery from data. The solution combines ETL technology with a wrapper layer known from mediated systems. It also provides semantic integration through a connection mechanism between data elements. The solution allows any data sources to be integrated and analytical methods to be implemented in one environment. The proposed environment is verified by applying it to data sources in the foundry industry.
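The combination described in the abstract (a wrapper layer in front of heterogeneous sources, an ETL pass over them, and connections between data elements) can be illustrated with a minimal sketch. This is not the paper's implementation. All names here (`csv_wrapper`, `json_wrapper`, `etl`, `SOURCES`, `CONNECTIONS`), the sample foundry-style records, and the use of a simple field-name mapping to stand in for the connection mechanism are assumptions made for illustration only.

```python
import csv
import io
import json
from typing import Callable, Dict, Iterable, List, Tuple

Record = Dict[str, str]
Wrapper = Callable[[str], Iterable[Record]]


def csv_wrapper(text: str) -> Iterable[Record]:
    """Hypothetical wrapper: exposes a CSV source as dict records."""
    return csv.DictReader(io.StringIO(text))


def json_wrapper(text: str) -> Iterable[Record]:
    """Hypothetical wrapper: exposes a JSON list of objects as dict records."""
    return [{k: str(v) for k, v in row.items()} for row in json.loads(text)]


def etl(sources: Dict[str, Tuple[Wrapper, str]],
        connections: Dict[str, Dict[str, str]]) -> List[Record]:
    """Extract through each wrapper, rename fields to a shared schema, load into a list."""
    unified: List[Record] = []
    for name, (wrapper, raw) in sources.items():
        mapping = connections[name]
        for row in wrapper(raw):  # extract
            # transform: map source-specific field names to unified ones
            record = {target: row[src] for src, target in mapping.items()}
            record["source"] = name
            unified.append(record)  # load
    return unified


# Invented sample data; the field names are assumptions, not taken from the paper.
SOURCES: Dict[str, Tuple[Wrapper, str]] = {
    "furnace_csv": (csv_wrapper, "melt_id,temp_c\nA1,1450\n"),
    "lab_json": (json_wrapper, '[{"id": "B7", "temperature": 1420}]'),
}

# Assumed stand-in for the connections between data elements:
# source field name -> unified field name.
CONNECTIONS: Dict[str, Dict[str, str]] = {
    "furnace_csv": {"melt_id": "melt", "temp_c": "temperature_c"},
    "lab_json": {"id": "melt", "temperature": "temperature_c"},
}

unified = etl(SOURCES, CONNECTIONS)
```

In this sketch the wrappers hide each source's format behind one record interface, so the ETL step only needs the per-source mapping to produce uniform records that an analytical method could consume.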