虚拟数据集成环境中的统计数据融合技术

International Journal of Data Mining & Knowledge Management Process Pub Date : 2013-09-30 DOI:10.5121/IJDKP.2013.3503

Mohamed M. Hafez, A. E. Bastawissy, O. H. Mohamed

{"title":"虚拟数据集成环境中的统计数据融合技术","authors":"Mohamed M. Hafez, A. E. Bastawissy, O. H. Mohamed","doi":"10.5121/IJDKP.2013.3503","DOIUrl":null,"url":null,"abstract":"Data fusion in the virtual data integration environment starts after detecting and clustering duplicated records from the different integrated data sources. It refers to the process of selecting or fusing attribute values from the clustered duplicates into a single record representing the real world object. In this paper, a statistical technique for data fusion is introduced based on some probabilistic scores from both data sources and clustered duplicates.","PeriodicalId":131153,"journal":{"name":"International Journal of Data Mining & Knowledge Management Process","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Statistical Data Fusion Technique in Virtual Data Integration Environment\",\"authors\":\"Mohamed M. Hafez, A. E. Bastawissy, O. H. Mohamed\",\"doi\":\"10.5121/IJDKP.2013.3503\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data fusion in the virtual data integration environment starts after detecting and clustering duplicated records from the different integrated data sources. It refers to the process of selecting or fusing attribute values from the clustered duplicates into a single record representing the real world object. In this paper, a statistical technique for data fusion is introduced based on some probabilistic scores from both data sources and clustered duplicates.\",\"PeriodicalId\":131153,\"journal\":{\"name\":\"International Journal of Data Mining & Knowledge Management Process\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Data Mining & Knowledge Management Process\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5121/IJDKP.2013.3503\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Data Mining & Knowledge Management Process","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/IJDKP.2013.3503","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

虚拟数据集成环境中的数据融合始于对不同集成数据源中的重复记录进行检测和聚类。它指的是从聚集的重复项中选择或融合属性值到表示真实世界对象的单个记录中的过程。本文介绍了一种基于数据源和聚类副本的概率分数的数据融合统计技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Statistical Data Fusion Technique in Virtual Data Integration Environment

Data fusion in the virtual data integration environment starts after detecting and clustering duplicated records from the different integrated data sources. It refers to the process of selecting or fusing attribute values from the clustered duplicates into a single record representing the real world object. In this paper, a statistical technique for data fusion is introduced based on some probabilistic scores from both data sources and clustered duplicates.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Data Mining & Knowledge Management Process

自引率

0.00%

发文量