{"title":"On using historical update information for instance identification in federated databases","authors":"A. Si, Chi C. Ying, D. McLeod","doi":"10.1109/COOPIS.1996.554999","DOIUrl":null,"url":null,"abstract":"To support database interoperability in federated databases systems, it is critical to be able to identify (potentially) equivalent data instances from individual autonomous database components. Since the components in a federation are autonomous, their data may be updated asynchronously, viz., modifications to a real world entity may be captured in different databases at different times; the authors term this effect update heterogeneity. Existing approaches largely base data instance similarity identification only on current attribute/property values; in the face of update heterogeneity, this is inadequate. They present an approach to address the problem of update heterogeneity in the federated databases context. They employ a probabilistic model, which utilizes historical database update information to estimate the degree of similarity between candidate data instances from different database components. They employ transaction history (log) information to this end, which is typically already available in the component database systems. They have experimentally implemented and tested this approach within the context of a prototype experimental federated databases system, FeXpress.","PeriodicalId":314823,"journal":{"name":"Proceedings First IFCIS International Conference on Cooperative Information Systems","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings First IFCIS International Conference on Cooperative Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COOPIS.1996.554999","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
To support database interoperability in federated databases systems, it is critical to be able to identify (potentially) equivalent data instances from individual autonomous database components. Since the components in a federation are autonomous, their data may be updated asynchronously, viz., modifications to a real world entity may be captured in different databases at different times; the authors term this effect update heterogeneity. Existing approaches largely base data instance similarity identification only on current attribute/property values; in the face of update heterogeneity, this is inadequate. They present an approach to address the problem of update heterogeneity in the federated databases context. They employ a probabilistic model, which utilizes historical database update information to estimate the degree of similarity between candidate data instances from different database components. They employ transaction history (log) information to this end, which is typically already available in the component database systems. They have experimentally implemented and tested this approach within the context of a prototype experimental federated databases system, FeXpress.