{"title":"Evolution of Fuzzy Grammars to aid Instance Matching","authors":"T. Martin, B. Azvine","doi":"10.1109/ISEFS.2006.251174","DOIUrl":null,"url":null,"abstract":"The need for information fusion exists in the semi-structured and unstructured domains - for example, to integrate responses from multiple sources into a unified response. This can be regarded as a two stage process - first to determine whether any two sources are considering the same real-world entities, and second, to ascertain how the attributes correspond (e.g. author/composer should correspond almost exactly to creator, business-location should correspond to address, etc). Within the unstructured and semi-structured attribute values there is frequently hidden structure -e.g. a free text attribute labeled as name might consist of title, first name and family name. Revealing this structure can greatly assist the matching process. In this paper, we outline a method for approximate matching of entities from different data sources and show how an evolutionary approach can create accurate approximate grammars to aid the information integration","PeriodicalId":269492,"journal":{"name":"2006 International Symposium on Evolving Fuzzy Systems","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 International Symposium on Evolving Fuzzy Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISEFS.2006.251174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
The need for information fusion exists in the semi-structured and unstructured domains - for example, to integrate responses from multiple sources into a unified response. This can be regarded as a two stage process - first to determine whether any two sources are considering the same real-world entities, and second, to ascertain how the attributes correspond (e.g. author/composer should correspond almost exactly to creator, business-location should correspond to address, etc). Within the unstructured and semi-structured attribute values there is frequently hidden structure -e.g. a free text attribute labeled as name might consist of title, first name and family name. Revealing this structure can greatly assist the matching process. In this paper, we outline a method for approximate matching of entities from different data sources and show how an evolutionary approach can create accurate approximate grammars to aid the information integration