Evolution of Fuzzy Grammars to aid Instance Matching

2006 International Symposium on Evolving Fuzzy Systems Pub Date : 2006-11-30 DOI:10.1109/ISEFS.2006.251174

T. Martin, B. Azvine

引用次数: 5

Abstract

The need for information fusion exists in the semi-structured and unstructured domains - for example, to integrate responses from multiple sources into a unified response. This can be regarded as a two stage process - first to determine whether any two sources are considering the same real-world entities, and second, to ascertain how the attributes correspond (e.g. author/composer should correspond almost exactly to creator, business-location should correspond to address, etc). Within the unstructured and semi-structured attribute values there is frequently hidden structure -e.g. a free text attribute labeled as name might consist of title, first name and family name. Revealing this structure can greatly assist the matching process. In this paper, we outline a method for approximate matching of entities from different data sources and show how an evolutionary approach can create accurate approximate grammars to aid the information integration

查看原文本刊更多论文

模糊语法的演化以辅助实例匹配

信息融合的需求存在于半结构化和非结构化领域中——例如，将来自多个源的响应集成到一个统一的响应中。这可以看作是一个两阶段的过程——首先确定是否有任何两个来源考虑相同的现实世界实体，其次确定属性如何对应(例如，作者/作曲家应该几乎完全对应于创作者，业务位置应该对应于地址，等等)。在非结构化和半结构化的属性值中，经常有隐藏的结构——例如，一个标签为name的自由文本属性可能由标题、名字和姓氏组成。揭示这种结构可以极大地帮助匹配过程。在本文中，我们概述了一种来自不同数据源的实体的近似匹配方法，并展示了一种进化方法如何创建准确的近似语法来帮助信息集成

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2006 International Symposium on Evolving Fuzzy Systems

自引率

0.00%

发文量