Resolving scalability issue to ontology instance matching in Semantic Web
Rudra Pratap Deb Nath, Hanif Seddiqui, Masaki Aono
2012 15th International Conference on Computer and Information Technology (ICCIT), published 2012-12-01
DOI: 10.1109/ICCITECHN.2012.6509778 (https://doi.org/10.1109/ICCITECHN.2012.6509778)
Citations: 7
Abstract
Ontology instance matching is a key enabler of interoperability across heterogeneous data sources in the Semantic Web, and a useful technique in some classical data integration tasks dealing with semantically heterogeneous assignments. Although most research so far has addressed matching at the ontology schema level, the introduction of Linked Open Data (LOD) and social networks is shifting research on ontology matching from the schema or concept level to the instance level. Because heterogeneous sources of massive ontology instances are growing sharply day by day, scalability has become a major research issue in ontology instance matching of semantic knowledge bases. In this paper, we propose an efficient method that addresses the scalability issue by grouping the instances of a knowledge base into several sub-groups. Our instance matcher, which considers the semantic specification of the properties associated with instances in its matching strategy, then compares an instance within a classification group of one knowledge base against the instances of the same sub-group of the other knowledge base to achieve interoperability. A novel approach for measuring the influence of properties in the matching process is also presented. The experiments and evaluation show satisfactory results in terms of effectiveness and scalability compared with baseline methods.
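The grouping idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how sub-groups are formed or how property influence is measured, so grouping by a `class` label, the hand-set `weights` dictionary, the string similarity from `difflib`, the threshold, and all names and sample data below are assumptions made purely for illustration.

```python
from collections import defaultdict
from difflib import SequenceMatcher


def group_by_class(instances):
    """Partition instances into sub-groups keyed by class label (assumed grouping key)."""
    groups = defaultdict(list)
    for inst in instances:
        groups[inst["class"]].append(inst)
    return groups


def string_similarity(a, b):
    """Case-insensitive similarity ratio in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def instance_similarity(a, b, weights):
    """Weighted average of value similarities over the properties both instances share."""
    shared = set(a["props"]) & set(b["props"])
    total = sum(weights.get(p, 1.0) for p in shared)
    if total == 0:
        return 0.0
    score = sum(
        weights.get(p, 1.0) * string_similarity(a["props"][p], b["props"][p])
        for p in shared
    )
    return score / total


def match(kb1, kb2, weights, threshold=0.8):
    """Compare each instance only against instances in the same sub-group of the other KB."""
    groups1 = group_by_class(kb1)
    groups2 = group_by_class(kb2)
    matches = []
    comparisons = 0
    for cls, members in groups1.items():
        for a in members:
            for b in groups2.get(cls, []):
                comparisons += 1
                if instance_similarity(a, b, weights) >= threshold:
                    matches.append((a["id"], b["id"]))
    return matches, comparisons


# Hypothetical toy knowledge bases, invented for this example.
kb1 = [
    {"id": "a1", "class": "Person", "props": {"name": "Alan Turing"}},
    {"id": "a2", "class": "Person", "props": {"name": "Grace Hopper"}},
    {"id": "a3", "class": "City", "props": {"name": "Paris"}},
]
kb2 = [
    {"id": "b1", "class": "Person", "props": {"name": "alan turing"}},
    {"id": "b2", "class": "City", "props": {"name": "Paris"}},
    {"id": "b3", "class": "City", "props": {"name": "Tokyo"}},
]

matches, comparisons = match(kb1, kb2, weights={"name": 2.0})
```

On this toy data the matcher performs 4 pairwise comparisons where an exhaustive all-pairs scan would perform 9. That reduction in candidate pairs is the scalability benefit the grouping step is meant to provide.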