Mojgan Askarizade, M. Nematbakhsh, Ensieh Davoodi Jam
{"title":"Web of Data中相同实体之间的数据冲突解决","authors":"Mojgan Askarizade, M. Nematbakhsh, Ensieh Davoodi Jam","doi":"10.1109/ICCKE.2012.6395392","DOIUrl":null,"url":null,"abstract":"With the growing amount of published RDF datasets on similar domains, data conflict between similar entities (same-as) is becoming a common problem for Web of Data applications. In this paper we propose an algorithm to detect conflict of same properties values of similar entities and select the most accurate value. The proposed algorithm contains two major steps. The first step filters out low ranked datasets using a link analysis technique. The second step calculates and evaluates the focus level of a dataset in a specific domain. Finally, the value of the top ranked dataset is considered. The proposed algorithm is implemented by Java Programming Language and is evaluated by geographical datasets containing \"country\" entities.","PeriodicalId":154379,"journal":{"name":"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Data conflict resolution among same entities in Web of Data\",\"authors\":\"Mojgan Askarizade, M. Nematbakhsh, Ensieh Davoodi Jam\",\"doi\":\"10.1109/ICCKE.2012.6395392\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the growing amount of published RDF datasets on similar domains, data conflict between similar entities (same-as) is becoming a common problem for Web of Data applications. In this paper we propose an algorithm to detect conflict of same properties values of similar entities and select the most accurate value. The proposed algorithm contains two major steps. The first step filters out low ranked datasets using a link analysis technique. The second step calculates and evaluates the focus level of a dataset in a specific domain. Finally, the value of the top ranked dataset is considered. The proposed algorithm is implemented by Java Programming Language and is evaluated by geographical datasets containing \\\"country\\\" entities.\",\"PeriodicalId\":154379,\"journal\":{\"name\":\"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCKE.2012.6395392\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE.2012.6395392","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
随着相似领域上发布的RDF数据集数量的增长,相似实体(相同实体)之间的数据冲突正在成为Web of data应用程序的一个常见问题。本文提出了一种检测相似实体中相同属性值冲突并选择最准确值的算法。该算法包含两个主要步骤。第一步使用链接分析技术过滤掉低排名的数据集。第二步计算和评估特定域中数据集的焦点级别。最后,考虑排名靠前的数据集的值。该算法由Java编程语言实现,并通过包含“国家”实体的地理数据集进行评估。
Data conflict resolution among same entities in Web of Data
With the growing amount of published RDF datasets on similar domains, data conflict between similar entities (same-as) is becoming a common problem for Web of Data applications. In this paper we propose an algorithm to detect conflict of same properties values of similar entities and select the most accurate value. The proposed algorithm contains two major steps. The first step filters out low ranked datasets using a link analysis technique. The second step calculates and evaluates the focus level of a dataset in a specific domain. Finally, the value of the top ranked dataset is considered. The proposed algorithm is implemented by Java Programming Language and is evaluated by geographical datasets containing "country" entities.