Anderson Carlos Ferreira Da Silva, Fatiha Saïs, E. Waller, F. Andrès
{"title":"基于差异的身份链接失效方法","authors":"Anderson Carlos Ferreira Da Silva, Fatiha Saïs, E. Waller, F. Andrès","doi":"10.1109/WETICE49692.2020.00056","DOIUrl":null,"url":null,"abstract":"More and more datasets are currently connected by identity links using properties such as owl:sameAs expressed in OWL. Identity links are statements that declare that two resources refer to the same real-world entity. However, we cannot attest the correctness of all identity links. Without a central name authority, most identity links are generated by heuristics and they are not reviewed by experts. The main issue in invalidating identity links is the heterogeneity of datasets, they commonly do not share the same predicates. Furthermore, the description of the resources can be incomplete. Despite how the resources are described, identity links are necessary to link data for posterior use. In this paper, we present a framework to invalidate identity links by dissimilarity and outlier detection in equivalence classes of identity links.","PeriodicalId":114214,"journal":{"name":"2020 IEEE 29th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Dissimilarity-based approach for Identity Link Invalidation\",\"authors\":\"Anderson Carlos Ferreira Da Silva, Fatiha Saïs, E. Waller, F. Andrès\",\"doi\":\"10.1109/WETICE49692.2020.00056\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"More and more datasets are currently connected by identity links using properties such as owl:sameAs expressed in OWL. Identity links are statements that declare that two resources refer to the same real-world entity. However, we cannot attest the correctness of all identity links. Without a central name authority, most identity links are generated by heuristics and they are not reviewed by experts. The main issue in invalidating identity links is the heterogeneity of datasets, they commonly do not share the same predicates. Furthermore, the description of the resources can be incomplete. Despite how the resources are described, identity links are necessary to link data for posterior use. In this paper, we present a framework to invalidate identity links by dissimilarity and outlier detection in equivalence classes of identity links.\",\"PeriodicalId\":114214,\"journal\":{\"name\":\"2020 IEEE 29th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 29th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WETICE49692.2020.00056\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 29th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WETICE49692.2020.00056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dissimilarity-based approach for Identity Link Invalidation
More and more datasets are currently connected by identity links using properties such as owl:sameAs expressed in OWL. Identity links are statements that declare that two resources refer to the same real-world entity. However, we cannot attest the correctness of all identity links. Without a central name authority, most identity links are generated by heuristics and they are not reviewed by experts. The main issue in invalidating identity links is the heterogeneity of datasets, they commonly do not share the same predicates. Furthermore, the description of the resources can be incomplete. Despite how the resources are described, identity links are necessary to link data for posterior use. In this paper, we present a framework to invalidate identity links by dissimilarity and outlier detection in equivalence classes of identity links.