Entity Name System: The Back-Bone of an Open and Scalable Web of Data

P. Bouquet, Heiko Stoermer, C. Niederée, A. Maña
Published in: 2008 IEEE International Conference on Semantic Computing
Publication date: 2008-08-04
DOI: 10.1109/ICSC.2008.37 (https://doi.org/10.1109/ICSC.2008.37)
Citations: 88

Abstract

Recognizing that information from different sources refers to the same real-world entity is a crucial challenge in instance-level information integration, because it is a prerequisite for combining information about one entity from different sources. The required entity matching is time-consuming and thus imposes a crucial limit on large-scale, dynamic information integration. Increased re-use of entity identifiers (or names) across different information collections, such as RDF repositories, databases, and document collections, eases this situation. In the ideal case, entity matching reduces to the trivial problem of spotting the same entity identifier in different information collections. In this paper we propose the use of an entity name system (ENS), as currently under development in the EU-funded project OKKAM, for systematically supporting the re-use of entity identifiers. The main purpose of the ENS is to provide unique and uniform names for entities for use in information collections, so that the same name is used for an entity even when it is referenced in different contexts. Of course, creating an ENS that can efficiently deal with entities at Web scale raises scalability issues of its own. This paper focuses on the role of an ENS in contributing to the scalability of ad hoc and on-demand information integration tasks.
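
The central idea of the abstract, that shared identifiers turn entity matching into a simple identifier lookup, can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the class `ToyEntityNameSystem`, the method `get_or_create_id`, the helper `integrate`, and the `http://example.org/entity/` namespace are hypothetical and are not the OKKAM interface. The toy registry also matches descriptions only by exact equality after simple normalisation, which is far weaker than the matching a real ENS would need.

```python
from dataclasses import dataclass, field


@dataclass
class ToyEntityNameSystem:
    """Hypothetical in-memory stand-in for an ENS (not the OKKAM API)."""

    prefix: str = "http://example.org/entity/"  # placeholder namespace (assumption)
    _ids: dict = field(default_factory=dict)

    def get_or_create_id(self, description: dict) -> str:
        # Normalise keys and values so trivially different spellings share one key.
        key = frozenset(
            (k.lower(), str(v).strip().lower()) for k, v in description.items()
        )
        # Issue a new identifier only if this description has not been seen before.
        if key not in self._ids:
            self._ids[key] = f"{self.prefix}{len(self._ids) + 1}"
        return self._ids[key]


def integrate(*collections: dict) -> dict:
    """Merge attribute records that carry the same entity identifier."""
    merged: dict = {}
    for collection in collections:
        for entity_id, attributes in collection.items():
            merged.setdefault(entity_id, {}).update(attributes)
    return merged


ens = ToyEntityNameSystem()

# Two independent sources ask the ENS for a name before publishing their data.
id_a = ens.get_or_create_id({"name": "Trento", "type": "city"})
id_b = ens.get_or_create_id({"name": " trento ", "type": "City"})

# Each source stores its own attributes under the identifier it received.
rdf_like_source = {id_a: {"country": "Italy"}}
database_source = {id_b: {"record_origin": "database"}}

# Integration is now a join on the identifier; no pairwise matching is needed.
merged = integrate(rdf_like_source, database_source)
```

Because both sources obtained the same identifier up front, the integration step never compares descriptions against each other. The matching cost is paid once, inside the ENS, which is also where the scalability issues mentioned in the abstract would concentrate.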