Graph based tweet entity linking using DBpedia

Fahd Kalloubi, E. Nfaoui, O. Beqqali
{"title":"Graph based tweet entity linking using DBpedia","authors":"Fahd Kalloubi, E. Nfaoui, O. Beqqali","doi":"10.1109/AICCSA.2014.7073240","DOIUrl":null,"url":null,"abstract":"Twitter has became an invaluable source of information, due to his dynamic nature with more than 400 million tweets posted per day. Determining what an individual post is about can be a non trivial task because his high contextualization and his informal nature. Named Entity Linking (NEL) is a subtask of information extraction that aims to ground entity mentions to their corresponding node in a Knowledge Base (KB), which requires a disambiguation step, because many resources can be matched to the same entity that lead to synonymy and polysemy problems. To overcome these problems, especially in the context of short text, we present a novel system for tweet entity linking based on graph centrality and DBpedia as knowledge base. Our approach relies on the assumption that related entities tend to appear in the same tweet as tweets are topic specific. Also, we address the problem of irregular name mentions. Finally, to show the effectiveness of our system we evaluate it using a real twitter dataset and compare it to a well known state-of-the-art named entity linking system for short text.","PeriodicalId":412749,"journal":{"name":"2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICCSA.2014.7073240","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

Abstract

Twitter has became an invaluable source of information, due to his dynamic nature with more than 400 million tweets posted per day. Determining what an individual post is about can be a non trivial task because his high contextualization and his informal nature. Named Entity Linking (NEL) is a subtask of information extraction that aims to ground entity mentions to their corresponding node in a Knowledge Base (KB), which requires a disambiguation step, because many resources can be matched to the same entity that lead to synonymy and polysemy problems. To overcome these problems, especially in the context of short text, we present a novel system for tweet entity linking based on graph centrality and DBpedia as knowledge base. Our approach relies on the assumption that related entities tend to appear in the same tweet as tweets are topic specific. Also, we address the problem of irregular name mentions. Finally, to show the effectiveness of our system we evaluate it using a real twitter dataset and compare it to a well known state-of-the-art named entity linking system for short text.
使用DBpedia进行基于图的tweet实体链接
推特已经成为一个宝贵的信息来源,由于他的动态性质,每天发布超过4亿条推文。确定一个单独的帖子是关于什么的可能是一个不平凡的任务,因为他的高度语境化和他的非正式性质。命名实体链接(NEL)是信息提取的一个子任务,其目的是将实体提及与知识库(KB)中的相应节点联系起来,这需要消歧义步骤,因为许多资源可能与同一实体匹配,从而导致同义词和多义问题。为了克服这些问题,特别是在短文本环境下,我们提出了一种基于图中心性和DBpedia作为知识库的推文实体链接系统。我们的方法依赖于一个假设,即相关实体往往出现在同一条tweet中,因为tweet是特定于主题的。此外,我们还解决了不规则名字提及的问题。最后,为了展示我们系统的有效性,我们使用真实的twitter数据集对其进行评估,并将其与一个众所周知的最先进的短文本命名实体链接系统进行比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信