Using Semantic Linking to Understand Persons’ Networks Extracted from Text

Alessio Palmero Aprosio, Sara Tonelli, S. Menini, Giovanni Moretti
{"title":"Using Semantic Linking to Understand Persons’ Networks Extracted from Text","authors":"Alessio Palmero Aprosio, Sara Tonelli, S. Menini, Giovanni Moretti","doi":"10.3389/fdigh.2017.00022","DOIUrl":null,"url":null,"abstract":"In this work, we describe a methodology to interpret large persons' networks extracted from text by classifying cliques using the DBpedia ontology. The approach relies on a combination of NLP, Semantic web technologies and network analysis. The classification methodology that first starts from single nodes and then generalises to cliques is effective in terms of performance and is able to deal also with nodes that are not linked to Wikipedia. The gold standard manually developed for evaluation shows that groups of co-occurring entities share in most of the cases a category that can be automatically assigned. This holds for both languages considered in this study. The outcome of this work may be of interest to enhance the readability of large networks and to provide an additional semantic layer on top of cliques. This would greatly help humanities scholars when dealing with large amounts of textual data that need to be interpreted or categorised. Furthermore, it represents an unsupervised approach to automatically extend DBpedia starting from a corpus.","PeriodicalId":227954,"journal":{"name":"Frontiers Digit. Humanit.","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers Digit. Humanit.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fdigh.2017.00022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In this work, we describe a methodology to interpret large persons' networks extracted from text by classifying cliques using the DBpedia ontology. The approach relies on a combination of NLP, Semantic web technologies and network analysis. The classification methodology that first starts from single nodes and then generalises to cliques is effective in terms of performance and is able to deal also with nodes that are not linked to Wikipedia. The gold standard manually developed for evaluation shows that groups of co-occurring entities share in most of the cases a category that can be automatically assigned. This holds for both languages considered in this study. The outcome of this work may be of interest to enhance the readability of large networks and to provide an additional semantic layer on top of cliques. This would greatly help humanities scholars when dealing with large amounts of textual data that need to be interpreted or categorised. Furthermore, it represents an unsupervised approach to automatically extend DBpedia starting from a corpus.
利用语义连接理解文本中提取的人物网络
在这项工作中,我们描述了一种方法,通过使用DBpedia本体对派系进行分类,来解释从文本中提取的大型人员网络。该方法依赖于自然语言处理、语义网技术和网络分析的结合。首先从单个节点开始,然后推广到小组的分类方法在性能方面是有效的,并且能够处理没有链接到维基百科的节点。手动开发用于评估的黄金标准表明,在大多数情况下,共同出现的实体组共享一个可以自动分配的类别。这项研究中考虑的两种语言都是如此。这项工作的结果可能会增强大型网络的可读性,并在派系之上提供额外的语义层。这将极大地帮助人文学者在处理需要解释或分类的大量文本数据时。此外,它代表了一种从语料库开始自动扩展DBpedia的无监督方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信