TGCEL:一种基于主题关系图的中文实体链接方法

Yi Chen, Yusong Tan, Q. Wu, Wei Wang
{"title":"TGCEL:一种基于主题关系图的中文实体链接方法","authors":"Yi Chen, Yusong Tan, Q. Wu, Wei Wang","doi":"10.1109/ICCSNT.2017.8343692","DOIUrl":null,"url":null,"abstract":"Entity linking has an important basic research value for Natural Language Processing, the task of which is to link different entity mentions in the given text with their referent entities in a knowledge base. And it is widely used in such fields as expanding knowledge base, Q&A system, machine translation. We propose a Chinese collective entity linking algorithm based on the extracted topic features. We construct the topic relation graph of ambiguous entities in the same text, extract the topic characteristics from the multiple topic models, calculate the topic relevance, and select the topic subgraph with maximum score to reason and realize the batch linking. We experiment with both the news test corpus and the microblog test corpus, compare the performance of the adopted topic model, and analyze their applicable scene. When compared with the traditional algorithm, the maximum performance of our algorithm is improved by about 9% in microblog corpus and over 15% in news corpus, which indicates that our algorithm is potentially effective.","PeriodicalId":163433,"journal":{"name":"2017 6th International Conference on Computer Science and Network Technology (ICCSNT)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"TGCEL: A Chinese entity linking method based on topic relation graph\",\"authors\":\"Yi Chen, Yusong Tan, Q. Wu, Wei Wang\",\"doi\":\"10.1109/ICCSNT.2017.8343692\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Entity linking has an important basic research value for Natural Language Processing, the task of which is to link different entity mentions in the given text with their referent entities in a knowledge base. And it is widely used in such fields as expanding knowledge base, Q&A system, machine translation. We propose a Chinese collective entity linking algorithm based on the extracted topic features. We construct the topic relation graph of ambiguous entities in the same text, extract the topic characteristics from the multiple topic models, calculate the topic relevance, and select the topic subgraph with maximum score to reason and realize the batch linking. We experiment with both the news test corpus and the microblog test corpus, compare the performance of the adopted topic model, and analyze their applicable scene. When compared with the traditional algorithm, the maximum performance of our algorithm is improved by about 9% in microblog corpus and over 15% in news corpus, which indicates that our algorithm is potentially effective.\",\"PeriodicalId\":163433,\"journal\":{\"name\":\"2017 6th International Conference on Computer Science and Network Technology (ICCSNT)\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 6th International Conference on Computer Science and Network Technology (ICCSNT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSNT.2017.8343692\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 6th International Conference on Computer Science and Network Technology (ICCSNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSNT.2017.8343692","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

实体链接在自然语言处理中具有重要的基础研究价值,它的任务是将给定文本中提到的不同实体与其知识库中的参考实体联系起来。广泛应用于知识库扩展、问答系统、机器翻译等领域。我们提出了一种基于抽取主题特征的中文集体实体链接算法。我们构建同一文本中歧义实体的主题关系图,从多个主题模型中提取主题特征,计算主题相关性,选择得分最高的主题子图进行推理,实现批量链接。我们对新闻测试语料库和微博测试语料库进行了实验,比较了所采用的主题模型的性能,并分析了它们的适用场景。与传统算法相比,本文算法在微博语料库上的最大性能提高了9%左右,在新闻语料库上的最大性能提高了15%以上,表明本文算法具有潜在的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
TGCEL: A Chinese entity linking method based on topic relation graph
Entity linking has an important basic research value for Natural Language Processing, the task of which is to link different entity mentions in the given text with their referent entities in a knowledge base. And it is widely used in such fields as expanding knowledge base, Q&A system, machine translation. We propose a Chinese collective entity linking algorithm based on the extracted topic features. We construct the topic relation graph of ambiguous entities in the same text, extract the topic characteristics from the multiple topic models, calculate the topic relevance, and select the topic subgraph with maximum score to reason and realize the batch linking. We experiment with both the news test corpus and the microblog test corpus, compare the performance of the adopted topic model, and analyze their applicable scene. When compared with the traditional algorithm, the maximum performance of our algorithm is improved by about 9% in microblog corpus and over 15% in news corpus, which indicates that our algorithm is potentially effective.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信