Named Entity Recognition Method with Word Position

Yanrui Du, Weixiang Zhao
2020 International Workshop on Electronic Communication and Artificial Intelligence (IWECAI), published 2020-06-01
DOI: 10.1109/IWECAI50956.2020.00038 (https://doi.org/10.1109/IWECAI50956.2020.00038)
Citations: 3

Abstract

Named entity recognition (also known as entity recognition, entity segmentation, and entity extraction) is a subtask of information extraction. It aims to locate named entities in text and classify them into predefined categories such as person, organization, location, and time expression. Chinese named entity recognition has more unsolved problems than its English counterpart. Named entities in English carry an obvious formal cue: the first letter of each word in an entity is typically capitalized, which makes entity boundaries relatively easy to recognize. Chinese has no such cue, so the task is more complex and entity boundaries are harder to recognize. In this paper, we propose a named entity recognition method that embeds the word position of each word into its word vector in order to better recognize the boundaries of Chinese named entities. Experimental results show that the proposed method improves the F1 score by about 1%.
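The abstract does not spell out how the word position is encoded, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes the input sentence is already segmented into words, tags each character with its position inside its word using a hypothetical B/M/E/S scheme (begin, middle, end, single), and appends a one-hot position vector to a toy character vector. The names `POSITION_TAGS`, `position_tags`, `one_hot`, and `embed_with_position` are illustrative assumptions.

```python
from typing import Dict, List

# Hypothetical tag set (an assumption, not taken from the paper):
# B = first character of a word, M = middle, E = last, S = single-character word.
POSITION_TAGS = ["B", "M", "E", "S"]


def position_tags(words: List[str]) -> List[str]:
    """Return one position tag per character of a pre-segmented sentence."""
    tags: List[str] = []
    for word in words:
        if not word:
            continue  # skip empty tokens so tags stay aligned with characters
        if len(word) == 1:
            tags.append("S")
        else:
            tags.extend(["B"] + ["M"] * (len(word) - 2) + ["E"])
    return tags


def one_hot(tag: str) -> List[float]:
    """Encode a position tag as a one-hot vector over POSITION_TAGS."""
    return [1.0 if tag == t else 0.0 for t in POSITION_TAGS]


def embed_with_position(
    words: List[str], char_vectors: Dict[str, List[float]], dim: int
) -> List[List[float]]:
    """Concatenate each character's vector with its one-hot word position.

    Characters missing from char_vectors fall back to a zero vector of size dim.
    """
    chars = [ch for word in words for ch in word]
    tags = position_tags(words)
    unknown = [0.0] * dim
    return [
        list(char_vectors.get(ch, unknown)) + one_hot(tag)
        for ch, tag in zip(chars, tags)
    ]


if __name__ == "__main__":
    sentence = ["南京市", "长江", "大桥"]  # pre-segmented example sentence
    toy_vectors = {"南": [0.1, 0.2], "京": [0.3, 0.4], "市": [0.5, 0.6]}
    for vector in embed_with_position(sentence, toy_vectors, dim=2):
        print(vector)
```

In this sketch the position features mark where each word starts and ends, which is the kind of boundary signal the abstract says Chinese text lacks. In a trained model, the one-hot vector would more likely be replaced by a learned position embedding fed to the sequence tagger.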