Optimized Named Entity Recognition of Electric Power Field Based on Word-Struct BiGRU

Li Wensong, Hu Zhuqing, Zhang Jinhui, Liu Xuejing, Lin Feng, Yu Jun
{"title":"Optimized Named Entity Recognition of Electric Power Field Based on Word-Struct BiGRU","authors":"Li Wensong, Hu Zhuqing, Zhang Jinhui, Liu Xuejing, Lin Feng, Yu Jun","doi":"10.1109/iSPEC53008.2021.9735481","DOIUrl":null,"url":null,"abstract":"In view of the characteristics of small corpus size, nested entities and abbreviated entities in electric field, this paper studied the methods of entity recognition in professional field, and proposed a method of entity recognition in electric field based on text vector feature enhancement. Firstly, by using low-intensity and preset word segmentation, the semantic information contained in Chinese words can be used reasonably to avoid the influence of semantic separation between single Chinese character and word on text vectors; Then WS-BiGRU is designed to learn the structural features of words, and a new feature enhancement vector is formed by fusing word vectors, parts of speech and word lengths. Furthermore, the training of entity recognition model is completed by BiGRU-Attention-CRF. For electric corpus, the P value is 86.74% and the R value is 87.3%. The research results can be used in electric dispatching knowledge graph, metadata model, intelligent report query and other business scenarios.","PeriodicalId":417862,"journal":{"name":"2021 IEEE Sustainable Power and Energy Conference (iSPEC)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Sustainable Power and Energy Conference (iSPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSPEC53008.2021.9735481","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In view of the characteristics of small corpus size, nested entities and abbreviated entities in electric field, this paper studied the methods of entity recognition in professional field, and proposed a method of entity recognition in electric field based on text vector feature enhancement. Firstly, by using low-intensity and preset word segmentation, the semantic information contained in Chinese words can be used reasonably to avoid the influence of semantic separation between single Chinese character and word on text vectors; Then WS-BiGRU is designed to learn the structural features of words, and a new feature enhancement vector is formed by fusing word vectors, parts of speech and word lengths. Furthermore, the training of entity recognition model is completed by BiGRU-Attention-CRF. For electric corpus, the P value is 86.74% and the R value is 87.3%. The research results can be used in electric dispatching knowledge graph, metadata model, intelligent report query and other business scenarios.
基于词结构BiGRU的电场命名实体识别优化
针对电场中语料库规模小、实体嵌套、实体缩略的特点,研究了专业领域的实体识别方法,提出了一种基于文本向量特征增强的电场中实体识别方法。首先,通过低强度和预设分词,合理利用汉语词所包含的语义信息,避免单个汉字和词之间的语义分离对文本向量的影响;然后设计WS-BiGRU学习单词的结构特征,通过融合单词向量、词性和单词长度形成新的特征增强向量。实体识别模型的训练由BiGRU-Attention-CRF完成。电语料库的P值为86.74%,R值为87.3%。研究成果可用于电力调度知识图谱、元数据模型、智能报表查询等业务场景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信