基于异构图和动态注意网络的中文命名实体识别

Yuke Wang, Ling Lu, Wu Yang, Yinong Chen
{"title":"基于异构图和动态注意网络的中文命名实体识别","authors":"Yuke Wang, Ling Lu, Wu Yang, Yinong Chen","doi":"10.1109/ISADS56919.2023.10092180","DOIUrl":null,"url":null,"abstract":"Abstract Lexicon have been proved to enhance character representation to help Chinese named entity recognition (NER) model distinguish entities. Although lexicon information includes both semantic and boundary information of words, existing studies usually use only part of them and has low utilization of lexicon information. To efficiently extract dictionary features and integrate character representation, we propose a Heterogeneous Graph and Dynamic Attention Network (HGDAN), aiming at fusing contextual information and capturing dynamic associations between characters and words, thus improving the performance of Chinese NER. PGDNA uses the boundary information of the dictionary to construct a heterogeneous graph and uses the graph attention method to extract semantic information, as well as suppressing lexical noise through the gating unit. In addition, we found the traditional attention model has a non-zero attention problem that will distract the attention of the model, and proposed a simple and effective method to solve it. Experiments on the performance and inference speed of HGDAN on four Chinese datasets have proved its superiority.","PeriodicalId":412453,"journal":{"name":"2023 IEEE 15th International Symposium on Autonomous Decentralized System (ISADS)","volume":"60 7","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Chinese named entity recognition based on Heterogeneous Graph and Dynamic Attention Network\",\"authors\":\"Yuke Wang, Ling Lu, Wu Yang, Yinong Chen\",\"doi\":\"10.1109/ISADS56919.2023.10092180\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Lexicon have been proved to enhance character representation to help Chinese named entity recognition (NER) model distinguish entities. Although lexicon information includes both semantic and boundary information of words, existing studies usually use only part of them and has low utilization of lexicon information. To efficiently extract dictionary features and integrate character representation, we propose a Heterogeneous Graph and Dynamic Attention Network (HGDAN), aiming at fusing contextual information and capturing dynamic associations between characters and words, thus improving the performance of Chinese NER. PGDNA uses the boundary information of the dictionary to construct a heterogeneous graph and uses the graph attention method to extract semantic information, as well as suppressing lexical noise through the gating unit. In addition, we found the traditional attention model has a non-zero attention problem that will distract the attention of the model, and proposed a simple and effective method to solve it. Experiments on the performance and inference speed of HGDAN on four Chinese datasets have proved its superiority.\",\"PeriodicalId\":412453,\"journal\":{\"name\":\"2023 IEEE 15th International Symposium on Autonomous Decentralized System (ISADS)\",\"volume\":\"60 7\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 15th International Symposium on Autonomous Decentralized System (ISADS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISADS56919.2023.10092180\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 15th International Symposium on Autonomous Decentralized System (ISADS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISADS56919.2023.10092180","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

摘要:词汇已被证明可以增强汉字的表征能力,以帮助中文命名实体识别(NER)模型识别实体。虽然词汇信息包括词的语义信息和边界信息,但现有的研究通常只使用其中的一部分,对词汇信息的利用率较低。为了有效地提取字典特征和整合字符表示,我们提出了一种异构图和动态注意网络(HGDAN),旨在融合上下文信息和捕获字符与单词之间的动态关联,从而提高汉语NER的性能。PGDNA利用字典的边界信息构建异构图,利用图注意方法提取语义信息,并通过门控单元抑制词汇噪声。此外,我们发现传统的注意力模型存在非零注意力问题,会分散模型的注意力,并提出了一种简单有效的方法来解决这个问题。在四个中文数据集上对HGDAN的性能和推理速度进行了实验,证明了它的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Chinese named entity recognition based on Heterogeneous Graph and Dynamic Attention Network
Abstract Lexicon have been proved to enhance character representation to help Chinese named entity recognition (NER) model distinguish entities. Although lexicon information includes both semantic and boundary information of words, existing studies usually use only part of them and has low utilization of lexicon information. To efficiently extract dictionary features and integrate character representation, we propose a Heterogeneous Graph and Dynamic Attention Network (HGDAN), aiming at fusing contextual information and capturing dynamic associations between characters and words, thus improving the performance of Chinese NER. PGDNA uses the boundary information of the dictionary to construct a heterogeneous graph and uses the graph attention method to extract semantic information, as well as suppressing lexical noise through the gating unit. In addition, we found the traditional attention model has a non-zero attention problem that will distract the attention of the model, and proposed a simple and effective method to solve it. Experiments on the performance and inference speed of HGDAN on four Chinese datasets have proved its superiority.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信