Chinese Semantic Role Labeling Integrating Global Information and Attention Mechanism

Ao Zhu, Guoyi Che, F. Wan, Ning Ma
{"title":"Chinese Semantic Role Labeling Integrating Global Information and Attention Mechanism","authors":"Ao Zhu, Guoyi Che, F. Wan, Ning Ma","doi":"10.1109/ICPECA51329.2021.9362502","DOIUrl":null,"url":null,"abstract":"With the rapid development of artificial intelligence and Chinese information processing technology, natural language processing related research has gradually deepened to the level of semantic understanding, and Chinese semantic role labeling is the core technology in the field of semantic understanding. In the field of Chinese information processing where statistical machine learning is still the mainstream, traditional labeling methods rely heavily on the parsing degree of sentence syntax and semantics, so the labeling accuracy is limited and cannot meet current needs. In response to the above problems, this paper proposes a CNN-BiLSTM-Attention-CRF fusion model for Chinese semantic role tagging, and at the same time the model performance optimization research. In the model training stage, convolution kernels of different sizes are used to capture the local features of the sentence, and then the different local features are spliced into a new feature vector through the average pooling technology, which is the global feature that integrates the semantic information of the entire sentence sequence, the same part of speech, sentence phrase Multi-level linguistic feature groups such as structure are fed into the model together. Through multiple sets of experimental demonstrations, the feature set integrated into the sequence global information can significantly improve the various indicators of the model.","PeriodicalId":119798,"journal":{"name":"2021 IEEE International Conference on Power Electronics, Computer Applications (ICPECA)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Power Electronics, Computer Applications (ICPECA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPECA51329.2021.9362502","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the rapid development of artificial intelligence and Chinese information processing technology, natural language processing related research has gradually deepened to the level of semantic understanding, and Chinese semantic role labeling is the core technology in the field of semantic understanding. In the field of Chinese information processing where statistical machine learning is still the mainstream, traditional labeling methods rely heavily on the parsing degree of sentence syntax and semantics, so the labeling accuracy is limited and cannot meet current needs. In response to the above problems, this paper proposes a CNN-BiLSTM-Attention-CRF fusion model for Chinese semantic role tagging, and at the same time the model performance optimization research. In the model training stage, convolution kernels of different sizes are used to capture the local features of the sentence, and then the different local features are spliced into a new feature vector through the average pooling technology, which is the global feature that integrates the semantic information of the entire sentence sequence, the same part of speech, sentence phrase Multi-level linguistic feature groups such as structure are fed into the model together. Through multiple sets of experimental demonstrations, the feature set integrated into the sequence global information can significantly improve the various indicators of the model.
整合全局信息和注意机制的汉语语义角色标注
随着人工智能和中文信息处理技术的快速发展,自然语言处理相关研究逐渐深入到语义理解层面,而中文语义角色标注是语义理解领域的核心技术。在统计机器学习仍为主流的中文信息处理领域,传统标注方法严重依赖于句子句法和语义的解析程度,标注精度有限,无法满足当前需求。针对上述问题,本文提出了一种面向中文语义角色标注的CNN-BiLSTM-Attention-CRF融合模型,并对模型性能进行了优化研究。在模型训练阶段,利用不同大小的卷积核捕获句子的局部特征,然后通过平均池化技术将不同的局部特征拼接成一个新的特征向量,即综合了整个句子序列语义信息的全局特征,将同一词性、句子短语结构等多层次语言特征组一起馈入模型。通过多组实验论证,将特征集集成到序列全局信息中,可以显著提高模型的各项指标。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信