结合BERT模型和递归神经网络的中文事件检测

Zhang Wei, Wang Yongli
{"title":"结合BERT模型和递归神经网络的中文事件检测","authors":"Zhang Wei, Wang Yongli","doi":"10.1109/ICMCCE51767.2020.00356","DOIUrl":null,"url":null,"abstract":"With the rapid development of computer technology and the scale of the Internet, it is particularly important to extract useful information from the increasing amount of information on the Internet. Among them, event extraction is one of the research hotspots in the field of natural language processing and an important research direction in the field of information extraction. Event detection is the first step of the event extraction task, which plays a decisive role in the subsequent event extraction work. The article adopts BERT for word vector training, joint lexical vector, named entity vector, and semantic dependency vector as the input of Bi-LSTM, and then input them into the CRF sequence tagging layer after acquiring the features of sentences to achieve the recognition of event trigger words and the classification of event categories. The CEC corpus is selected as the training and test set, and experiments show that the method is effective in event detection, with F-values up to more than 70%.","PeriodicalId":6712,"journal":{"name":"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","volume":"11 1","pages":"1625-1629"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Chinese Event Detection Combining BERT Model with Recurrent Neural Networks\",\"authors\":\"Zhang Wei, Wang Yongli\",\"doi\":\"10.1109/ICMCCE51767.2020.00356\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of computer technology and the scale of the Internet, it is particularly important to extract useful information from the increasing amount of information on the Internet. Among them, event extraction is one of the research hotspots in the field of natural language processing and an important research direction in the field of information extraction. Event detection is the first step of the event extraction task, which plays a decisive role in the subsequent event extraction work. The article adopts BERT for word vector training, joint lexical vector, named entity vector, and semantic dependency vector as the input of Bi-LSTM, and then input them into the CRF sequence tagging layer after acquiring the features of sentences to achieve the recognition of event trigger words and the classification of event categories. The CEC corpus is selected as the training and test set, and experiments show that the method is effective in event detection, with F-values up to more than 70%.\",\"PeriodicalId\":6712,\"journal\":{\"name\":\"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)\",\"volume\":\"11 1\",\"pages\":\"1625-1629\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMCCE51767.2020.00356\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMCCE51767.2020.00356","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

随着计算机技术的飞速发展和互联网规模的扩大,如何从日益增多的互联网信息中提取有用的信息就显得尤为重要。其中,事件提取是自然语言处理领域的研究热点之一,也是信息提取领域的一个重要研究方向。事件检测是事件提取任务的第一步,对后续的事件提取工作起着决定性的作用。本文采用BERT进行词向量训练,联合词法向量、命名实体向量、语义依赖向量作为Bi-LSTM的输入,获取句子特征后输入到CRF序列标注层中,实现事件触发词的识别和事件类别的分类。选择CEC语料库作为训练和测试集,实验表明该方法在事件检测中是有效的,f值高达70%以上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Chinese Event Detection Combining BERT Model with Recurrent Neural Networks
With the rapid development of computer technology and the scale of the Internet, it is particularly important to extract useful information from the increasing amount of information on the Internet. Among them, event extraction is one of the research hotspots in the field of natural language processing and an important research direction in the field of information extraction. Event detection is the first step of the event extraction task, which plays a decisive role in the subsequent event extraction work. The article adopts BERT for word vector training, joint lexical vector, named entity vector, and semantic dependency vector as the input of Bi-LSTM, and then input them into the CRF sequence tagging layer after acquiring the features of sentences to achieve the recognition of event trigger words and the classification of event categories. The CEC corpus is selected as the training and test set, and experiments show that the method is effective in event detection, with F-values up to more than 70%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信