{"title":"LwF4IEE: An Incremental Learning Method for Interactive Event Extraction","authors":"Jiashun Duan, Xin Zhang, Chi Xu","doi":"10.1109/CyberC55534.2022.00026","DOIUrl":null,"url":null,"abstract":"Albeit great progress has been witnessed in event extraction, the accuracies achieved up to now by various automatic models still can not meet the performance requirements of some special applications such as disaster monitoring and rescue. It motivates us to introduce a new human-in-loop extraction mode called interactive event extraction (IEE), which works iteratively. Each iteration consists of three main steps: \"model recommending candidate results → manual selecting and correcting → model re-training and updating\". For candidate recommendation, we build an MRC (Machine Reading Comprehension)-based model that can output several most likely candidate elements, i.e., candidate triggers and arguments, by confidence evaluation. For model re-training and updating, we proposed an incremental learning method named LwF4IEE (Learning without Forgetting for IEE), which employs manual selected and corrected results as hard label and prediction of original model as soft label to avoid catastrophic forgetting. We conduct extensive experiments on datasets constructed from real-world Chinese texts. The results show that when setting the number of candidates to be 5, recalls of triggers and arguments reach 93.80% and 90.58% respectively, which is 11.51% and 11.33% higher compared with the basic MRC-based automatic extraction model. Moreover, LwF4IEE increases the recall of triggers by 2.71% on specific event types and only decreases by 0.24% on other types, achieving the purpose of learning without forgetting.","PeriodicalId":234632,"journal":{"name":"2022 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CyberC55534.2022.00026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Albeit great progress has been witnessed in event extraction, the accuracies achieved up to now by various automatic models still can not meet the performance requirements of some special applications such as disaster monitoring and rescue. It motivates us to introduce a new human-in-loop extraction mode called interactive event extraction (IEE), which works iteratively. Each iteration consists of three main steps: "model recommending candidate results → manual selecting and correcting → model re-training and updating". For candidate recommendation, we build an MRC (Machine Reading Comprehension)-based model that can output several most likely candidate elements, i.e., candidate triggers and arguments, by confidence evaluation. For model re-training and updating, we proposed an incremental learning method named LwF4IEE (Learning without Forgetting for IEE), which employs manual selected and corrected results as hard label and prediction of original model as soft label to avoid catastrophic forgetting. We conduct extensive experiments on datasets constructed from real-world Chinese texts. The results show that when setting the number of candidates to be 5, recalls of triggers and arguments reach 93.80% and 90.58% respectively, which is 11.51% and 11.33% higher compared with the basic MRC-based automatic extraction model. Moreover, LwF4IEE increases the recall of triggers by 2.71% on specific event types and only decreases by 0.24% on other types, achieving the purpose of learning without forgetting.
尽管在事件提取方面取得了很大的进步,但目前各种自动模型所达到的精度仍不能满足一些特殊应用的性能要求,如灾害监测和救援。这促使我们引入一种新的人在循环提取模式,称为交互式事件提取(IEE),它是迭代工作的。每次迭代包括三个主要步骤:“模型推荐候选结果→人工选择和修正→模型再训练和更新”。对于候选推荐,我们构建了一个基于MRC(机器阅读理解)的模型,该模型可以通过置信度评估输出几个最可能的候选元素,即候选触发器和参数。对于模型的再训练和更新,我们提出了一种名为LwF4IEE (learning without Forgetting For IEE)的增量学习方法,该方法将人工选择和修正的结果作为硬标签,将原始模型的预测作为软标签,以避免灾难性遗忘。我们对真实世界的中文文本构建的数据集进行了广泛的实验。结果表明,当候选者数量为5时,触发器和参数的召回率分别达到93.80%和90.58%,比基于mrc的基本自动提取模型分别提高了11.51%和11.33%。LwF4IEE在特定事件类型上使触发器的回忆率提高了2.71%,在其他事件类型上仅降低了0.24%,达到了学习而不遗忘的目的。