Using Mention Segmentation to Improve Event Detection with Multi-head Attention

Jiali Chen, Yu Hong, Jingli Zhang, Jianmin Yao
2019 International Conference on Asian Language Processing (IALP), November 2019.
DOI: 10.1109/IALP48816.2019.9037683

Abstract

Sentence-level event detection (ED) is the task of detecting words that describe specific types of events, comprising the subtasks of trigger word identification and event type classification. Previous work straightforwardly feeds a sentence into neural classification models and analyzes the deep semantics of its words one by one. Relying on these semantics, event-class probabilities can be predicted for each word, covering the carefully defined ACE event classes plus an "N/A" class (i.e., non-trigger word). Such models have achieved remarkable success. However, our findings show that a natural sentence may possess more than one trigger word and thus entail different types of events. In particular, the information closely related to each event lies only within a unique sentence segment and has nothing to do with other segments. To reduce the negative influence of noise from other segments, we propose to perform semantics learning for event detection only within the scope of a segment instead of the whole sentence. Accordingly, we develop a novel ED method that integrates sentence segmentation into the neural event classification architecture. A Bidirectional Long Short-Term Memory (Bi-LSTM) network with multi-head attention serves as the classification model. Sentence segmentation is cast as a sequence labeling problem, for which BERT is used. We combine the resulting embeddings and use them as the input of the neural classification model. Experimental results show that our method reaches 76.8% and 74.2% $F_1$-scores for trigger identification and event type classification, respectively, outperforming the state of the art.
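The abstract's central idea — restricting semantics learning to a segment so that one event's representation is not polluted by tokens from another event's segment — can be realized with a segment mask inside multi-head attention. Below is a minimal NumPy sketch of segment-masked multi-head self-attention over per-token hidden states; the function name, random weights, and dimensions are illustrative assumptions, not the paper's actual implementation (which applies attention over Bi-LSTM outputs with learned projections).

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def segment_masked_mha(H, Wq, Wk, Wv, Wo, seg_ids, n_heads):
    """Multi-head self-attention over token states H of shape (T, d),
    where each token may only attend to tokens in its own segment."""
    T, d = H.shape
    dk = d // n_heads
    # Linear projections, then split into heads: (n_heads, T, dk)
    Q = (H @ Wq).reshape(T, n_heads, dk).transpose(1, 0, 2)
    K = (H @ Wk).reshape(T, n_heads, dk).transpose(1, 0, 2)
    V = (H @ Wv).reshape(T, n_heads, dk).transpose(1, 0, 2)
    scores = Q @ K.transpose(0, 2, 1) / np.sqrt(dk)    # (n_heads, T, T)
    same_seg = seg_ids[:, None] == seg_ids[None, :]    # (T, T) segment mask
    # Large negative score outside the segment -> cross-segment weight ~ 0
    A = softmax(np.where(same_seg, scores, -1e9))
    out = (A @ V).transpose(1, 0, 2).reshape(T, d)     # merge heads
    return out @ Wo, A

# Toy example: 5 tokens split into two segments [0,0,0 | 1,1]
rng = np.random.default_rng(0)
H = rng.normal(size=(5, 8))                            # e.g., Bi-LSTM outputs
Wq, Wk, Wv, Wo = (rng.normal(size=(8, 8)) for _ in range(4))
seg_ids = np.array([0, 0, 0, 1, 1])
out, A = segment_masked_mha(H, Wq, Wk, Wv, Wo, seg_ids, n_heads=2)
```

With this mask, the attention matrix is block-diagonal over segments, so each token's contextualized representation — and hence its trigger/event-type prediction — depends only on its own segment, as the abstract proposes.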