Improving negation detection with negation-focused pre-training

North American Chapter of the Association for Computational Linguistics Pub Date : 2022-05-09 DOI:10.48550/arXiv.2205.04012

Thinh Hung Truong, Timothy Baldwin, Trevor Cohn, K. Verspoor

引用次数: 9

Abstract

Negation is a common linguistic feature that is crucial in many language understanding tasks, yet it remains a hard problem due to diversity in its expression in different types of text. Recent works show that state-of-the-art NLP models underperform on samples containing negation in various tasks, and that negation detection models do not transfer well across domains. We propose a new negation-focused pre-training strategy, involving targeted data augmentation and negation masking, to better incorporate negation information into language models. Extensive experiments on common benchmarks show that our proposed approach improves negation detection performance and generalizability over the strong baseline NegBERT (Khandelwal and Sawant, 2020).

查看原文本刊更多论文

以否定为中心的预训练改进否定检测

否定是一种常见的语言特征，在许多语言理解任务中起着至关重要的作用，但由于在不同类型的文本中表达的多样性，它一直是一个难题。最近的研究表明，最先进的NLP模型在各种任务中包含否定的样本上表现不佳，并且否定检测模型不能很好地跨域转移。我们提出了一种新的以否定为中心的预训练策略，包括有针对性的数据增强和否定屏蔽，以更好地将否定信息纳入语言模型。在常见基准上进行的大量实验表明，我们提出的方法比强基线NegBERT提高了否定检测性能和泛化性(Khandelwal和Sawant, 2020)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

North American Chapter of the Association for Computational Linguistics

自引率

0.00%

发文量