Partial-input baselines show that NLI models can ignore context, but they don’t.

North American Chapter of the Association for Computational Linguistics Pub Date : 2022-05-24 DOI:10.48550/arXiv.2205.12181

Neha Srikanth, Rachel Rudinger

引用次数: 2

Abstract

When strong partial-input baselines reveal artifacts in crowdsourced NLI datasets, the performance of full-input models trained on such datasets is often dismissed as reliance on spurious correlations. We investigate whether state-of-the-art NLI models are capable of overriding default inferences made by a partial-input baseline. We introduce an evaluation set of 600 examples consisting of perturbed premises to examine a RoBERTa model’s sensitivity to edited contexts. Our results indicate that NLI models are still capable of learning to condition on context—a necessary component of inferential reasoning—despite being trained on artifact-ridden datasets.

查看原文本刊更多论文

部分输入基线显示NLI模型可以忽略上下文，但事实并非如此。

当强部分输入基线揭示了众包NLI数据集中的工件时，在这些数据集上训练的全输入模型的性能通常被视为依赖于虚假相关性而被忽略。我们研究了最先进的NLI模型是否能够覆盖由部分输入基线做出的默认推断。我们引入了600个由扰动前提组成的示例的评估集，以检查RoBERTa模型对编辑上下文的敏感性。我们的结果表明，NLI模型仍然能够学习情境条件——推理推理的必要组成部分——尽管是在人工数据集上训练的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

North American Chapter of the Association for Computational Linguistics

自引率

0.00%

发文量