Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering

Kosuke Akimoto, Kunihiro Takeoka, M. Oyamada
{"title":"Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering","authors":"Kosuke Akimoto, Kunihiro Takeoka, M. Oyamada","doi":"10.18653/v1/2023.findings-emnlp.784","DOIUrl":null,"url":null,"abstract":"Retrieval-augmented generation models augment knowledge encoded in a language model by providing additional relevant external knowledge (context) during generation. Although it has been shown that the quantity and quality of context impact the performance of retrieval-augmented generation models during inference, limited research explores how these characteristics affect model training. This paper explores how context quantity and quality during model training affect the performance of Fusion-in-Decoder (FiD), the state-of-the-art retrieval-augmented generation model, in extractive open-domain question answering tasks. Experimental results suggest that FiD models overfit to context quality during training and show suboptimal performance when evaluated on different context quality. Through the experimental results, we also reveal FiD models trained with different context quality have different cross-attention distribution patterns. Specifically, as context quality during training increases, FiD models tend to attend more uniformly to each passage in context. Finally, based on these observations, we propose a method to mitigate overfitting to specific context quality by introducing bias to the cross-attention distribution, which we demonstrate to be effective in improving the performance of FiD models on different context quality.","PeriodicalId":505350,"journal":{"name":"Conference on Empirical Methods in Natural Language Processing","volume":" 55","pages":"11711-11729"},"PeriodicalIF":0.0000,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Empirical Methods in Natural Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/2023.findings-emnlp.784","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Retrieval-augmented generation models augment the knowledge encoded in a language model by providing additional relevant external knowledge (context) during generation. Although it has been shown that the quantity and quality of context affect the performance of retrieval-augmented generation models during inference, limited research explores how these characteristics affect model training. This paper explores how context quantity and quality during model training affect the performance of Fusion-in-Decoder (FiD), the state-of-the-art retrieval-augmented generation model, on extractive open-domain question answering tasks. Experimental results suggest that FiD models overfit to the context quality seen during training and show suboptimal performance when evaluated on contexts of different quality. The experiments also reveal that FiD models trained with different context quality exhibit different cross-attention distribution patterns: as context quality during training increases, FiD models tend to attend more uniformly to each passage in the context. Finally, based on these observations, we propose a method to mitigate overfitting to a specific context quality by introducing a bias into the cross-attention distribution, which we demonstrate to be effective in improving the performance of FiD models on contexts of different quality.
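The core of the proposed mitigation is a bias term added to the decoder's cross-attention distribution over retrieved passages. The sketch below is a minimal, hypothetical illustration of that general idea in PyTorch: a per-passage scalar bias is added to the raw attention logits before the softmax, shifting attention mass toward or away from individual passages. The function name, tensor shapes, and the specific form of the bias are assumptions made for this sketch; the paper's exact formulation is not reproduced here.

```python
# Hypothetical sketch (not the paper's exact method): add a per-passage
# bias to cross-attention logits before normalization.
import torch
import torch.nn.functional as F

def biased_cross_attention(scores: torch.Tensor,
                           passage_ids: torch.Tensor,
                           bias: torch.Tensor) -> torch.Tensor:
    """Bias raw cross-attention scores per passage, then normalize.

    scores:      (n_queries, n_tokens) attention logits over the
                 concatenated encoder outputs of all retrieved passages.
    passage_ids: (n_tokens,) index of the passage each token belongs to.
    bias:        (n_passages,) scalar bias per passage; an all-zero bias
                 recovers standard softmax attention.
    """
    scores = scores + bias[passage_ids]  # broadcast each passage's bias to its tokens
    return F.softmax(scores, dim=-1)     # each row is a normalized attention distribution

# Toy usage: two passages of three tokens each, one decoder query.
scores = torch.randn(1, 6)
passage_ids = torch.tensor([0, 0, 0, 1, 1, 1])
bias = torch.tensor([0.5, -0.5])  # favor passage 0 over passage 1
attn = biased_cross_attention(scores, passage_ids, bias)
print(attn.sum(dim=-1))  # tensor([1.])
```

In an actual FiD decoder such a bias would be applied inside each cross-attention layer over the concatenated encoder outputs; it is factored out into a standalone function here purely to make the mechanism explicit.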