Embedding Hallucination for Few-shot Language Fine-tuning

North American Chapter of the Association for Computational Linguistics Pub Date : 2022-05-03 DOI:10.48550/arXiv.2205.01307

Yiren Jian, Chongyang Gao, Soroush Vosoughi

引用次数: 0

Abstract

Few-shot language learners adapt knowledge from a pre-trained model to recognize novel classes from a few-labeled sentences. In such settings, fine-tuning a pre-trained language model can cause severe over-fitting. In this paper, we propose an Embedding Hallucination (EmbedHalluc) method, which generates auxiliary embedding-label pairs to expand the fine-tuning dataset. The hallucinator is trained by playing an adversarial game with the discriminator, such that the hallucinated embedding is indiscriminative to the real ones in the fine-tuning dataset. By training with the extended dataset, the language learner effectively learns from the diverse hallucinated embeddings to overcome the over-fitting issue. Experiments demonstrate that our proposed method is effective in a wide range of language tasks, outperforming current fine-tuning methods. Further, we show that EmbedHalluc outperforms other methods that address this over-fitting problem, such as common data augmentation, semi-supervised pseudo-labeling, and regularization.

查看原文本刊更多论文

嵌入幻觉的几次语言微调

少量的语言学习者从预先训练的模型中调整知识，从少量标记的句子中识别新的类别。在这种情况下，对预先训练好的语言模型进行微调可能会导致严重的过拟合。在本文中，我们提出了一种嵌入幻觉(embeddhalluc)方法，该方法生成辅助的嵌入标签对来扩展微调数据集。通过与鉴别器进行对抗游戏来训练幻觉器，使得幻觉嵌入对微调数据集中的真实嵌入是不区分的。通过扩展数据集的训练，语言学习者可以有效地从不同的幻觉嵌入中学习，克服过拟合问题。实验表明，我们提出的方法在广泛的语言任务中是有效的，优于当前的微调方法。此外，我们表明，embeddhalluc优于其他解决这种过度拟合问题的方法，如公共数据增强、半监督伪标记和正则化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

North American Chapter of the Association for Computational Linguistics

自引率

0.00%

发文量