BiVaSE: A bilingual variational sentence encoder with randomly initialized Transformer layers

IF 0.4 3区文学 0 LANGUAGE & LINGUISTICS

Acta Linguistica Academica Pub Date : 2022-12-12 DOI:10.1556/2062.2022.00584

Bence Nyéki

引用次数: 0

Abstract

Transformer-based NLP models have achieved state-of-the-art results in many NLP tasks including text classification and text generation. However, the layers of these models do not output any explicit representations for texts units larger than tokens (e.g. sentences), although such representations are required to perform text classification. Sentence encodings are usually obtained by applying a pooling technique during fine-tuning on a specific task. In this paper, a new sentence encoder is introduced. Relying on an autoencoder architecture, it was trained to learn sentence representations from the very beginning of its training. The model was trained on bilingual data with variational Bayesian inference. Sentence representations were evaluated in downstream and linguistic probing tasks. Although the newly introduced encoder generally performs worse than well-known Transformer-based encoders, the experiments show that it was able to learn to incorporate linguistic information in the sentence representations.

查看原文本刊更多论文

BiVaSE:一种具有随机初始化Transformer层的双语变分句编码器

基于Transformer的NLP模型在许多NLP任务中取得了最先进的结果，包括文本分类和文本生成。然而，这些模型的层不输出大于标记（例如句子）的文本单元的任何显式表示，尽管执行文本分类需要这样的表示。句子编码通常是通过在对特定任务进行微调时应用池技术来获得的。本文介绍了一种新的句子编码器。依靠自动编码器架构，它从一开始就被训练来学习句子表示。该模型使用变分贝叶斯推理在双语数据上进行训练。在下游和语言探究任务中评估句子表征。尽管新引入的编码器通常比众所周知的基于Transformer的编码器性能更差，但实验表明，它能够学会将语言信息融入句子表示中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Acta Linguistica Academica Arts and Humanities-Literature and Literary Theory

CiteScore

1.00

自引率

20.00%

发文量

期刊介绍： Acta Linguistica Academica publishes papers on general linguistics. Papers presenting empirical material must have strong theoretical implications. The scope of the journal is not restricted to the core areas of linguistics; it also covers areas such as socio- and psycholinguistics, neurolinguistics, discourse analysis, the philosophy of language, language typology, and formal semantics. The journal also publishes book and dissertation reviews and advertisements.