Investigating a Hierarchical Inductive Bias in L2-textbook Seq2Seq Language Model

Korean Journal of Applied Linguistics Pub Date : 2023-09-30 DOI:10.17154/kjal.2023.9.39.3.57

Euhee Kim, Keonwoo Koo

引用次数: 0

Abstract

The relations between words in natural language are governed by hierarchical structures rather than linear ordering. In recent years, artificial neural network-based language models (LMs) have demonstrated impressive achievements in tasks related to sentence processing. These models benefit from pre-training, which helps enhance their performance. However, our comprehension of the precise syntactic knowledge acquired by these models during sentence processing remains somewhat restricted. This paper examines whether the L2-textbook Seq2Seq (Sequence-to-Sequence) language model processes or transforms sentences based on a syntactic hierarchical inductive bias or a linear inductive bias through transformation tasks. We replicate several previous experiments and explore our model’s capacity to exhibit human-like behavior. Our experiments provide evidence that, in transformation tasks, our pre-trained L2-textbook LSTM-based Seq2Seq model performed based on the linear rule rather than hierarchical rule. In essence, our model showcased a linear inductive bias, consistent with the Scratch-Seq2Seq model.

查看原文本刊更多论文

l2 -教科书Seq2Seq语言模型的层次归纳偏差研究

在自然语言中，词与词之间的关系是由层次结构而不是线性顺序支配的。近年来，基于人工神经网络的语言模型(LMs)在句子处理相关任务中取得了令人瞩目的成就。这些模型受益于预训练，这有助于提高它们的性能。然而，我们对这些模型在句子处理过程中获得的精确句法知识的理解仍然受到一定的限制。本文考察了l2 -教科书Seq2Seq (Sequence-to-Sequence)语言模型是基于句法层次的归纳偏见还是通过转换任务的线性归纳偏见来处理或转换句子。我们重复了之前的几个实验，并探索了我们的模型展示类似人类行为的能力。我们的实验证明，在转换任务中，我们预先训练的基于l2教科书lstm的Seq2Seq模型是基于线性规则而不是分层规则执行的。从本质上讲，我们的模型显示出线性感应偏置，与Scratch-Seq2Seq模型一致。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Korean Journal of Applied Linguistics

自引率

0.00%

发文量