Information-Restricted Neural Language Models Reveal Different Brain Regions' Sensitivity to Semantics, Syntax and Context

IF 3.6 Q1 LINGUISTICS

Neurobiology of Language Pub Date : 2023-11-07 DOI:10.1162/nol_a_00125

Alexandre Pasquiou, Yair Lakretz, Bertrand Thirion, Christophe Pallier

{"title":"Information-Restricted Neural Language Models Reveal Different Brain Regions' Sensitivity to Semantics, Syntax and Context","authors":"Alexandre Pasquiou, Yair Lakretz, Bertrand Thirion, Christophe Pallier","doi":"10.1162/nol_a_00125","DOIUrl":null,"url":null,"abstract":"Abstract A fundamental question in neurolinguistics concerns the brain regions involved in syntactic and semantic processing during speech comprehension, both at the lexical (word processing) and supra-lexical levels (sentence and discourse processing). To what extent are these regions separated or intertwined? To address this question, we introduce a novel approach exploiting neural language models to generate high-dimensional feature sets that separately encode semantic and syntactic information. More precisely, we train a lexical language model, Glove, and a supra-lexical language model, GPT-2, on a text corpus from which we selectively removed either syntactic or semantic information. We then assess to what extent the features derived from these information-restricted models are still able to predict the fMRI time courses of humans listening to naturalistic text. Furthermore, to determine the windows of integration of brain regions involved in supra-lexical processing, we manipulate the size of contextual information provided to GPT-2. The analyses show that, while most brain regions involved in language comprehension are sensitive to both syntactic and semantic features, the relative magnitudes of these effects vary across these regions. Moreover, regions that are best fitted by semantic or syntactic features are more spatially dissociated in the left hemisphere than in the right one, and the right hemisphere shows sensitivity to longer contexts than the left. The novelty of our approach lies in the ability to control for the information encoded in the models’ embeddings by manipulating the training set. These “information-restricted” models complement previous studies that used language models to probe the neural bases of language, and shed new light on its spatial organization.","PeriodicalId":34845,"journal":{"name":"Neurobiology of Language","volume":"22 3‐4","pages":"0"},"PeriodicalIF":3.6000,"publicationDate":"2023-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurobiology of Language","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1162/nol_a_00125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"LINGUISTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract A fundamental question in neurolinguistics concerns the brain regions involved in syntactic and semantic processing during speech comprehension, both at the lexical (word processing) and supra-lexical levels (sentence and discourse processing). To what extent are these regions separated or intertwined? To address this question, we introduce a novel approach exploiting neural language models to generate high-dimensional feature sets that separately encode semantic and syntactic information. More precisely, we train a lexical language model, Glove, and a supra-lexical language model, GPT-2, on a text corpus from which we selectively removed either syntactic or semantic information. We then assess to what extent the features derived from these information-restricted models are still able to predict the fMRI time courses of humans listening to naturalistic text. Furthermore, to determine the windows of integration of brain regions involved in supra-lexical processing, we manipulate the size of contextual information provided to GPT-2. The analyses show that, while most brain regions involved in language comprehension are sensitive to both syntactic and semantic features, the relative magnitudes of these effects vary across these regions. Moreover, regions that are best fitted by semantic or syntactic features are more spatially dissociated in the left hemisphere than in the right one, and the right hemisphere shows sensitivity to longer contexts than the left. The novelty of our approach lies in the ability to control for the information encoded in the models’ embeddings by manipulating the training set. These “information-restricted” models complement previous studies that used language models to probe the neural bases of language, and shed new light on its spatial organization.

查看原文本刊更多论文

信息受限的神经语言模型揭示了不同脑区对语义、句法和语境的敏感性

神经语言学的一个基本问题涉及语音理解过程中涉及句法和语义处理的大脑区域，包括词汇(词处理)和超词汇水平(句子和话语处理)。这些地区在多大程度上是分开或交织在一起的?为了解决这个问题，我们引入了一种利用神经语言模型来生成高维特征集的新方法，这些特征集分别编码语义和句法信息。更准确地说，我们在一个文本语料库上训练了一个词汇语言模型Glove和一个超词汇语言模型GPT-2，我们有选择性地从中删除语法或语义信息。然后，我们评估了从这些信息受限模型中得出的特征在多大程度上仍然能够预测人类听自然文本的fMRI时间过程。此外，为了确定参与超词汇处理的大脑区域的整合窗口，我们操纵提供给GPT-2的上下文信息的大小。分析表明，虽然大多数涉及语言理解的大脑区域对句法和语义特征都很敏感，但这些影响的相对程度在这些区域之间有所不同。此外，与右半球相比，最适合语义或句法特征的区域在左半球的空间分离程度更高，右半球比左半球对更长的上下文更敏感。我们方法的新颖之处在于能够通过操纵训练集来控制模型嵌入中编码的信息。这些“信息限制”模型补充了先前使用语言模型来探索语言的神经基础的研究，并为语言的空间组织提供了新的视角。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊