{"title":"What's Surprising About Surprisal.","authors":"Sophie Slaats, Andrea E Martin","doi":"10.1007/s42113-025-00237-9","DOIUrl":null,"url":null,"abstract":"<p><p>In the computational and experimental psycholinguistic literature, the mechanisms behind syntactic structure building (e.g., combining words into phrases and sentences) are the subject of considerable debate. Much experimental work has shown that surprisal is a good predictor of human behavioral and neural data. These findings have led some authors to model language comprehension in a purely probabilistic way. In this paper, we use simulation to exemplify why surprisal works so well to model human data and to illustrate why exclusive reliance on it can be problematic for the development of mechanistic theories of language comprehension, particularly those with emphasis on meaning composition. Rather than arguing for the importance of structural or probabilistic information to the exclusion or exhaustion of the other, we argue more emphasis should be placed on understanding how the brain leverages both types of information (viz., statistical and structured). We propose that probabilistic information is an important <i>cue</i> to the structure in the message, but is not a substitute for the structure itself-neither computationally, formally, nor conceptually. 
Surprisal and other probabilistic metrics must play a key role as theoretical objects in any explanatory mechanistic theory of language processing, but that role remains in the service of the brain's goal of constructing structured meaning from sensory input.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s42113-025-00237-9.</p>","PeriodicalId":72660,"journal":{"name":"Computational brain & behavior","volume":"8 2","pages":"233-248"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12125142/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational brain & behavior","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s42113-025-00237-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/21 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In the computational and experimental psycholinguistic literature, the mechanisms behind syntactic structure building (e.g., combining words into phrases and sentences) are the subject of considerable debate. Much experimental work has shown that surprisal is a good predictor of human behavioral and neural data. These findings have led some authors to model language comprehension in a purely probabilistic way. In this paper, we use simulation to exemplify why surprisal works so well to model human data and to illustrate why exclusive reliance on it can be problematic for the development of mechanistic theories of language comprehension, particularly those with an emphasis on meaning composition. Rather than arguing for the importance of structural or probabilistic information to the exclusion or exhaustion of the other, we argue that more emphasis should be placed on understanding how the brain leverages both types of information (viz., statistical and structured). We propose that probabilistic information is an important cue to the structure in the message, but is not a substitute for the structure itself, neither computationally, formally, nor conceptually. Surprisal and other probabilistic metrics must play a key role as theoretical objects in any explanatory mechanistic theory of language processing, but that role remains in the service of the brain's goal of constructing structured meaning from sensory input.
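For readers unfamiliar with the metric, surprisal is the negative log probability of a word given its preceding context. The following is a minimal illustrative sketch, not the authors' simulation: the bigram probabilities and word pairs below are invented for demonstration, and any real psycholinguistic use would estimate them from a corpus or a language model.

```python
import math

# Toy conditional probabilities P(word | previous word).
# These values are illustrative assumptions, not estimates from real data.
bigram_probs = {
    ("the", "dog"): 0.4,
    ("the", "idea"): 0.05,
}

def surprisal(prev_word, word, probs):
    """Surprisal in bits: -log2 P(word | prev_word)."""
    return -math.log2(probs[(prev_word, word)])

# A predictable continuation carries less surprisal than an unexpected one,
# which is the basic property linking surprisal to processing difficulty.
low = surprisal("the", "dog", bigram_probs)    # -log2(0.4)
high = surprisal("the", "idea", bigram_probs)  # -log2(0.05)
```

Note that nothing in this computation represents syntactic structure itself; the probabilities merely reflect distributional regularities, which is the distinction the abstract draws between a cue to structure and the structure itself.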
Supplementary information: The online version contains supplementary material available at 10.1007/s42113-025-00237-9.