论LLM surprisal作为N400和P600功能解释的局限性。

IF 2.6 4区医学 Q3 NEUROSCIENCES

Brain Research Pub Date : 2025-08-07 DOI:10.1016/j.brainres.2025.149841

Benedict Krieger , Harm Brouwer , Christoph Aurnhammer , Matthew W. Crocker

{"title":"论LLM surprisal作为N400和P600功能解释的局限性。","authors":"Benedict Krieger , Harm Brouwer , Christoph Aurnhammer , Matthew W. Crocker","doi":"10.1016/j.brainres.2025.149841","DOIUrl":null,"url":null,"abstract":"<div><div>Expectations about upcoming words play a central role in language comprehension, with expected words being processed more easily than less expected ones. Surprisal theory formalizes this relationship by positing that cognitive effort is proportional to a word’s negative log-probability in context, as determined by distributional, linguistic, and world knowledge constraints. The emergence of large language models (LLMs) demonstrating the capacity to compute richly contextualized surprisal estimates, has motivated their consideration as models of comprehension. We assess here the relationship of LLM surprisal with two key neural correlates of comprehension – the N400 and the P600 – which differ in sensitivity to semantic association and contextual expectancy. While prior work has focused on the N400, we propose that the P600 may offer a better index of surprisal, as it is unaffected by association while still patterning continuously with expectancy. Using regression-based ERPs (rERPs), we examine data from three German factorial studies to evaluate the extent to which LLM surprisal can account for ERP differences. Our results show that LLM surprisal captures neither component consistently. We find that it is contaminated by simple association, particularly in smaller LLMs. As a result, LLM surprisal can partially account for association-driven N400 effects, but not for the full attenuation of N400 effects. Correspondingly, this property of LLMs compromises their ability to model the P600, which is sensitive to expectancy but not to association.</div></div>","PeriodicalId":9083,"journal":{"name":"Brain Research","volume":"1865 ","pages":"Article 149841"},"PeriodicalIF":2.6000,"publicationDate":"2025-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On the limits of LLM surprisal as a functional explanation of the N400 and P600\",\"authors\":\"Benedict Krieger , Harm Brouwer , Christoph Aurnhammer , Matthew W. Crocker\",\"doi\":\"10.1016/j.brainres.2025.149841\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Expectations about upcoming words play a central role in language comprehension, with expected words being processed more easily than less expected ones. Surprisal theory formalizes this relationship by positing that cognitive effort is proportional to a word’s negative log-probability in context, as determined by distributional, linguistic, and world knowledge constraints. The emergence of large language models (LLMs) demonstrating the capacity to compute richly contextualized surprisal estimates, has motivated their consideration as models of comprehension. We assess here the relationship of LLM surprisal with two key neural correlates of comprehension – the N400 and the P600 – which differ in sensitivity to semantic association and contextual expectancy. While prior work has focused on the N400, we propose that the P600 may offer a better index of surprisal, as it is unaffected by association while still patterning continuously with expectancy. Using regression-based ERPs (rERPs), we examine data from three German factorial studies to evaluate the extent to which LLM surprisal can account for ERP differences. Our results show that LLM surprisal captures neither component consistently. We find that it is contaminated by simple association, particularly in smaller LLMs. As a result, LLM surprisal can partially account for association-driven N400 effects, but not for the full attenuation of N400 effects. Correspondingly, this property of LLMs compromises their ability to model the P600, which is sensitive to expectancy but not to association.</div></div>\",\"PeriodicalId\":9083,\"journal\":{\"name\":\"Brain Research\",\"volume\":\"1865 \",\"pages\":\"Article 149841\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-08-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Brain Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0006899325004020\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brain Research","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0006899325004020","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"NEUROSCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

对即将出现的单词的预期在语言理解中起着核心作用，预期的单词比不预期的单词更容易被处理。惊喜理论通过假设认知努力与单词在语境中的负对数概率成正比，将这种关系形式化，这是由分布、语言和世界知识约束决定的。大型语言模型（llm）的出现证明了计算丰富情境化的意外估计的能力，这促使他们考虑作为理解模型。我们在这里评估了LLM惊讶与理解的两个关键神经相关- N400和P600的关系，它们对语义关联和上下文期望的敏感性不同。虽然之前的工作主要集中在N400上，但我们认为P600可能会提供更好的惊喜指数，因为它不受联想的影响，同时仍然持续地与期望形成模式。使用基于回归的ERP (rERPs)，我们检查了三个德国析因研究的数据，以评估法学硕士惊讶度在多大程度上可以解释ERP差异。我们的结果表明，LLM surprisal一致地捕获了这两个组件。我们发现它受到简单关联的污染，特别是在较小的llm中。因此，LLM surprisal可以部分解释关联驱动的N400效应，但不能解释N400效应的完全衰减。相应地，法学硕士的这一特性损害了他们对P600建模的能力，P600对期望敏感，但对关联不敏感。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

On the limits of LLM surprisal as a functional explanation of the N400 and P600

查看原文本刊更多论文

On the limits of LLM surprisal as a functional explanation of the N400 and P600

Expectations about upcoming words play a central role in language comprehension, with expected words being processed more easily than less expected ones. Surprisal theory formalizes this relationship by positing that cognitive effort is proportional to a word’s negative log-probability in context, as determined by distributional, linguistic, and world knowledge constraints. The emergence of large language models (LLMs) demonstrating the capacity to compute richly contextualized surprisal estimates, has motivated their consideration as models of comprehension. We assess here the relationship of LLM surprisal with two key neural correlates of comprehension – the N400 and the P600 – which differ in sensitivity to semantic association and contextual expectancy. While prior work has focused on the N400, we propose that the P600 may offer a better index of surprisal, as it is unaffected by association while still patterning continuously with expectancy. Using regression-based ERPs (rERPs), we examine data from three German factorial studies to evaluate the extent to which LLM surprisal can account for ERP differences. Our results show that LLM surprisal captures neither component consistently. We find that it is contaminated by simple association, particularly in smaller LLMs. As a result, LLM surprisal can partially account for association-driven N400 effects, but not for the full attenuation of N400 effects. Correspondingly, this property of LLMs compromises their ability to model the P600, which is sensitive to expectancy but not to association.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Brain Research 医学-神经科学

CiteScore

5.90

自引率

3.40%

发文量

268

审稿时长

47 days

期刊介绍： An international multidisciplinary journal devoted to fundamental research in the brain sciences. Brain Research publishes papers reporting interdisciplinary investigations of nervous system structure and function that are of general interest to the international community of neuroscientists. As is evident from the journals name, its scope is broad, ranging from cellular and molecular studies through systems neuroscience, cognition and disease. Invited reviews are also published; suggestions for and inquiries about potential reviews are welcomed. With the appearance of the final issue of the 2011 subscription, Vol. 67/1-2 (24 June 2011), Brain Research Reviews has ceased publication as a distinct journal separate from Brain Research. Review articles accepted for Brain Research are now published in that journal.