{"title":"Sentence processing by humans and machines: Large language models as a tool to better understand human reading.","authors":"Nikki G Kaye, Peter C Gordon","doi":"10.3758/s13423-025-02756-9","DOIUrl":null,"url":null,"abstract":"<p><p>Online measures of reading have been studied with the goal of understanding how humans process language incrementally as they progress through a text. A focus of this research has been on pinpointing how the context of a word influences its processing. Quantitatively measuring the effects of context has proven difficult but with advances in artificial intelligence, large language models (LLMs) are more capable of generating humanlike language, drawing solely on information about the probabilistic relationships of units of language (e.g., words) occurring together. LLMs can be used to estimate the probability of any word in the model's vocabulary occurring as the next word in a given context. These next-word probabilities can be used in the calculation of information theoretic metrics, such as entropy and surprisal, which can be assessed as measures of word-by-word processing load. This is done by analyzing whether entropy and surprisal derived from language models predict variance in online measures of human reading comprehension (e.g., eye-movement, self-paced reading, ERP data). The present review synthesizes empirical findings on this topic and evaluates their methodological and theoretical implications.</p>","PeriodicalId":20763,"journal":{"name":"Psychonomic Bulletin & Review","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychonomic Bulletin & Review","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.3758/s13423-025-02756-9","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
Abstract
Online measures of reading have been studied with the goal of understanding how humans process language incrementally as they progress through a text. A focus of this research has been on pinpointing how the context of a word influences its processing. Quantitatively measuring the effects of context has proven difficult, but with advances in artificial intelligence, large language models (LLMs) have become increasingly capable of generating humanlike language, drawing solely on information about the probabilistic relationships among units of language (e.g., words) occurring together. LLMs can be used to estimate the probability of any word in the model's vocabulary occurring as the next word in a given context. These next-word probabilities can be used to calculate information-theoretic metrics, such as entropy and surprisal, which can be assessed as measures of word-by-word processing load. This is done by analyzing whether entropy and surprisal derived from language models predict variance in online measures of human reading comprehension (e.g., eye-movement, self-paced reading, and ERP data). The present review synthesizes empirical findings on this topic and evaluates their methodological and theoretical implications.
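For readers who want to make the abstract's pipeline concrete, the following is a minimal illustrative sketch (not code from the review) of how per-token surprisal, -log2 P(w_t | w_1...w_{t-1}), and next-word entropy, -sum over the vocabulary of P(w | context) log2 P(w | context), can be computed from an LLM's next-word probabilities. It assumes GPT-2 accessed through the Hugging Face transformers library; the review itself surveys a range of model families, and GPT-2 here is only a stand-in.

```python
import math

import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

# Load a small pretrained LLM and its tokenizer (an assumed choice of model;
# the review discusses many model families, not GPT-2 specifically).
tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

text = "The old man the boats."
ids = tokenizer(text, return_tensors="pt").input_ids  # shape: (1, seq_len)

with torch.no_grad():
    # logits[0, t] scores the distribution over the NEXT token after position t.
    logits = model(ids).logits  # shape: (1, seq_len, vocab_size)

log_probs = torch.log_softmax(logits, dim=-1)  # natural-log probabilities
LN2 = math.log(2)  # divide by ln(2) to convert nats to bits

for t in range(1, ids.size(1)):
    dist = log_probs[0, t - 1]  # log P( . | w_1..w_{t-1}) over the full vocabulary
    # Surprisal of the token that actually occurred: -log2 P(w_t | context).
    surprisal = -dist[ids[0, t]].item() / LN2
    # Entropy of the whole next-word distribution: -sum_w P(w) * log2 P(w).
    entropy = -(dist.exp() * dist).sum().item() / LN2
    token = tokenizer.decode(ids[0, t].item())
    print(f"{token!r}: surprisal = {surprisal:.2f} bits, entropy = {entropy:.2f} bits")
```

Note that LLM tokenizers operate on subword units, so studies relating these metrics to word-level reading measures (e.g., fixation durations) typically sum subword surprisals within each word before analysis.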
Journal overview:
The journal provides coverage spanning a broad spectrum of topics in all areas of experimental psychology. The journal is primarily dedicated to the publication of theory and review articles and brief reports of outstanding experimental work. Areas of coverage include cognitive psychology broadly construed, including but not limited to action, perception, & attention, language, learning & memory, reasoning & decision making, and social cognition. We welcome submissions that approach these issues from a variety of perspectives such as behavioral measurements, comparative psychology, development, evolutionary psychology, genetics, neuroscience, and quantitative/computational modeling. We particularly encourage integrative research that crosses traditional content and methodological boundaries.