Chunk-based incremental processing and learning: An integrated theory of word discovery, implicit statistical learning, and speed of lexical processing.
Authors: Andrew Jessop, Julian Pine, Fernand Gobet
Journal: Psychological Review
DOI: 10.1037/rev0000564
Published: 2025-05-12
Impact factor: 5.1 (JCR Q1, Psychology)
Citations: 0
Abstract
According to chunking theories, children discover their first words by extracting subsequences embedded in their continuous input. However, the mechanisms proposed in these accounts are often incompatible with data from other areas of language development. We present a new theory to connect the chunking accounts of word discovery with the broader developmental literature. We argue that (a) children build a diverse collection of chunks, including words, multiword phrases, and sublexical units; (b) these chunks have different processing times determined by how often each chunk is used to recode the input; and (c) these processing times interact with short-term memory limitations and incremental processing to constrain learning. We implemented this theory as a computational modeling architecture called Chunk-Based Incremental Processing and Learning (CIPAL). Across nine studies, we demonstrate that CIPAL can model word discovery in different contexts. First, we trained the model with 70 child-directed speech corpora from 15 languages. CIPAL gradually discovered words in each language, with cross-linguistic variation in performance. The model's average processing time also improved with experience, resembling the developmental changes observed in children's speed of processing. Second, we showed that CIPAL could simulate seven influential effects reported in statistical learning experiments with artificial languages. These included a preference for words over nonwords, part words, frequency-matched part words, phantom words, and sublexical units. On this basis, we argue that incremental chunking is an effective implicit statistical learning mechanism that may be central to children's vocabulary development. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
Journal overview:
Psychological Review publishes articles that make important theoretical contributions to any area of scientific psychology, including systematic evaluation of alternative theories.