{"title":"Skewed distributions facilitate infants' word segmentation","authors":"Lucie Wolters , Mitsuhiko Ota , Inbal Arnon","doi":"10.1016/j.cognition.2025.106221","DOIUrl":null,"url":null,"abstract":"<div><div>Infants can use statistical patterns to segment continuous speech into words, a crucial task in language acquisition. Experimental studies typically investigate this ability using artificial languages with a uniform frequency distribution, where all words occur equally often. However, words in natural language follow a highly skewed distribution conforming to a Zipfian power law, in which few words occur frequently while many occur infrequently. Prior work shows that such skewed distributions facilitate word segmentation, but the experimental evidence for this has been limited to individuals aged ten years or older, leaving unclear whether this effect arises from accumulated linguistic experience or is already present in the early stages of language learning. To address this, we conducted a word segmentation study with 7- to 9-month-old infants. Infants were exposed to a continuous speech stream containing four artificial words, presented either in a uniform or skewed frequency distribution. We found that infants exposed to the skewed distribution showed a greater looking time difference between familiar and unfamiliar words compared to those in the uniform condition. These findings suggest that skewed distributions facilitate learning during early linguistic development, highlighting the impact of such distributions on language acquisition. Moreover, these findings suggest that the widespread use of uniform distributions in lab-based studies may underestimate infants' segmentation abilities.</div></div>","PeriodicalId":48455,"journal":{"name":"Cognition","volume":"263 ","pages":"Article 106221"},"PeriodicalIF":2.8000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognition","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010027725001611","RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Infants can use statistical patterns to segment continuous speech into words, a crucial task in language acquisition. Experimental studies typically investigate this ability using artificial languages with a uniform frequency distribution, where all words occur equally often. However, words in natural language follow a highly skewed distribution conforming to a Zipfian power law, in which few words occur frequently while many occur infrequently. Prior work shows that such skewed distributions facilitate word segmentation, but the experimental evidence for this has been limited to individuals aged ten years or older, leaving unclear whether this effect arises from accumulated linguistic experience or is already present in the early stages of language learning. To address this, we conducted a word segmentation study with 7- to 9-month-old infants. Infants were exposed to a continuous speech stream containing four artificial words, presented either in a uniform or skewed frequency distribution. We found that infants exposed to the skewed distribution showed a greater looking time difference between familiar and unfamiliar words compared to those in the uniform condition. These findings suggest that skewed distributions facilitate learning during early linguistic development, highlighting the impact of such distributions on language acquisition. Moreover, these findings suggest that the widespread use of uniform distributions in lab-based studies may underestimate infants' segmentation abilities.
期刊介绍:
Cognition is an international journal that publishes theoretical and experimental papers on the study of the mind. It covers a wide variety of subjects concerning all the different aspects of cognition, ranging from biological and experimental studies to formal analysis. Contributions from the fields of psychology, neuroscience, linguistics, computer science, mathematics, ethology and philosophy are welcome in this journal provided that they have some bearing on the functioning of the mind. In addition, the journal serves as a forum for discussion of social and political aspects of cognitive science.