What’s in my cluster? Evaluating automated clustering methods to understand idiosyncratic search behavior in verbal fluency
Abhilasha A. Kumar, Nancy B. Lundin, Michael N. Jones
Journal of Memory and Language, Volume 141, Article 104606 (publication date: 2025-02-01)
DOI: 10.1016/j.jml.2024.104606
URL: https://www.sciencedirect.com/science/article/pii/S0749596X24001098
Impact factor: 2.9; JCR: Q1 (Linguistics)
Citations: 0
Abstract
Individuals routinely search through memory for concepts. This behavior is commonly studied via the verbal fluency task (VFT), where participants are typically asked to generate as many exemplars as they can from a given category (e.g., animals) or letter label (e.g., F) within a fixed amount of time. Responses in the VFT tend to be clustered in meaningful ways, but individuals differ widely in how they cluster items. Despite the development of several (hand-coded and automated) methods of defining clusters and switches in the VFT, there is currently no consensus on which scoring method provides the best mechanistic account of how individuals search through memory in the VFT. In this work, we provide an empirical evaluation of several automated methods for defining clusters and switches in the VFT by comparing model-predicted clusters with participant-designated clusters. We find that a method that combines gradual rises and drops in a weighted composite of semantic and phonological similarity best predicts participant-designated cluster-switch events across three domains (animals, foods, and occupations). Furthermore, we propose a novel approach to understanding idiosyncratic search behavior: we compute a measure of discordance for each pairwise transition from a large dataset of cluster-switch designations that independent raters (N = 211) provided for the same transitions in a pre-registered experiment. We find that transitions with high idiosyncratic scores have low lexical content (i.e., low semantic and phonological similarity), and that an individual’s score in one domain is predictive of their score in another domain, suggesting that idiosyncratic scores may capture meaningful information about non-lexical sources and processes that contribute to memory search at the individual level.
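The abstract does not give formulas, so the following is only a rough sketch of the two ideas it names, not the authors' implementation. It assumes that each transition between consecutive responses already has a semantic and a phonological similarity value, blends them with a hypothetical weight `alpha`, flags a switch with a simple dip-then-recover rule (a stand-in for the "gradual rises and drops" method, whose exact definition is not stated here), and treats discordance as the share of raters who disagree with the participant. All function names, the weight, and the example values are illustrative assumptions.

```python
from typing import List


def composite_similarity(semantic: List[float],
                         phonological: List[float],
                         alpha: float = 0.5) -> List[float]:
    """Blend two per-transition similarity series.

    alpha is a hypothetical weight on semantic similarity; the weighting
    used in the paper is not given in the abstract.
    """
    if len(semantic) != len(phonological):
        raise ValueError("similarity series must have the same length")
    return [alpha * s + (1.0 - alpha) * p
            for s, p in zip(semantic, phonological)]


def drop_switches(similarity: List[float]) -> List[int]:
    """Flag transition i as a switch (1) when similarity dips at i and
    recovers at i + 1. This is an assumed, simplified rule.

    The first and last transitions are never flagged because they lack
    a neighbor on one side.
    """
    flags = [0] * len(similarity)
    for i in range(1, len(similarity) - 1):
        if similarity[i - 1] > similarity[i] < similarity[i + 1]:
            flags[i] = 1
    return flags


def discordance(participant_flag: int, rater_flags: List[int]) -> float:
    """Share of independent raters whose cluster/switch label for one
    transition differs from the participant's own label (an assumed
    operationalization of discordance)."""
    if not rater_flags:
        raise ValueError("at least one rater label is required")
    disagreements = sum(1 for r in rater_flags if r != participant_flag)
    return disagreements / len(rater_flags)


# Made-up similarity values for four consecutive transitions.
sim = composite_similarity([0.8, 0.2, 0.7, 0.6], [0.4, 0.2, 0.1, 0.2])
print(drop_switches(sim))            # [0, 1, 0, 0]
print(discordance(1, [1, 0, 1, 1]))  # 0.25
```

Under this sketch, a transition the participant marked as a switch while three of four raters agreed has a discordance of 0.25. Higher values would mark transitions where the participant's designation departs from the rater consensus.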
About the journal:
Articles in the Journal of Memory and Language contribute to the formulation of scientific issues and theories in the areas of memory, language comprehension and production, and cognitive processes. Special emphasis is given to research articles that provide new theoretical insights based on a carefully laid empirical foundation. The journal generally favors articles that provide multiple experiments. In addition, significant theoretical papers without new experimental findings may be published.
The Journal of Memory and Language is a valuable tool for cognitive scientists, including psychologists, linguists, and others interested in memory and learning, language, reading, and speech.
Research Areas include:
• Topics that illuminate aspects of memory or language processing
• Linguistics
• Neuropsychology.