Leveraging Context for Perceptual Prediction Using Word Embeddings

IF 2.4 2区心理学 Q2 PSYCHOLOGY, EXPERIMENTAL

Cognitive Science Pub Date : 2025-06-07 DOI:10.1111/cogs.70072

Georgia-Ann Carter, Frank Keller, Paul Hoffman

{"title":"Leveraging Context for Perceptual Prediction Using Word Embeddings","authors":"Georgia-Ann Carter, Frank Keller, Paul Hoffman","doi":"10.1111/cogs.70072","DOIUrl":null,"url":null,"abstract":"<p>Word embeddings derived from large language corpora have been successfully used in cognitive science and artificial intelligence to represent linguistic meaning. However, there is continued debate as to how well they encode useful information about the perceptual qualities of concepts. This debate is critical to identifying the scope of embodiment in human semantics. If perceptual object properties can be inferred from word embeddings derived from language alone, this suggests that language provides a useful adjunct to direct perceptual experience for acquiring this kind of conceptual knowledge. Previous research has shown mixed performance when embeddings are used to predict perceptual qualities. Here, we tested if we could improve performance by leveraging the ability of Transformer-based language models to represent word meaning in context. To this end, we conducted two experiments. Our first experiment investigated noun representations. We generated decontextualized (“charcoal”) and contextualized (“the brightness of charcoal”) Word2Vec and BERT embeddings for a large set of concepts and compared their ability to predict human ratings of the concepts’ brightness. We repeated this procedure to also probe for the shape of those concepts. In general, we found very good prediction performance for shape, and a more modest performance for brightness. The addition of context did not improve perceptual prediction performance. In Experiment 2, we investigated representations of adjective–noun phrases. Perceptual prediction performance was generally found to be good, with the nonadditive nature of adjective brightness reflected in the word embeddings. We also found that the addition of context had a limited impact on how well perceptual features could be predicted. We frame these results against current work on the interpretability of language models and debates surrounding embodiment in human conceptual processing.</p>","PeriodicalId":48349,"journal":{"name":"Cognitive Science","volume":"49 6","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/cogs.70072","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Science","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/cogs.70072","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Word embeddings derived from large language corpora have been successfully used in cognitive science and artificial intelligence to represent linguistic meaning. However, there is continued debate as to how well they encode useful information about the perceptual qualities of concepts. This debate is critical to identifying the scope of embodiment in human semantics. If perceptual object properties can be inferred from word embeddings derived from language alone, this suggests that language provides a useful adjunct to direct perceptual experience for acquiring this kind of conceptual knowledge. Previous research has shown mixed performance when embeddings are used to predict perceptual qualities. Here, we tested if we could improve performance by leveraging the ability of Transformer-based language models to represent word meaning in context. To this end, we conducted two experiments. Our first experiment investigated noun representations. We generated decontextualized (“charcoal”) and contextualized (“the brightness of charcoal”) Word2Vec and BERT embeddings for a large set of concepts and compared their ability to predict human ratings of the concepts’ brightness. We repeated this procedure to also probe for the shape of those concepts. In general, we found very good prediction performance for shape, and a more modest performance for brightness. The addition of context did not improve perceptual prediction performance. In Experiment 2, we investigated representations of adjective–noun phrases. Perceptual prediction performance was generally found to be good, with the nonadditive nature of adjective brightness reflected in the word embeddings. We also found that the addition of context had a limited impact on how well perceptual features could be predicted. We frame these results against current work on the interpretability of language models and debates surrounding embodiment in human conceptual processing.

Abstract Image

查看原文本刊更多论文

利用词嵌入利用上下文进行感知预测

来源于大型语料库的词嵌入已经成功地应用于认知科学和人工智能中来表示语言意义。然而，关于它们如何很好地编码关于概念感知质量的有用信息，仍然存在争议。这一争论对于确定人类语义学中体现的范围至关重要。如果感知对象的属性可以从单独来源于语言的词嵌入中推断出来，这表明语言为获取这种概念性知识提供了直接感知经验的有用辅助。先前的研究表明，当嵌入用于预测感知品质时，效果好坏参半。在这里，我们测试了是否可以通过利用基于transformer的语言模型在上下文中表示单词含义的能力来提高性能。为此，我们进行了两次实验。我们的第一个实验研究名词表征。我们为一大批概念生成了去语境化（“木炭”）和情境化（“木炭的亮度”）的Word2Vec和BERT嵌入，并比较了它们预测人类对概念亮度评级的能力。我们重复这个过程来探索这些概念的形状。总的来说，我们发现形状的预测性能非常好，而亮度的预测性能则比较适中。语境的加入并没有提高感知预测的性能。在实验2中，我们研究了形容词-名词短语的表征。感知预测性能普遍较好，形容词亮度的非加性体现在词嵌入中。我们还发现，背景的添加对预测感知特征的影响有限。我们将这些结果与当前关于语言模型可解释性的工作和围绕人类概念处理中体现的争论相比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Cognitive Science PSYCHOLOGY, EXPERIMENTAL-

CiteScore

4.10

自引率

8.00%

发文量

139

期刊介绍： Cognitive Science publishes articles in all areas of cognitive science, covering such topics as knowledge representation, inference, memory processes, learning, problem solving, planning, perception, natural language understanding, connectionism, brain theory, motor control, intentional systems, and other areas of interdisciplinary concern. Highest priority is given to research reports that are specifically written for a multidisciplinary audience. The audience is primarily researchers in cognitive science and its associated fields, including anthropologists, education researchers, psychologists, philosophers, linguists, computer scientists, neuroscientists, and roboticists.