没有基础的大型语言模型可以恢复人类概念的非感觉运动特征，但不能恢复感觉运动特征

IF 15.9 1区心理学 Q1 MULTIDISCIPLINARY SCIENCES

Nature Human Behaviour Pub Date : 2025-06-04 DOI:10.1038/s41562-025-02203-8

Qihui Xu, Yingying Peng, Samuel A. Nastase, Martin Chodorow, Minghua Wu, Ping Li

{"title":"没有基础的大型语言模型可以恢复人类概念的非感觉运动特征，但不能恢复感觉运动特征","authors":"Qihui Xu, Yingying Peng, Samuel A. Nastase, Martin Chodorow, Minghua Wu, Ping Li","doi":"10.1038/s41562-025-02203-8","DOIUrl":null,"url":null,"abstract":"To what extent can language give rise to complex conceptual representation? Is multisensory experience essential? Recent large language models (LLMs) challenge the necessity of grounding for concept formation: whether LLMs without grounding nevertheless exhibit human-like representations. Here we compare multidimensional representations of ~4,442 lexical concepts between humans (the Glasgow Norms1, N = 829; and the Lancaster Norms2, N = 3,500) and state-of-the-art LLMs with and without visual learning, across non-sensorimotor, sensory and motor domains. We found that (1) the similarity between model and human representations decreases from non-sensorimotor to sensory domains and is minimal in motor domains, indicating a systematic divergence, and (2) models with visual learning exhibit enhanced similarity with human representations in visual-related dimensions. These results highlight the potential limitations of language in isolation for LLMs and that the integration of diverse modalities can potentially enhance alignment with human conceptual representation. Xu et al. find that large language models not only align with human representations in non-sensorimotor domains but also diverge in sensorimotor ones, with additional visual training associated with enhanced alignment.","PeriodicalId":19074,"journal":{"name":"Nature Human Behaviour","volume":"9 9","pages":"1871-1886"},"PeriodicalIF":15.9000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.comhttps://www.nature.com/articles/s41562-025-02203-8.pdf","citationCount":"0","resultStr":"{\"title\":\"Large language models without grounding recover non-sensorimotor but not sensorimotor features of human concepts\",\"authors\":\"Qihui Xu, Yingying Peng, Samuel A. Nastase, Martin Chodorow, Minghua Wu, Ping Li\",\"doi\":\"10.1038/s41562-025-02203-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To what extent can language give rise to complex conceptual representation? Is multisensory experience essential? Recent large language models (LLMs) challenge the necessity of grounding for concept formation: whether LLMs without grounding nevertheless exhibit human-like representations. Here we compare multidimensional representations of ~4,442 lexical concepts between humans (the Glasgow Norms1, N = 829; and the Lancaster Norms2, N = 3,500) and state-of-the-art LLMs with and without visual learning, across non-sensorimotor, sensory and motor domains. We found that (1) the similarity between model and human representations decreases from non-sensorimotor to sensory domains and is minimal in motor domains, indicating a systematic divergence, and (2) models with visual learning exhibit enhanced similarity with human representations in visual-related dimensions. These results highlight the potential limitations of language in isolation for LLMs and that the integration of diverse modalities can potentially enhance alignment with human conceptual representation. Xu et al. find that large language models not only align with human representations in non-sensorimotor domains but also diverge in sensorimotor ones, with additional visual training associated with enhanced alignment.\",\"PeriodicalId\":19074,\"journal\":{\"name\":\"Nature Human Behaviour\",\"volume\":\"9 9\",\"pages\":\"1871-1886\"},\"PeriodicalIF\":15.9000,\"publicationDate\":\"2025-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.nature.comhttps://www.nature.com/articles/s41562-025-02203-8.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature Human Behaviour\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://www.nature.com/articles/s41562-025-02203-8\",\"RegionNum\":1,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Human Behaviour","FirstCategoryId":"102","ListUrlMain":"https://www.nature.com/articles/s41562-025-02203-8","RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

语言在多大程度上可以产生复杂的概念表征？多感官体验必不可少吗？最近的大型语言模型（llm）对概念形成的基础的必要性提出了挑战：没有基础的llm是否仍然表现出类似人类的表征。在这里，我们比较了人类之间约4,442个词汇概念的多维表示(格拉斯哥标准1，N = 829；和兰开斯特标准2，N = 3500)，以及最先进的llm，包括视觉学习和非视觉学习，包括非感觉运动、感觉和运动领域。我们发现(1)模型和人类表征之间的相似性从非感觉运动域到感觉域降低，并且在运动域最小，表明存在系统差异；(2)具有视觉学习的模型在视觉相关维度上与人类表征具有增强的相似性。这些结果突出了llm孤立语言的潜在局限性，并且多种模式的整合可以潜在地增强与人类概念表征的一致性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Large language models without grounding recover non-sensorimotor but not sensorimotor features of human concepts

查看原文本刊更多论文

Large language models without grounding recover non-sensorimotor but not sensorimotor features of human concepts

To what extent can language give rise to complex conceptual representation? Is multisensory experience essential? Recent large language models (LLMs) challenge the necessity of grounding for concept formation: whether LLMs without grounding nevertheless exhibit human-like representations. Here we compare multidimensional representations of ~4,442 lexical concepts between humans (the Glasgow Norms1, N = 829; and the Lancaster Norms2, N = 3,500) and state-of-the-art LLMs with and without visual learning, across non-sensorimotor, sensory and motor domains. We found that (1) the similarity between model and human representations decreases from non-sensorimotor to sensory domains and is minimal in motor domains, indicating a systematic divergence, and (2) models with visual learning exhibit enhanced similarity with human representations in visual-related dimensions. These results highlight the potential limitations of language in isolation for LLMs and that the integration of diverse modalities can potentially enhance alignment with human conceptual representation. Xu et al. find that large language models not only align with human representations in non-sensorimotor domains but also diverge in sensorimotor ones, with additional visual training associated with enhanced alignment.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Nature Human Behaviour Psychology-Social Psychology

CiteScore

36.80

自引率

1.00%

发文量

227

期刊介绍： Nature Human Behaviour is a journal that focuses on publishing research of outstanding significance into any aspect of human behavior.The research can cover various areas such as psychological, biological, and social bases of human behavior.It also includes the study of origins, development, and disorders related to human behavior.The primary aim of the journal is to increase the visibility of research in the field and enhance its societal reach and impact.