{"title":"Toward accurate psychological simulations: Investigating LLMs’ responses to personality and cultural variables","authors":"Chihao Li , Yue Qi","doi":"10.1016/j.chb.2025.108687","DOIUrl":null,"url":null,"abstract":"<div><div>Large language models (LLMs) have demonstrated complex language comprehension, making them potential tools for psychological research. However, challenges remain in assessing their psychological properties, particularly regarding prompt design, comparison with human data, and the ability to simulate real-world psychological differences across cultural groups. This study investigates how LLMs respond to personality assessments and cultural variables, addressing gaps in previous research.</div><div>In three studies, we explored the impact of model parameters and prompt variations on LLM responses to personality tests. Study 1 examined how temperature, model type, and prompt templates influenced LLM responses, revealing that while temperature settings had minimal impact, prompt variations led to significant differences. Study 2 compared LLMs' self-report personality scores with human data (N = 18,192–49,159), finding that LLMs scored higher in positive traits (e.g., extroversion) and lower in negative traits (e.g., psychopathy), reflecting their training biases. Study 3 tested LLMs’ ability to simulate cultural differences by assessing virtual subjects from China and the USA. While significant differences were observed between the groups, both demonstrated East Asian self-construal patterns, indicating limitations in simulating authentic cultural psychological differences.</div><div>These findings highlight the influence of prompt design on LLM responses, the divergence between LLM and human personality profiles, and the difficulty of simulating accurate cultural psychological differences. These results underscore the need for more refined methodologies in psychological simulations using LLMs and suggest that current models struggle to represent diverse human psychological traits accurately.</div></div>","PeriodicalId":48471,"journal":{"name":"Computers in Human Behavior","volume":"170 ","pages":"Article 108687"},"PeriodicalIF":9.0000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Human Behavior","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0747563225001347","RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
Citations: 0
Abstract
Large language models (LLMs) have demonstrated complex language comprehension, making them potential tools for psychological research. However, challenges remain in assessing their psychological properties, particularly regarding prompt design, comparison with human data, and the ability to simulate real-world psychological differences across cultural groups. This study investigates how LLMs respond to personality assessments and cultural variables, addressing gaps in previous research.
In three studies, we explored the impact of model parameters and prompt variations on LLM responses to personality tests. Study 1 examined how temperature, model type, and prompt templates influenced LLM responses, revealing that while temperature settings had minimal impact, prompt variations led to significant differences. Study 2 compared LLMs' self-report personality scores with human data (N = 18,192–49,159), finding that LLMs scored higher on positive traits (e.g., extraversion) and lower on negative traits (e.g., psychopathy), reflecting biases in their training data. Study 3 tested LLMs' ability to simulate cultural differences by assessing virtual subjects from China and the USA. While significant differences were observed between the groups, both demonstrated East Asian self-construal patterns, indicating limitations in simulating authentic cultural psychological differences.
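The temperature-by-template grid described in Study 1 can be illustrated with a short script. The following Python sketch is not the authors' code: the OpenAI client, model name, questionnaire item, and prompt templates are all illustrative assumptions, since the abstract does not report the study's exact tooling or instruments.

from itertools import product
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Illustrative Big Five item; the actual instruments are not named in the abstract.
ITEM = "I see myself as someone who is talkative."
TEMPLATES = [  # two hypothetical prompt-template variants
    "Rate this statement from 1 (strongly disagree) to 5 (strongly agree): {item}",
    "You are completing a personality questionnaire. Reply with a number 1-5 only: {item}",
]
TEMPERATURES = [0.0, 0.7, 1.4]

def ask(template: str, temperature: float) -> str:
    """Send one prompt variant at one temperature and return the raw reply."""
    response = client.chat.completions.create(
        model="gpt-4",  # assumed model; the study compared several model types
        messages=[{"role": "user", "content": template.format(item=ITEM)}],
        temperature=temperature,
    )
    return response.choices[0].message.content

for template, temp in product(TEMPLATES, TEMPERATURES):
    print(f"T={temp:.1f} | {ask(template, temp)!r}")

Scoring the numeric replies across such a grid is what permits the comparison the abstract reports: responses that shift with the prompt wording far more than with the temperature setting.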
These findings highlight the influence of prompt design on LLM responses, the divergence between LLM and human personality profiles, and the difficulty of accurately simulating cultural psychological differences. They underscore the need for more refined methodologies in LLM-based psychological simulations and suggest that current models struggle to represent diverse human psychological traits accurately.
Journal overview:
Computers in Human Behavior is a scholarly journal that explores the psychological aspects of computer use. It publishes original theoretical works, research reports, literature reviews, and software and book reviews. The journal examines both the use of computers in psychology, psychiatry, and related fields and the psychological impact of computer use on individuals, groups, and society. Articles address topics such as professional practice, training, research, human development, learning, cognition, personality, and social interaction. The journal focuses on human interaction with computers, treating the computer as a medium through which human behaviors are shaped and expressed. It will be valuable to professionals interested in the psychological aspects of computer use, even those with limited knowledge of computers.