{"title":"A Richer Vocabulary of Chinese Personality Traits: Leveraging Word Embedding Technology for Mining Personality Descriptors.","authors":"Yigang Ding, Feijun Zheng, Linjie Xu, Xinru Yang, Yiyun Jia","doi":"10.1007/s10936-024-10060-1","DOIUrl":null,"url":null,"abstract":"<p><p>This study uses a data-driven approach to mine the distribution of personality traits among Chinese people in the Chinese social context. Based on the hypothesis of personality lexicology, word embedding technology was employed in machine learning to mine personality vocabulary from Tencent's word embedding database. More than 10,000 Chinese personality descriptors were extracted and analyzed using Gaussian Mixture Model Cluster and Hierarchical clustering analysis. The data was collected from 658 Chinese people randomly from all parts of China through an online questionnaire method. The results reveal six personality traits in the Chinese context, expanding the personality thesaurus and providing examples to illustrate each trait. The findings coincide with previous research on the five-factor model, which partially describes the personality traits of Chinese people, but does not offer a complete explanation of their typical social behavior patterns. Additionally, the study supports the notion of cultural particularity in personality traits. The approach used in this study offers a richer personality vocabulary than traditional personality mining methods, and word embedding technology captures richer semantic information in Chinese. The six Chinese personality traits identified in this study will also be used to explore how to quantify and evaluate personality traits based on word embedding and personality descriptors.</p>","PeriodicalId":47689,"journal":{"name":"Journal of Psycholinguistic Research","volume":"53 3","pages":"33"},"PeriodicalIF":1.6000,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Psycholinguistic Research","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1007/s10936-024-10060-1","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"LINGUISTICS","Score":null,"Total":0}
引用次数: 0
Abstract
This study uses a data-driven approach to mine the distribution of personality traits among Chinese people in the Chinese social context. Based on the hypothesis of personality lexicology, word embedding technology was employed in machine learning to mine personality vocabulary from Tencent's word embedding database. More than 10,000 Chinese personality descriptors were extracted and analyzed using Gaussian Mixture Model Cluster and Hierarchical clustering analysis. The data was collected from 658 Chinese people randomly from all parts of China through an online questionnaire method. The results reveal six personality traits in the Chinese context, expanding the personality thesaurus and providing examples to illustrate each trait. The findings coincide with previous research on the five-factor model, which partially describes the personality traits of Chinese people, but does not offer a complete explanation of their typical social behavior patterns. Additionally, the study supports the notion of cultural particularity in personality traits. The approach used in this study offers a richer personality vocabulary than traditional personality mining methods, and word embedding technology captures richer semantic information in Chinese. The six Chinese personality traits identified in this study will also be used to explore how to quantify and evaluate personality traits based on word embedding and personality descriptors.
期刊介绍:
Journal of Psycholinguistic Research publishes carefully selected papers from the several disciplines engaged in psycholinguistic research, providing a single, recognized medium for communications among linguists, psychologists, biologists, sociologists, and others. The journal covers a broad range of approaches to the study of the communicative process, including: the social and anthropological bases of communication; development of speech and language; semantics (problems in linguistic meaning); and biological foundations. Papers dealing with the psychopathology of language and cognition, and the neuropsychology of language and cognition, are also included.