Surfing the OCEAN: The machine learning psycholexical approach 2.0 to detect personality traits in texts

IF 5 1区心理学 Q1 Psychology

Journal of Personality Pub Date : 2024-01-13 DOI:10.1111/jopy.12915

Federico Giannini, Marco Marelli, Fabio Stella, Dario Monzani, Luca Pancani

{"title":"Surfing the OCEAN: The machine learning psycholexical approach 2.0 to detect personality traits in texts","authors":"Federico Giannini, Marco Marelli, Fabio Stella, Dario Monzani, Luca Pancani","doi":"10.1111/jopy.12915","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Objective</h3>\n \n <p>We aimed to develop a machine learning model to infer OCEAN traits from text.</p>\n </section>\n \n <section>\n \n <h3> Background</h3>\n \n <p>The psycholexical approach allows retrieving information about personality traits from human language. However, it has rarely been applied because of methodological and practical issues that current computational advancements could overcome.</p>\n </section>\n \n <section>\n \n <h3> Method</h3>\n \n <p>Classical taxonomies and a large Yelp corpus were leveraged to learn an embedding for each personality trait. These embeddings were used to train a feedforward neural network for predicting trait values. Their generalization performances have been evaluated through two external validation studies involving experts (<i>N</i> = 11) and laypeople (<i>N</i> = 100) in a discrimination task about the best markers of each trait and polarity.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Intrinsic validation of the model yielded excellent results, with <i>R</i><sup>2</sup> values greater than 0.78. The validation studies showed a high proportion of matches between participants' choices and model predictions, confirming its efficacy in identifying new terms related to the OCEAN traits. The best performance was observed for agreeableness and extraversion, especially for their positive polarities. The model was less efficient in identifying the negative polarity of openness and conscientiousness.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>This innovative methodology can be considered a “psycholexical approach 2.0,” contributing to research in personality and its practical applications in many fields.</p>\n </section>\n </div>","PeriodicalId":48421,"journal":{"name":"Journal of Personality","volume":"92 6","pages":"1602-1615"},"PeriodicalIF":5.0000,"publicationDate":"2024-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Personality","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jopy.12915","RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Psychology","Score":null,"Total":0}

引用次数: 0

Abstract

Objective

We aimed to develop a machine learning model to infer OCEAN traits from text.

Background

The psycholexical approach allows retrieving information about personality traits from human language. However, it has rarely been applied because of methodological and practical issues that current computational advancements could overcome.

Method

Classical taxonomies and a large Yelp corpus were leveraged to learn an embedding for each personality trait. These embeddings were used to train a feedforward neural network for predicting trait values. Their generalization performances have been evaluated through two external validation studies involving experts (N = 11) and laypeople (N = 100) in a discrimination task about the best markers of each trait and polarity.

Results

Intrinsic validation of the model yielded excellent results, with R² values greater than 0.78. The validation studies showed a high proportion of matches between participants' choices and model predictions, confirming its efficacy in identifying new terms related to the OCEAN traits. The best performance was observed for agreeableness and extraversion, especially for their positive polarities. The model was less efficient in identifying the negative polarity of openness and conscientiousness.

Conclusions

This innovative methodology can be considered a “psycholexical approach 2.0,” contributing to research in personality and its practical applications in many fields.

查看原文本刊更多论文

冲浪海洋：机器学习心理词汇学方法 2.0 检测文本中的个性特征

我们的目标是开发一个机器学习模型，从文本中推断出 OCEAN 的特征。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Personality PSYCHOLOGY, SOCIAL-

CiteScore

9.60

自引率

6.00%

发文量

100

期刊介绍： Journal of Personality publishes scientific investigations in the field of personality. It focuses particularly on personality and behavior dynamics, personality development, and individual differences in the cognitive, affective, and interpersonal domains. The journal reflects and stimulates interest in the growth of new theoretical and methodological approaches in personality psychology.