Big5PersonalityEssays: Introducing a Novel Synthetic Generated Dataset Consisting of Short State-of-Consciousness Essays Annotated Based on the Five Factor Model of Personality
Author: Iustin Floroiu
arXiv - CS - Other Computer Science, 2024-05-22
https://doi.org/arxiv-2407.17586
Abstract
Given the rapid advances in large language models (LLMs), it is vitally
important to study their behavior and to apply them across scientific
fields. In recent years, psychology has rarely been approached with novel
computational tools. One reason is the high complexity of the data required
for a proper analysis. Moreover, psychology, and psychometrics in
particular, has few datasets available for analysis and for use with
artificial intelligence. For these reasons, this study introduces a synthetic
dataset of short essays labeled according to the five factor model (FFM) of
personality traits.