Personality Traits Prediction from Text via Machine Learning

Alessandro Bruno, Gurmeet Singh
{"title":"Personality Traits Prediction from Text via Machine Learning","authors":"Alessandro Bruno, Gurmeet Singh","doi":"10.1109/AIC55036.2022.9848937","DOIUrl":null,"url":null,"abstract":"Social media platforms have been expanding their user bases. For example, LinkedIn counts 917 million monthly visitors, while Twitter has 3.62 billion monthly visitors. YouTube has 22.77 billion monthly visitors, and Instagram has 2.86 billion monthly visitors. Reports confirm data size increase of the social media networks above by 20–30% every day. With the spread of COVID-19, the same platforms have been broadly used by the worldwide collectiveness to socialize and stay amongst people. Analyzing text from Social Networking sites helps recognize individuals' personality traits automatically. A person's personality refers to their unique characteristics that shape their habits, behaviour, attitude, and cognitive tendencies. In this work, several machine learning techniques are surveyed to estimate personality traits from input text using the Myers-Briggs Type Indicator (MBTI) model. Experiments are run over a freely accessible dataset from Kaggle. In addition, techniques such as tokenization, word stemming, stop word elimination, and feature selection, utilizing TF-IDF, are used to analyze personality traits further.","PeriodicalId":433590,"journal":{"name":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIC55036.2022.9848937","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Social media platforms have been expanding their user bases. For example, LinkedIn counts 917 million monthly visitors, while Twitter has 3.62 billion monthly visitors. YouTube has 22.77 billion monthly visitors, and Instagram has 2.86 billion monthly visitors. Reports confirm data size increase of the social media networks above by 20–30% every day. With the spread of COVID-19, the same platforms have been broadly used by the worldwide collectiveness to socialize and stay amongst people. Analyzing text from Social Networking sites helps recognize individuals' personality traits automatically. A person's personality refers to their unique characteristics that shape their habits, behaviour, attitude, and cognitive tendencies. In this work, several machine learning techniques are surveyed to estimate personality traits from input text using the Myers-Briggs Type Indicator (MBTI) model. Experiments are run over a freely accessible dataset from Kaggle. In addition, techniques such as tokenization, word stemming, stop word elimination, and feature selection, utilizing TF-IDF, are used to analyze personality traits further.
通过机器学习从文本中预测人格特征
社交媒体平台一直在扩大其用户基础。例如,LinkedIn的月访问量为9.17亿,而Twitter的月访问量为36.2亿。YouTube月访问量为227.7亿,Instagram月访问量为28.6亿。报告证实,上述社交媒体网络的数据量每天以20-30%的速度增长。随着新冠肺炎疫情的蔓延,世界各地的人们广泛使用相同的平台进行社交和交流。分析来自社交网站的文本有助于自动识别个人的个性特征。一个人的个性是指他们独特的特征,这些特征塑造了他们的习惯、行为、态度和认知倾向。在这项工作中,研究了几种机器学习技术,使用迈尔斯-布里格斯类型指标(MBTI)模型从输入文本中估计人格特征。实验是在Kaggle免费访问的数据集上进行的。此外,利用TF-IDF,使用标记化、词干提取、停止词消除和特征选择等技术进一步分析人格特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信