{"title":"Personality Traits Prediction from Text via Machine Learning","authors":"Alessandro Bruno, Gurmeet Singh","doi":"10.1109/AIC55036.2022.9848937","DOIUrl":null,"url":null,"abstract":"Social media platforms have been expanding their user bases. For example, LinkedIn counts 917 million monthly visitors, while Twitter has 3.62 billion monthly visitors. YouTube has 22.77 billion monthly visitors, and Instagram has 2.86 billion monthly visitors. Reports confirm data size increase of the social media networks above by 20–30% every day. With the spread of COVID-19, the same platforms have been broadly used by the worldwide collectiveness to socialize and stay amongst people. Analyzing text from Social Networking sites helps recognize individuals' personality traits automatically. A person's personality refers to their unique characteristics that shape their habits, behaviour, attitude, and cognitive tendencies. In this work, several machine learning techniques are surveyed to estimate personality traits from input text using the Myers-Briggs Type Indicator (MBTI) model. Experiments are run over a freely accessible dataset from Kaggle. In addition, techniques such as tokenization, word stemming, stop word elimination, and feature selection, utilizing TF-IDF, are used to analyze personality traits further.","PeriodicalId":433590,"journal":{"name":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIC55036.2022.9848937","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Social media platforms have been expanding their user bases. For example, LinkedIn counts 917 million monthly visitors, while Twitter has 3.62 billion monthly visitors. YouTube has 22.77 billion monthly visitors, and Instagram has 2.86 billion monthly visitors. Reports confirm data size increase of the social media networks above by 20–30% every day. With the spread of COVID-19, the same platforms have been broadly used by the worldwide collectiveness to socialize and stay amongst people. Analyzing text from Social Networking sites helps recognize individuals' personality traits automatically. A person's personality refers to their unique characteristics that shape their habits, behaviour, attitude, and cognitive tendencies. In this work, several machine learning techniques are surveyed to estimate personality traits from input text using the Myers-Briggs Type Indicator (MBTI) model. Experiments are run over a freely accessible dataset from Kaggle. In addition, techniques such as tokenization, word stemming, stop word elimination, and feature selection, utilizing TF-IDF, are used to analyze personality traits further.