{"title":"Predicting Personality Using Facebook Status Based on Semi-supervised Learning","authors":"Heci Zheng, Chunhua Wu","doi":"10.1145/3318299.3318363","DOIUrl":null,"url":null,"abstract":"Personality analysis on social media is a research hotspot due to the importance of personality research in psychology as well as the rapid development of social media. Many studies have used social media status to analyze user's personality, but most of them are conducted on inadequate label data and linguistic features. In this paper, to explore the usage of unlabeled data on personality analysis, a personality analysis framework based on semi-supervised learning is introduced. Besides, for making full use of the language information in social media status, the well-known n-gram model is adopted to extract linguistic features. The experimental results demonstrate the semi-supervised learning can take advantage of unlabeled data and improve the accuracy of prediction model.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Machine Learning and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3318299.3318363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18
Abstract
Personality analysis on social media is a research hotspot due to the importance of personality research in psychology as well as the rapid development of social media. Many studies have used social media status to analyze user's personality, but most of them are conducted on inadequate label data and linguistic features. In this paper, to explore the usage of unlabeled data on personality analysis, a personality analysis framework based on semi-supervised learning is introduced. Besides, for making full use of the language information in social media status, the well-known n-gram model is adopted to extract linguistic features. The experimental results demonstrate the semi-supervised learning can take advantage of unlabeled data and improve the accuracy of prediction model.