{"title":"Micro-blog User Profiling: A Supervised Clustering based Approach for Age and Gender Classification","authors":"J. Qiu, Lin Li, Yunpei Zheng","doi":"10.1109/BESC48373.2019.8963155","DOIUrl":null,"url":null,"abstract":"User profiling is the process of tagging user attributes, such as gender, age, location and so on. Currently the popular way is to do classification by treating attributes as labels. In addition, ensemble classifications are used to future improve classification quality. However, the base classifier of traditional ensemble classification is trained by using random sampling, which cannot guarantee a constantly good classification performance. This paper proposes a supervised clustering of ensemble classification approach to tag micro-blog users with age and gender attributes. The proposed approach is to divide the training data with a same label into multiple clusters and then combine different clusters from training data with different labels into training data subsets. Each subset is used to train a base classifier. Experimental results show that the proposed approach can improve the gender and age classification accuracy classification by 1.79 % and 0.67 % respectively, compared with the traditional ensemble classification approach.","PeriodicalId":190867,"journal":{"name":"2019 6th International Conference on Behavioral, Economic and Socio-Cultural Computing (BESC)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 6th International Conference on Behavioral, Economic and Socio-Cultural Computing (BESC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BESC48373.2019.8963155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
User profiling is the process of tagging user attributes, such as gender, age, location and so on. Currently the popular way is to do classification by treating attributes as labels. In addition, ensemble classifications are used to future improve classification quality. However, the base classifier of traditional ensemble classification is trained by using random sampling, which cannot guarantee a constantly good classification performance. This paper proposes a supervised clustering of ensemble classification approach to tag micro-blog users with age and gender attributes. The proposed approach is to divide the training data with a same label into multiple clusters and then combine different clusters from training data with different labels into training data subsets. Each subset is used to train a base classifier. Experimental results show that the proposed approach can improve the gender and age classification accuracy classification by 1.79 % and 0.67 % respectively, compared with the traditional ensemble classification approach.