{"title":"Occupational Representativeness in Twitter","authors":"S. Kim, Stephen Wan, Cécile Paris","doi":"10.1145/3015022.3015036","DOIUrl":null,"url":null,"abstract":"This paper describes an approach to detect one particular demographic characteristic, occupation (or profession) in Twitter user profiles. In this paper, we show how effective the approach is for estimating occupational population statistics in Australian Twitter by correlating them with real-world population obtained from 2011 Australian census data. We also demonstrate that we can gain more reliable social media insights in the context of occupational representativeness in Twitter if a non-standard occupation name is mapped into a standard occupation name. To our knowledge, this is the first attempt to build a machine learning model that automatically identifies linguistically noisy or open-ended occupations in Twitter, resulting in more reliable occupational population.","PeriodicalId":334601,"journal":{"name":"Proceedings of the 21st Australasian Document Computing Symposium","volume":"164 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21st Australasian Document Computing Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3015022.3015036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper describes an approach to detect one particular demographic characteristic, occupation (or profession) in Twitter user profiles. In this paper, we show how effective the approach is for estimating occupational population statistics in Australian Twitter by correlating them with real-world population obtained from 2011 Australian census data. We also demonstrate that we can gain more reliable social media insights in the context of occupational representativeness in Twitter if a non-standard occupation name is mapped into a standard occupation name. To our knowledge, this is the first attempt to build a machine learning model that automatically identifies linguistically noisy or open-ended occupations in Twitter, resulting in more reliable occupational population.