{"title":"User-annotated microtext data for modeling and analyzing users' sociolinguistic characteristics and age grading","authors":"N. Moseley, Cecilia Ovesdotter Alm, M. Rege","doi":"10.1109/RCIS.2014.6861046","DOIUrl":null,"url":null,"abstract":"Information from Twitter messages have become an important area for research in computational analysis of natural language. As yet, much latent user attribute analysis on Twitter is unexplored. One reason is that only few latent attributes are explicitly defined by users on Twitter. This work presents and analyzes a data set annotated by Twitter users themselves for age and other useful attributes for use in latent attribute inference applications. We report on statistical analysis of the collected latent attributes and tweet information using association mining.","PeriodicalId":288073,"journal":{"name":"2014 IEEE Eighth International Conference on Research Challenges in Information Science (RCIS)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Eighth International Conference on Research Challenges in Information Science (RCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RCIS.2014.6861046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Information from Twitter messages have become an important area for research in computational analysis of natural language. As yet, much latent user attribute analysis on Twitter is unexplored. One reason is that only few latent attributes are explicitly defined by users on Twitter. This work presents and analyzes a data set annotated by Twitter users themselves for age and other useful attributes for use in latent attribute inference applications. We report on statistical analysis of the collected latent attributes and tweet information using association mining.