Inferring Latent Attributes of an Indian Twitter User using Celebrities and Class Influencers

Puneet Singh Ludu
{"title":"Inferring Latent Attributes of an Indian Twitter User using Celebrities and Class Influencers","authors":"Puneet Singh Ludu","doi":"10.1145/2806655.2806657","DOIUrl":null,"url":null,"abstract":"In this paper we classify a user into three categories: \"Gender\", \"Age\" and \"Political Affiliation\" with an application to Indian Twitter users. Our approach automatically predicts these attributes by leveraging observable information such as the tweet behavior, linguistic content of the user's Twitter feed and the celebrities followed by the user. This paper would also use a novel feature that we would define in this paper as \"class influencers\". Class influencers are the Twitter users which influence a particular class so much that, they themselves can be used as a discriminating feature. Our approach first extracts the linguistic content based features using LIWC dictionary. Then, we derive features like smiley types, smiley count, tweet frequency, night-time tweet frequency, etc. We have also derived celebrity based feature: age, genre, gender (using Wikipedia and Freebase) of the celebrities a user is following. Finally, we refine the results using class influencers. Results show that rich linguistic features combined with popular neighborhood and influencers prove valuables and promising for additional user classification needs.","PeriodicalId":112658,"journal":{"name":"Proceedings of the 1st ACM Workshop on Social Media World Sensors","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st ACM Workshop on Social Media World Sensors","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2806655.2806657","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

In this paper we classify a user into three categories: "Gender", "Age" and "Political Affiliation" with an application to Indian Twitter users. Our approach automatically predicts these attributes by leveraging observable information such as the tweet behavior, linguistic content of the user's Twitter feed and the celebrities followed by the user. This paper would also use a novel feature that we would define in this paper as "class influencers". Class influencers are the Twitter users which influence a particular class so much that, they themselves can be used as a discriminating feature. Our approach first extracts the linguistic content based features using LIWC dictionary. Then, we derive features like smiley types, smiley count, tweet frequency, night-time tweet frequency, etc. We have also derived celebrity based feature: age, genre, gender (using Wikipedia and Freebase) of the celebrities a user is following. Finally, we refine the results using class influencers. Results show that rich linguistic features combined with popular neighborhood and influencers prove valuables and promising for additional user classification needs.
使用名人和阶级影响者推断印度Twitter用户的潜在属性
在本文中,我们将用户分为三类:“性别”,“年龄”和“政治派别”,并应用于印度Twitter用户。我们的方法通过利用可观察到的信息,如推文行为、用户推特feed的语言内容和用户关注的名人,自动预测这些属性。本文还将使用一个新颖的特征,我们将其定义为“阶级影响者”。阶级影响者是Twitter用户,他们对特定阶级的影响如此之大,以至于他们自己可以被用作区分特征。我们的方法首先使用LIWC字典提取基于语言内容的特征。然后,我们推导出表情符号类型、表情符号数量、tweet频率、夜间tweet频率等特征。我们还衍生了基于名人的特征:用户关注的名人的年龄、类型、性别(使用维基百科和Freebase)。最后,我们使用类影响因子来改进结果。结果表明,丰富的语言特征与受欢迎的邻居和有影响力的人相结合,证明了额外的用户分类需求的价值和前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信