Mirroring the real world in social media: twitter, geolocation, and sentiment analysis

Eric Baucom, Azade Sanjari, Xiaozhong Liu, Miao Chen
{"title":"Mirroring the real world in social media: twitter, geolocation, and sentiment analysis","authors":"Eric Baucom, Azade Sanjari, Xiaozhong Liu, Miao Chen","doi":"10.1145/2513549.2513559","DOIUrl":null,"url":null,"abstract":"In recent years social media has been used to characterize and predict real world events, and in this research we seek to investigate how closely Twitter mirrors the real world. Specifically, we wish to characterize the relationship between the language used on Twitter and the results of the 2011 NBA Playoff games. We hypothesize that the language used by Twitter users will be useful in classifying the users' locations combined with the current status of which team is in the lead during the game. This is based on the common assumption that \"fans\" of a team have more positive sentiment and will accordingly use different language when their team is doing well. We investigate this hypothesis by labeling each tweet according the the location of the user along with the team that is in the lead at the time of the tweet. The hypothesized difference in language (as measured by tfidf) should then have predictive power over the tweet labels. We find that indeed it does and we experiment further by adding semantic orientation (SO) information as part of the feature set. The SO does not offer much improvement over tf-idf alone. We discuss the relative strengths of the two types of features for our data.","PeriodicalId":126426,"journal":{"name":"Proceedings of the 2013 international workshop on Mining unstructured big data using natural language processing","volume":"109 11","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"38","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2013 international workshop on Mining unstructured big data using natural language processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2513549.2513559","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 38

Abstract

In recent years social media has been used to characterize and predict real world events, and in this research we seek to investigate how closely Twitter mirrors the real world. Specifically, we wish to characterize the relationship between the language used on Twitter and the results of the 2011 NBA Playoff games. We hypothesize that the language used by Twitter users will be useful in classifying the users' locations combined with the current status of which team is in the lead during the game. This is based on the common assumption that "fans" of a team have more positive sentiment and will accordingly use different language when their team is doing well. We investigate this hypothesis by labeling each tweet according the the location of the user along with the team that is in the lead at the time of the tweet. The hypothesized difference in language (as measured by tfidf) should then have predictive power over the tweet labels. We find that indeed it does and we experiment further by adding semantic orientation (SO) information as part of the feature set. The SO does not offer much improvement over tf-idf alone. We discuss the relative strengths of the two types of features for our data.
在社交媒体上反映现实世界:推特、地理定位和情感分析
近年来,社交媒体已经被用来描述和预测现实世界的事件,在这项研究中,我们试图调查Twitter对现实世界的反映有多密切。具体来说,我们希望描述Twitter上使用的语言与2011年NBA季后赛结果之间的关系。我们假设Twitter用户使用的语言将有助于对用户的位置进行分类,并结合哪支球队在比赛中处于领先地位。这是基于一个普遍的假设,即一支球队的“球迷”有更积极的情绪,因此当他们的球队表现良好时,他们会使用不同的语言。我们通过根据用户的位置以及推文发布时领先的团队标记每条推文来调查这一假设。假设的语言差异(由tfidf测量)应该对tweet标签具有预测能力。我们发现确实如此,我们通过添加语义方向(SO)信息作为特征集的一部分进行了进一步的实验。与单独使用tf-idf相比,SO并没有提供多少改进。我们讨论了两种类型的特征对于我们的数据的相对优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信