基于词典的推特情感分析,使用表情符号和N-gram特征进行投票份额预测

Q2 Social Sciences
Barkha Bansal, S. Srivastava
{"title":"基于词典的推特情感分析,使用表情符号和N-gram特征进行投票份额预测","authors":"Barkha Bansal, S. Srivastava","doi":"10.1504/IJWBC.2019.10018048","DOIUrl":null,"url":null,"abstract":"Recently, Twitter sentiment analysis (TSA) has been successfully employed to monitor and forecast elections in many studies. However, most of the existing studies rely on extracting sentiments from explicit textual features. Moreover, only few studies have included non-textual features such as emojis for election forecasts. In this study, we incorporated N-gram features to predict vote shares of 2017 Uttar Pradesh (UP) legislative elections. Also, sentiment distribution of tweets containing emojis was significantly different from tweets without emojis. Therefore, emoji sentiments were detected and incorporated to predict the vote shares. We collected more than 0.3 million tweets, wherein geo-tagging was applied on search keywords that were not exclusive to elections. We employed seven lexicons for labelling tweets and compared two methods to reduce prediction error: sentiment magnitude-based criteria and polarity of tweets. Results show that proposed method of incorporating N-gram features and emoji sentiments significantly decreases prediction error.","PeriodicalId":39041,"journal":{"name":"International Journal of Web Based Communities","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Lexicon Based Twitter Sentiment Analysis for Vote Share Prediction Using Emoji and N-gram Features\",\"authors\":\"Barkha Bansal, S. Srivastava\",\"doi\":\"10.1504/IJWBC.2019.10018048\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, Twitter sentiment analysis (TSA) has been successfully employed to monitor and forecast elections in many studies. However, most of the existing studies rely on extracting sentiments from explicit textual features. Moreover, only few studies have included non-textual features such as emojis for election forecasts. In this study, we incorporated N-gram features to predict vote shares of 2017 Uttar Pradesh (UP) legislative elections. Also, sentiment distribution of tweets containing emojis was significantly different from tweets without emojis. Therefore, emoji sentiments were detected and incorporated to predict the vote shares. We collected more than 0.3 million tweets, wherein geo-tagging was applied on search keywords that were not exclusive to elections. We employed seven lexicons for labelling tweets and compared two methods to reduce prediction error: sentiment magnitude-based criteria and polarity of tweets. Results show that proposed method of incorporating N-gram features and emoji sentiments significantly decreases prediction error.\",\"PeriodicalId\":39041,\"journal\":{\"name\":\"International Journal of Web Based Communities\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Web Based Communities\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1504/IJWBC.2019.10018048\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Web Based Communities","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJWBC.2019.10018048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 19

摘要

最近,推特情绪分析(TSA)在许多研究中被成功地用于监测和预测选举。然而,现有的研究大多依赖于从明确的文本特征中提取情感。此外,只有少数研究包含了非文本特征,如用于选举预测的表情符号。在这项研究中,我们结合了N-gram特征来预测2017年北方邦立法选举的选票份额。此外,包含表情符号的推文的情绪分布与没有表情符号的微博明显不同。因此,表情符号情绪被检测并被纳入预测投票份额。我们收集了超过30万条推文,其中对非选举专用的搜索关键词进行了地理标记。我们使用了七个词典来标记推文,并比较了两种减少预测误差的方法:基于情绪大小的标准和推文的极性。结果表明,所提出的将N符号特征和表情符号情感相结合的方法显著降低了预测误差。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Lexicon Based Twitter Sentiment Analysis for Vote Share Prediction Using Emoji and N-gram Features
Recently, Twitter sentiment analysis (TSA) has been successfully employed to monitor and forecast elections in many studies. However, most of the existing studies rely on extracting sentiments from explicit textual features. Moreover, only few studies have included non-textual features such as emojis for election forecasts. In this study, we incorporated N-gram features to predict vote shares of 2017 Uttar Pradesh (UP) legislative elections. Also, sentiment distribution of tweets containing emojis was significantly different from tweets without emojis. Therefore, emoji sentiments were detected and incorporated to predict the vote shares. We collected more than 0.3 million tweets, wherein geo-tagging was applied on search keywords that were not exclusive to elections. We employed seven lexicons for labelling tweets and compared two methods to reduce prediction error: sentiment magnitude-based criteria and polarity of tweets. Results show that proposed method of incorporating N-gram features and emoji sentiments significantly decreases prediction error.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Web Based Communities
International Journal of Web Based Communities Social Sciences-Communication
CiteScore
2.00
自引率
0.00%
发文量
30
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信