Text emotion mining on Twitter

Suboh Alkhushayni, Daniel C Zellmer, Ryan J DeBusk, Du’a Al-zaleq
{"title":"Text emotion mining on Twitter","authors":"Suboh Alkhushayni, Daniel C Zellmer, Ryan J DeBusk, Du’a Al-zaleq","doi":"10.1088/2633-1357/abc01e","DOIUrl":null,"url":null,"abstract":"Twitter has become a medium through which a substantial percentage of the global population communicates their feelings and reactions to current events. Emotion mining from text aims to capture these emotions by using a series of algorithms to evaluate the contents of each tweet. In this study, tweets that expressed at least one of seven basic emotions were collected. The resulting dataset was a corpus of 42,000 tweets with a balanced presence of each emotion. From this corpus a lexicon of roughly 40,000 words, each associated with a weighted vector corresponding to one of the emotions, was created. Next, different methods of identifying emotion in these ‘cleaned’ tweets were performed and evaluated. These methods included both lexically-based classification and supervised machine learning-based classification. Finally, an ensemble method involving several multi-class classifiers trained on unigram features of the lexicon was evaluated. This evaluation revealed that the ensemble method outperformed all other tested methods when tested on existing datasets as well as on the dataset created for this study.","PeriodicalId":93771,"journal":{"name":"IOP SciNotes","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IOP SciNotes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2633-1357/abc01e","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Twitter has become a medium through which a substantial percentage of the global population communicates their feelings and reactions to current events. Emotion mining from text aims to capture these emotions by using a series of algorithms to evaluate the contents of each tweet. In this study, tweets that expressed at least one of seven basic emotions were collected. The resulting dataset was a corpus of 42,000 tweets with a balanced presence of each emotion. From this corpus a lexicon of roughly 40,000 words, each associated with a weighted vector corresponding to one of the emotions, was created. Next, different methods of identifying emotion in these ‘cleaned’ tweets were performed and evaluated. These methods included both lexically-based classification and supervised machine learning-based classification. Finally, an ensemble method involving several multi-class classifiers trained on unigram features of the lexicon was evaluated. This evaluation revealed that the ensemble method outperformed all other tested methods when tested on existing datasets as well as on the dataset created for this study.
Twitter上的文本情感挖掘
推特已经成为一种媒介,全球很大一部分人通过它来交流他们对当前事件的感受和反应。从文本中挖掘情感的目的是通过使用一系列算法来评估每条推文的内容,从而捕捉这些情感。在这项研究中,收集了至少表达七种基本情绪之一的推文。由此产生的数据集是一个包含42000条推文的语料库,每种情绪的存在都是平衡的。从这个语料库中创建了大约40,000个单词的词典,每个单词都与对应于一种情绪的加权向量相关联。接下来,对这些“清理过的”推文中识别情绪的不同方法进行了执行和评估。这些方法包括基于词汇的分类和基于监督机器学习的分类。最后,对基于单图特征训练的多类分类器集成方法进行了评价。该评估表明,当在现有数据集以及为本研究创建的数据集上进行测试时,集成方法优于所有其他测试方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
审稿时长
14 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信