An Ensemble-Classifier Based Approach for Multiclass Emotion Classification of Short Text

Shivangi Chawla, Monica Mehrotra
{"title":"An Ensemble-Classifier Based Approach for Multiclass Emotion Classification of Short Text","authors":"Shivangi Chawla, Monica Mehrotra","doi":"10.1109/ICRITO.2018.8748757","DOIUrl":null,"url":null,"abstract":"The profusion of social media textual content coupled with emotion mining methodologies, present exciting opportunities for researches to unveil the hidden emotions behind these texts. Despite recent growth and development in the field of Textual Emotion Mining (TEM), previous studies of emotion classification mainly focused on the use of simple classifiers over Ekman (6 emotions) or Plutchik (8 emotions) emotion models. In this study, Parrott’s hierarchy of emotion is utilized to build three emotion-labelled datasets of tweets corresponding to three levels(primary, secondary and tertiary) of emotion categories. We then present an ensemble-classifier based approach for multiclass textual emotion classification problem. The ensemble was created using four diverse classifiers including naive bayes, multiclass SVM, logistic regression and SGDunder three algorithms bagging, boosting and voting, in order to constitute a promising model which combines the benefits of base classifiers. The experimental investigation over three crawled datasets of hashtag-annotated english tweets, showed promising results and indicated that the proposed ensemble-classifier based approach improved the performance of base learners. Also, voting proved to be most suitable and outperformed both bagging and boosting ensembles.","PeriodicalId":439047,"journal":{"name":"2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRITO.2018.8748757","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

The profusion of social media textual content coupled with emotion mining methodologies, present exciting opportunities for researches to unveil the hidden emotions behind these texts. Despite recent growth and development in the field of Textual Emotion Mining (TEM), previous studies of emotion classification mainly focused on the use of simple classifiers over Ekman (6 emotions) or Plutchik (8 emotions) emotion models. In this study, Parrott’s hierarchy of emotion is utilized to build three emotion-labelled datasets of tweets corresponding to three levels(primary, secondary and tertiary) of emotion categories. We then present an ensemble-classifier based approach for multiclass textual emotion classification problem. The ensemble was created using four diverse classifiers including naive bayes, multiclass SVM, logistic regression and SGDunder three algorithms bagging, boosting and voting, in order to constitute a promising model which combines the benefits of base classifiers. The experimental investigation over three crawled datasets of hashtag-annotated english tweets, showed promising results and indicated that the proposed ensemble-classifier based approach improved the performance of base learners. Also, voting proved to be most suitable and outperformed both bagging and boosting ensembles.
基于集成分类器的短文本多类情感分类方法
社交媒体文本内容的丰富,加上情感挖掘方法,为研究揭示这些文本背后隐藏的情感提供了令人兴奋的机会。尽管文本情感挖掘(TEM)领域近年来有所发展,但以往的情感分类研究主要集中在简单分类器对Ekman(6种情感)或Plutchik(8种情感)情感模型的使用上。在本研究中,利用Parrott的情感层次,构建了三个tweet的情感标记数据集,分别对应于情感类别的三个层次(一级、二级和三级)。然后,我们提出了一种基于集成分类器的多类文本情感分类方法。利用朴素贝叶斯、多类支持向量机、逻辑回归和sgd4种不同的分类器,在bagging、boosting和voting三种算法下,构建了一个集基分类器优点于一体的有前途的模型。在三个抓取的带标签注释的英文tweets数据集上进行的实验研究显示了令人满意的结果,并表明所提出的基于集成分类器的方法提高了基学习器的性能。此外,投票被证明是最合适的,表现优于套袋和提振组合。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信