Aspect oriented Sentiment classification of COVID-19 twitter data; an enhanced LDA based text analytic approach

Junaid Abdul Wahid, S. Hussain, Hailing Wang, Zhaoyang Wu, Lei Shi, Yufei Gao
{"title":"Aspect oriented Sentiment classification of COVID-19 twitter data; an enhanced LDA based text analytic approach","authors":"Junaid Abdul Wahid, S. Hussain, Hailing Wang, Zhaoyang Wu, Lei Shi, Yufei Gao","doi":"10.1109/ICCEAI52939.2021.00054","DOIUrl":null,"url":null,"abstract":"Social media has become one of the most important sources of information dissemination during crisis and pandemics. The unknown nature of these disasters makes it hard to analyze the comprehensive situational awareness through different aspects and sentiments to support authorities. Current aspect detection and sentiment analysis system largely relies on labelled data and also categorize the aspects manually. So, in this research, we proposed a hybrid text analytical framework to do aspect level public sentiments analysis. Our approach consists of three layers, first we extracted and clustered the aspects from the data by utilizing the widely used Latent dirichlet allocation (LDA) topic modelling, then we extracted the sentiments and label the dataset by using the linguistic inquiry and word count (LIWC) lexicon, then in third layer of our framework we mapped the aspects into sentiments and sentiments are then classified with well-known machine learning classifiers. Experiments with real dataset gives us promising results as compared to existing aspect oriented sentiment analysis approaches and our method with different variant of classifiers outperforms existing methods with highest F1 scores of 91 %.","PeriodicalId":331409,"journal":{"name":"2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCEAI52939.2021.00054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Social media has become one of the most important sources of information dissemination during crisis and pandemics. The unknown nature of these disasters makes it hard to analyze the comprehensive situational awareness through different aspects and sentiments to support authorities. Current aspect detection and sentiment analysis system largely relies on labelled data and also categorize the aspects manually. So, in this research, we proposed a hybrid text analytical framework to do aspect level public sentiments analysis. Our approach consists of three layers, first we extracted and clustered the aspects from the data by utilizing the widely used Latent dirichlet allocation (LDA) topic modelling, then we extracted the sentiments and label the dataset by using the linguistic inquiry and word count (LIWC) lexicon, then in third layer of our framework we mapped the aspects into sentiments and sentiments are then classified with well-known machine learning classifiers. Experiments with real dataset gives us promising results as compared to existing aspect oriented sentiment analysis approaches and our method with different variant of classifiers outperforms existing methods with highest F1 scores of 91 %.
面向方面的COVID-19 twitter数据情感分类一种基于LDA的增强文本分析方法
社交媒体已成为危机和大流行病期间最重要的信息传播来源之一。由于这些灾害的未知性质,很难通过不同的方面和情绪来分析综合的态势感知,以支持当局。目前的方面检测和情感分析系统在很大程度上依赖于标记数据,并且还需要人工对方面进行分类。因此,在本研究中,我们提出了一个混合文本分析框架来进行面向层面的民情分析。我们的方法由三层组成,首先我们利用广泛使用的Latent dirichlet allocation (LDA)主题建模从数据中提取和聚类方面,然后我们使用语言查询和单词计数(LIWC)词典提取情感并标记数据集,然后在我们的框架的第三层我们将方面映射到情感中,然后使用知名的机器学习分类器对情感进行分类。与现有的面向方面的情感分析方法相比,真实数据集的实验给了我们很有希望的结果,我们的方法使用不同的分类器变体,以91%的最高F1分数优于现有的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信