{"title":"Sentiment Analysis on Large-Scale Covid-19 Tweets using Hybrid Convolutional LSTM Based on Naïve Bayes Sentiment Modeling","authors":"A. Chakraborty, Dipankar Das, A. Kolya","doi":"10.37936/ecti-cit.2023173.252549","DOIUrl":null,"url":null,"abstract":"Millions of lives were affected rapidly throughout the world when the Covid-19 outbreak spread by leaps and bounds. During this catastrophic period, people used to express their condolence as well as emotions through different social networks. In order to analyze the public comments on Twitter, an experimental approach is developed based on popular words regarding this pandemic. In this paper, various NLP-based research works are discussed on sentiment analysis, trend prediction, topic modeling, learning mechanisms, etc. Furthermore, the hybrid deep learning models are developed based on the Naïve Bayes sentiment model to predict the sentiment from the collected huge number of Coronavirus-related tweets. After performing the n-gram analysis, the Covid-19 specific words are extracted based on their popularity. The public sentiment trend has been analyzed using the extracted topics related to Covid-19 and the tweets are classified according to their sentiment scores. The distinguished sentiment ratings are assigned to the collected tweets based on their sentiment class. Then Convo-Sequential and Convo-Bidirectional long-short term networks are trained using fine-grained sentiment-rated tweets to categorize Covid-19 tweets into five different sentiment classes. Finally, our proposed Convo-Sequential and Convo-Bidirectional LSTM models achieved 84.52% and 85.03% of validation accuracy respectively for the first phase dataset whereas using the second phase dataset the models obtained the validation accuracy of 86.58% and 87.22% respectively.","PeriodicalId":38808,"journal":{"name":"Transactions on Electrical Engineering, Electronics, and Communications","volume":"169 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transactions on Electrical Engineering, Electronics, and Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37936/ecti-cit.2023173.252549","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
Millions of lives were affected rapidly throughout the world when the Covid-19 outbreak spread by leaps and bounds. During this catastrophic period, people used to express their condolence as well as emotions through different social networks. In order to analyze the public comments on Twitter, an experimental approach is developed based on popular words regarding this pandemic. In this paper, various NLP-based research works are discussed on sentiment analysis, trend prediction, topic modeling, learning mechanisms, etc. Furthermore, the hybrid deep learning models are developed based on the Naïve Bayes sentiment model to predict the sentiment from the collected huge number of Coronavirus-related tweets. After performing the n-gram analysis, the Covid-19 specific words are extracted based on their popularity. The public sentiment trend has been analyzed using the extracted topics related to Covid-19 and the tweets are classified according to their sentiment scores. The distinguished sentiment ratings are assigned to the collected tweets based on their sentiment class. Then Convo-Sequential and Convo-Bidirectional long-short term networks are trained using fine-grained sentiment-rated tweets to categorize Covid-19 tweets into five different sentiment classes. Finally, our proposed Convo-Sequential and Convo-Bidirectional LSTM models achieved 84.52% and 85.03% of validation accuracy respectively for the first phase dataset whereas using the second phase dataset the models obtained the validation accuracy of 86.58% and 87.22% respectively.