Sentiment Analysis using Deep Learning: A Domain Independent Approach

2023 Second International Conference on Electronics and Renewable Systems (ICEARS) Pub Date : 2023-03-02 DOI:10.1109/ICEARS56392.2023.10085676

Mohammad Qamar, Hamnah Rao, Sheikh Afaan Farooq, Ajatray Swagat Bhuyan

{"title":"Sentiment Analysis using Deep Learning: A Domain Independent Approach","authors":"Mohammad Qamar, Hamnah Rao, Sheikh Afaan Farooq, Ajatray Swagat Bhuyan","doi":"10.1109/ICEARS56392.2023.10085676","DOIUrl":null,"url":null,"abstract":"The practice of finding emotion embedded in textual data is known as sentiment analysis, sometimes known as opinion mining. Various sentiment analysis algorithms, including classic Machine Learning models and Deep Learning models, have been suggested up until now. Some Machine Learning-based models, such as Naive Bayes, Decision Tree, SVM, and others, have demonstrated exceptional performance in sentiment categorization. Although Machine Learning algorithms have demonstrated high performance, they are constrained by the quantity of the dataset employed and include feature extraction tasks, which are time demanding. As a result, this study considers Deep Learning (DL)-based models, which include automated feature extraction and can handle massive amounts of data. One of the major issues with existing sentiment analysis models is that they are domain-dependent; hence, if there is a dataset available from a domain on which the model was not trained on, its accuracy is significantly reduced. To make the model domain agnostic, it is trained on datasets from three distinct domains: Twitter US Airline Review dataset, the IMDb Movie Review dataset, and the US Presidential Election dataset. The suggested sentiment analysis model is trained on five different deep learning models: CNN-GRU, CNN-LSTM, CNN, LSTM and GRU. The model's performance was evaluated using test data from three datasets on which the model was trained, as well as a fresh book review dataset scraped from the Amazon website.","PeriodicalId":338611,"journal":{"name":"2023 Second International Conference on Electronics and Renewable Systems (ICEARS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Second International Conference on Electronics and Renewable Systems (ICEARS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEARS56392.2023.10085676","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

The practice of finding emotion embedded in textual data is known as sentiment analysis, sometimes known as opinion mining. Various sentiment analysis algorithms, including classic Machine Learning models and Deep Learning models, have been suggested up until now. Some Machine Learning-based models, such as Naive Bayes, Decision Tree, SVM, and others, have demonstrated exceptional performance in sentiment categorization. Although Machine Learning algorithms have demonstrated high performance, they are constrained by the quantity of the dataset employed and include feature extraction tasks, which are time demanding. As a result, this study considers Deep Learning (DL)-based models, which include automated feature extraction and can handle massive amounts of data. One of the major issues with existing sentiment analysis models is that they are domain-dependent; hence, if there is a dataset available from a domain on which the model was not trained on, its accuracy is significantly reduced. To make the model domain agnostic, it is trained on datasets from three distinct domains: Twitter US Airline Review dataset, the IMDb Movie Review dataset, and the US Presidential Election dataset. The suggested sentiment analysis model is trained on five different deep learning models: CNN-GRU, CNN-LSTM, CNN, LSTM and GRU. The model's performance was evaluated using test data from three datasets on which the model was trained, as well as a fresh book review dataset scraped from the Amazon website.

查看原文本刊更多论文

使用深度学习的情感分析:一种领域独立的方法

在文本数据中寻找情感的做法被称为情感分析，有时也被称为观点挖掘。到目前为止，已经提出了各种情感分析算法，包括经典的机器学习模型和深度学习模型。一些基于机器学习的模型，如朴素贝叶斯、决策树、支持向量机等，在情感分类中表现出优异的性能。尽管机器学习算法已经证明了高性能，但它们受到所使用的数据集数量的限制，并且包括需要时间的特征提取任务。因此，本研究考虑了基于深度学习(DL)的模型，其中包括自动特征提取并可以处理大量数据。现有情感分析模型的主要问题之一是它们依赖于领域;因此，如果有一个可用的数据集，而模型并没有在这个数据集上进行训练，那么它的准确性就会大大降低。为了使模型领域不可知，它在三个不同领域的数据集上进行训练:Twitter美国航空公司评论数据集、IMDb电影评论数据集和美国总统选举数据集。提出的情感分析模型在CNN-GRU、CNN-LSTM、CNN、LSTM和GRU五种不同的深度学习模型上进行训练。该模型的性能使用来自三个数据集的测试数据进行评估，这些数据集是模型训练的基础，以及从亚马逊网站上抓取的新书评数据集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 Second International Conference on Electronics and Renewable Systems (ICEARS)

自引率

0.00%

发文量