Sentiment Analysis using a CNN-BiLSTM Deep Model Based on Attention Classification

Information-An International Interdisciplinary Journal Pub Date : 2023-09-15 DOI:10.47880/inf2603-02

Wang Yue, Li Lei

{"title":"Sentiment Analysis using a CNN-BiLSTM Deep Model Based on Attention Classification","authors":"Wang Yue, Li Lei","doi":"10.47880/inf2603-02","DOIUrl":null,"url":null,"abstract":"With the rapid development of the Internet, the number of social media and e-commerce platforms increased dramatically. Users from all over world share their comments and sentiments on the Internet become a new tradition. Applying natural language processing technology to analyze the text on the Internet for mining the emotional tendencies has become the main way in the social public opinion monitoring and the after-sale feedback of manufactory. Thus, the study on text sentiment analysis has shown important social significance and commercial value. Sentiment analysis is a hot research topic in the field of natural language processing and data mining in recent ten years. The paper starts with the topic of \"Sentiment Analysis using a CNN-BiLSTM deep model based on attention mechanism classification\". First, it conducts an in-depth investigation on the current research status and commonly used algorithms at home and abroad, and briefly introduces and analyzes the current mainstream sentiment analysis methods. As a direction of machine learning, deep learning has become a hot research topic in emotion classification in the field of natural language processing. This paper uses deep learning models to study the sentiment classification problem of short and long text sentiment classification tasks. The main research contents are as follows. Firstly, Traditional neural network based short text classification algorithms for sentiment classification is easy to find the errors. The feature dimension is too high, and the feature information of the pool layer is lost, which leads to the loss of the details of the emotion vocabulary. To solve this problem, the Word Vector Model (Word2vec), Bidirectional Long-term and Short-term Memory networks (BiLSTM) and convolutional neural network (CNN) are combined in Quora dataset. The experiment shows that the accuracy of CNN-BiLSTM model associated with Word2vec word embedding achieved 91.48%. This proves that the hybrid network model performs better than the single structure neural network in short text. Convolutional neural network (CNN) models use convolutional layers and maximum pooling or max-overtime pooling layers to extract higher-level features, while LSTM models can capture long- term dependencies between words hence are better used for text classification. However, even with the hybrid approach that leverages the powers of these two deep-learning models, the number of features to remember for classification remains huge, hence hindering the training process. Secondly, we propose an attention based CNN-BiLSTM hybrid model that capitalize on the advantages of LSTM and CNN with an additional attention mechanism in IMDB movie reviews dataset. In the experiment, under the control of single variable of Data volume and Epoch, the proposed hybrid model was compared with the results of various indicators including recall, precision, F1 score and accuracy of CNN, LSTM and CNN-LSTM in long text. When the data size was 13 k, the proposed model had the highest accuracy at 0.908, and the F1 score also showed the highest performance at 0.883. When the epoch value for obtaining the optimal accuracy of each model was 10 for CNN, 14 for LSTM, 5 for MLP and 15 epochs for CNN-LSTM, which took the longest learning time. The F1 score also showed the best performance of the proposed model at 0.906, and accuracy of the proposed model was the highest at 0.929. Finally, the experimental results show that the bidirectional long- and short-term memory convolutional neural network (BiLSTM-CNN) model based on attention mechanism can effectively improve the performance of sentiment classification of data sets when processing long-text sentiment classification tasks. Keywords: sentiment analysis, CNN, BiLSTM, attention mechanism, text classification","PeriodicalId":50362,"journal":{"name":"Information-An International Interdisciplinary Journal","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information-An International Interdisciplinary Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47880/inf2603-02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

With the rapid development of the Internet, the number of social media and e-commerce platforms increased dramatically. Users from all over world share their comments and sentiments on the Internet become a new tradition. Applying natural language processing technology to analyze the text on the Internet for mining the emotional tendencies has become the main way in the social public opinion monitoring and the after-sale feedback of manufactory. Thus, the study on text sentiment analysis has shown important social significance and commercial value. Sentiment analysis is a hot research topic in the field of natural language processing and data mining in recent ten years. The paper starts with the topic of "Sentiment Analysis using a CNN-BiLSTM deep model based on attention mechanism classification". First, it conducts an in-depth investigation on the current research status and commonly used algorithms at home and abroad, and briefly introduces and analyzes the current mainstream sentiment analysis methods. As a direction of machine learning, deep learning has become a hot research topic in emotion classification in the field of natural language processing. This paper uses deep learning models to study the sentiment classification problem of short and long text sentiment classification tasks. The main research contents are as follows. Firstly, Traditional neural network based short text classification algorithms for sentiment classification is easy to find the errors. The feature dimension is too high, and the feature information of the pool layer is lost, which leads to the loss of the details of the emotion vocabulary. To solve this problem, the Word Vector Model (Word2vec), Bidirectional Long-term and Short-term Memory networks (BiLSTM) and convolutional neural network (CNN) are combined in Quora dataset. The experiment shows that the accuracy of CNN-BiLSTM model associated with Word2vec word embedding achieved 91.48%. This proves that the hybrid network model performs better than the single structure neural network in short text. Convolutional neural network (CNN) models use convolutional layers and maximum pooling or max-overtime pooling layers to extract higher-level features, while LSTM models can capture long- term dependencies between words hence are better used for text classification. However, even with the hybrid approach that leverages the powers of these two deep-learning models, the number of features to remember for classification remains huge, hence hindering the training process. Secondly, we propose an attention based CNN-BiLSTM hybrid model that capitalize on the advantages of LSTM and CNN with an additional attention mechanism in IMDB movie reviews dataset. In the experiment, under the control of single variable of Data volume and Epoch, the proposed hybrid model was compared with the results of various indicators including recall, precision, F1 score and accuracy of CNN, LSTM and CNN-LSTM in long text. When the data size was 13 k, the proposed model had the highest accuracy at 0.908, and the F1 score also showed the highest performance at 0.883. When the epoch value for obtaining the optimal accuracy of each model was 10 for CNN, 14 for LSTM, 5 for MLP and 15 epochs for CNN-LSTM, which took the longest learning time. The F1 score also showed the best performance of the proposed model at 0.906, and accuracy of the proposed model was the highest at 0.929. Finally, the experimental results show that the bidirectional long- and short-term memory convolutional neural network (BiLSTM-CNN) model based on attention mechanism can effectively improve the performance of sentiment classification of data sets when processing long-text sentiment classification tasks. Keywords: sentiment analysis, CNN, BiLSTM, attention mechanism, text classification

查看原文本刊更多论文

基于注意力分类的CNN-BiLSTM深度模型情感分析

随着互联网的快速发展，社交媒体和电子商务平台的数量急剧增加。来自世界各地的用户在互联网上分享他们的评论和情绪成为一种新的传统。应用自然语言处理技术对网络文本进行分析，挖掘情感倾向，已成为社会舆情监测和厂商售后反馈的主要方式。因此，对文本情感分析的研究具有重要的社会意义和商业价值。情感分析是近十年来自然语言处理和数据挖掘领域的一个研究热点。本文以“基于注意机制分类的CNN-BiLSTM深度模型的情感分析”为主题。首先，对目前国内外的研究现状和常用算法进行了深入的调研，并对目前主流的情感分析方法进行了简要的介绍和分析。深度学习作为机器学习的一个方向，已经成为自然语言处理领域情感分类的研究热点。本文利用深度学习模型研究了短文本和长文本情感分类任务的情感分类问题。主要研究内容如下:首先，传统的基于神经网络的短文本情感分类算法容易发现错误。特征维数过高，丢失了池层的特征信息，导致情感词汇的细节丢失。为了解决这个问题，在Quora数据集中结合了词向量模型(Word2vec)、双向长短期记忆网络(BiLSTM)和卷积神经网络(CNN)。实验表明，与Word2vec词嵌入相关联的CNN-BiLSTM模型准确率达到91.48%。这证明了混合网络模型在短文本中的性能优于单一结构的神经网络。卷积神经网络(CNN)模型使用卷积层和最大池化或最大超时池化层来提取更高级的特征，而LSTM模型可以捕获词之间的长期依赖关系，因此更适合用于文本分类。然而，即使利用这两种深度学习模型的混合方法，需要记住的分类特征数量仍然很大，因此阻碍了训练过程。其次，我们提出了一种基于注意力的CNN- bilstm混合模型，该模型利用了LSTM和CNN的优势，并在IMDB电影评论数据集中增加了额外的注意力机制。在实验中，在Data volume和Epoch这一单一变量的控制下，将所提出的混合模型与CNN、LSTM和CNN-LSTM在长文本中的查全率、查准率、F1分数和准确率等多个指标的结果进行比较。当数据量为13 k时，该模型的准确率最高，为0.908,F1得分也最高，为0.883。当每个模型获得最优精度的epoch值为CNN为10,LSTM为14,MLP为5,CNN-LSTM为15 epoch时，其学习时间最长。F1得分为0.906时，模型表现最佳，准确率最高，为0.929。最后，实验结果表明，基于注意机制的双向长短期记忆卷积神经网络(BiLSTM-CNN)模型在处理长文情感分类任务时，能够有效提高数据集的情感分类性能。关键词:情感分析，CNN, BiLSTM，注意机制，文本分类

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information-An International Interdisciplinary Journal 工程技术-工程：综合

自引率

0.00%

发文量

审稿时长

8.3 months