Ariyo Oluwasanmi, Shokanbi Akeem, Jackson Jehoaida, Muhammad Umar Aftab, N. Hundera, Bulbula Kumeda, Edward Baagere, Zhiguang Qin
{"title":"Sequential Multi-Kernel Convolutional Recurrent Network for Sentiment Classification","authors":"Ariyo Oluwasanmi, Shokanbi Akeem, Jackson Jehoaida, Muhammad Umar Aftab, N. Hundera, Bulbula Kumeda, Edward Baagere, Zhiguang Qin","doi":"10.1109/ICSESS47205.2019.9040746","DOIUrl":null,"url":null,"abstract":"The emergence of deep learning as a commanding technique for learning heterogeneous layers of feature representations have consequently substituted traditional machine learning algorithms which are generally poor in analyzing compound sentences. Additionally, convolutional and recurrent neural networks have auspiciously yielded state-of-the-art results in sentiment classification and Natural Language Processing (NLP). In this paper, a deep sentiment representation model through the combination of multiple Convolutional Neural Networks (CNN) kernels with Long Short-Term Memory (LSTM) is proposed for sentiment classification. Our model gains word vector representation using pre-trained Global Vectors for Word Representation (GloVe) embeddings, thereafter used as input to the CNN layer which extracts higher local text representations. Finally, Bidirectional LSTM (biLSTM) generates sentiment classification of sentence representation based on context dependent features. Our combined approach of CNN and biLSTM was experimented using the Stanford Large Movie Review Dataset (IMDB) and Stanford Sentiment Treebank Dataset (SSTB) for binary classification. The evaluation achieves outstanding results in outperforming several existing approaches with 90.4% accuracy on the Stanford Sentiment Treebank dataset and 94.8% accuracy on the Stanford Large Movie Review dataset. These results are achieved with a drastic reduction of model parameters and without a pooling layer in the CNN architecture, helping to retain local and structural information in comparison to other existing deep neural network frameworks. (Abstract)","PeriodicalId":203944,"journal":{"name":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS47205.2019.9040746","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The emergence of deep learning as a commanding technique for learning heterogeneous layers of feature representations have consequently substituted traditional machine learning algorithms which are generally poor in analyzing compound sentences. Additionally, convolutional and recurrent neural networks have auspiciously yielded state-of-the-art results in sentiment classification and Natural Language Processing (NLP). In this paper, a deep sentiment representation model through the combination of multiple Convolutional Neural Networks (CNN) kernels with Long Short-Term Memory (LSTM) is proposed for sentiment classification. Our model gains word vector representation using pre-trained Global Vectors for Word Representation (GloVe) embeddings, thereafter used as input to the CNN layer which extracts higher local text representations. Finally, Bidirectional LSTM (biLSTM) generates sentiment classification of sentence representation based on context dependent features. Our combined approach of CNN and biLSTM was experimented using the Stanford Large Movie Review Dataset (IMDB) and Stanford Sentiment Treebank Dataset (SSTB) for binary classification. The evaluation achieves outstanding results in outperforming several existing approaches with 90.4% accuracy on the Stanford Sentiment Treebank dataset and 94.8% accuracy on the Stanford Large Movie Review dataset. These results are achieved with a drastic reduction of model parameters and without a pooling layer in the CNN architecture, helping to retain local and structural information in comparison to other existing deep neural network frameworks. (Abstract)
深度学习作为学习异构层特征表示的主要技术的出现,因此取代了传统的机器学习算法,这些算法通常在分析复合句方面很差。此外,卷积和递归神经网络在情感分类和自然语言处理(NLP)方面取得了可喜的进展。本文提出了一种将多个卷积神经网络(CNN)核与长短期记忆(LSTM)相结合的深度情感表示模型,用于情感分类。我们的模型使用预训练的全局词向量(Global Vectors for word representation, GloVe)嵌入来获得词向量表示,然后作为CNN层的输入,CNN层提取更高的局部文本表示。最后,双向LSTM (biLSTM)基于上下文相关特征生成句子表示的情感分类。我们使用斯坦福大型电影评论数据集(IMDB)和斯坦福情感树库数据集(SSTB)对CNN和biLSTM的组合方法进行了二值分类实验。该评估取得了出色的结果,优于几种现有的方法,在斯坦福情感树库数据集上的准确率为90.4%,在斯坦福大型电影评论数据集上的准确率为94.8%。这些结果是通过大幅减少模型参数实现的,并且在CNN架构中没有池化层,与其他现有的深度神经网络框架相比,有助于保留局部和结构信息。(抽象)