Attention based convolutional residual squeeze excited capsule network for aspect based sentiment classification in Malayalam movie reviews

IF 3.4 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Computer Speech and Language Pub Date : 2026-10-01 Epub Date: 2026-02-03 DOI:10.1016/j.csl.2026.101952

Sharika TR , Julia Punithamalar Dhas

{"title":"Attention based convolutional residual squeeze excited capsule network for aspect based sentiment classification in Malayalam movie reviews","authors":"Sharika TR , Julia Punithamalar Dhas","doi":"10.1016/j.csl.2026.101952","DOIUrl":null,"url":null,"abstract":"<div><div>One of the main functions of Natural Language Processing (NLP) is sentiment analysis, which extracts attitudes, ideas, views or judgments about a given topic. The Internet is a vast and unstructured information source full of text documents, including evaluations and opinions. Firstly, the input texts are pre-processed using an efficient NLP method such as tokenization, stemming, removal of empty sets, stop words removal and morphological segmentation. These pre-processed texts serve as the input for the feature extraction stage. Using the three methods of Improved Term Frequency-Inverse Document Frequency (ITF-IDF), Latent Semantic Analysis (LSA) and Extended Bidirectional Encoder Representations from Transformers (E-BERT), the review-based features are extracted. Aspect-based features are extracted from the review text using the Aspect Related Feature (ARF) extraction method. By enhancing term weights with improved frequency scaling, the model improves on regular TF-IDF and includes more subtle contextual meanings and relationships with words. Finally, applying both types of features, a new Attention-based Convolutional Residual Squeeze Excited Capsule Network (A-CR-SECapNet) model is created to classify sentiment polarities as positive, negative and neutral. The Convolutional Residual Module captures spatial relationships to learn deeper networks that mitigate vanishing gradients. The SE Module improves the attentiveness of the network by dynamically reweighting the channel-wise information from features that correlate with important sentiment variables. The CapNet preserves the spatial relationships between words to maintain the dependence of sentiment between features. Finally, the performance of the model is further improved by fine-tuning the parameters using the Modified Gazelle Optimization (MGO) optimization method. In the results section, the proposed model is compared to the existing model in terms of precision, f1-score, accuracy, recall, mean absolute error (MSE) and mean absolute percentage error (MAPE). The proposed model produced the best results, demonstrating its superiority.</div></div>","PeriodicalId":50638,"journal":{"name":"Computer Speech and Language","volume":"100 ","pages":"Article 101952"},"PeriodicalIF":3.4000,"publicationDate":"2026-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Speech and Language","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S088523082600015X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/2/3 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

One of the main functions of Natural Language Processing (NLP) is sentiment analysis, which extracts attitudes, ideas, views or judgments about a given topic. The Internet is a vast and unstructured information source full of text documents, including evaluations and opinions. Firstly, the input texts are pre-processed using an efficient NLP method such as tokenization, stemming, removal of empty sets, stop words removal and morphological segmentation. These pre-processed texts serve as the input for the feature extraction stage. Using the three methods of Improved Term Frequency-Inverse Document Frequency (ITF-IDF), Latent Semantic Analysis (LSA) and Extended Bidirectional Encoder Representations from Transformers (E-BERT), the review-based features are extracted. Aspect-based features are extracted from the review text using the Aspect Related Feature (ARF) extraction method. By enhancing term weights with improved frequency scaling, the model improves on regular TF-IDF and includes more subtle contextual meanings and relationships with words. Finally, applying both types of features, a new Attention-based Convolutional Residual Squeeze Excited Capsule Network (A-CR-SECapNet) model is created to classify sentiment polarities as positive, negative and neutral. The Convolutional Residual Module captures spatial relationships to learn deeper networks that mitigate vanishing gradients. The SE Module improves the attentiveness of the network by dynamically reweighting the channel-wise information from features that correlate with important sentiment variables. The CapNet preserves the spatial relationships between words to maintain the dependence of sentiment between features. Finally, the performance of the model is further improved by fine-tuning the parameters using the Modified Gazelle Optimization (MGO) optimization method. In the results section, the proposed model is compared to the existing model in terms of precision, f1-score, accuracy, recall, mean absolute error (MSE) and mean absolute percentage error (MAPE). The proposed model produced the best results, demonstrating its superiority.

查看原文本刊更多论文

基于注意的卷积残差挤激胶囊网络在马来语影评中基于方面的情感分类

自然语言处理（NLP）的主要功能之一是情感分析，即提取对给定主题的态度、想法、观点或判断。互联网是一个巨大的非结构化信息源，充满了文本文档，包括评估和意见。首先，使用高效的自然语言处理方法对输入文本进行预处理，如标记化、词干提取、空集去除、停止词去除和形态分割。这些预处理文本作为特征提取阶段的输入。采用改进词频-逆文档频率（ITF-IDF）、潜在语义分析（LSA）和扩展双向编码器表示（E-BERT）三种方法提取基于评论的特征。使用方面相关特征（Aspect Related Feature， ARF）提取方法从评审文本中提取基于方面的特征。通过改进频率缩放来增强术语权重，该模型改进了常规TF-IDF，并包含了更微妙的上下文含义和与单词的关系。最后，应用这两种类型的特征，创建了一个新的基于注意力的卷积残余挤压兴奋胶囊网络（a - cr - secapnet）模型，将情绪极性分为积极、消极和中性。卷积残差模块捕获空间关系来学习更深层的网络，以减轻消失的梯度。SE模块通过动态地重新加权与重要情绪变量相关的特征的通道信息来提高网络的注意力。CapNet通过保留词间的空间关系来保持特征间情感的依赖性。最后，采用修正瞪羚优化（Modified Gazelle Optimization， MGO）优化方法对模型参数进行微调，进一步提高了模型的性能。在结果部分，将提出的模型与现有模型在精度、f1-score、准确率、召回率、平均绝对误差（MSE）和平均绝对百分比误差（MAPE）方面进行比较。所提出的模型得到了最好的结果，证明了它的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer Speech and Language 工程技术-计算机：人工智能

CiteScore

11.30

自引率

4.70%

发文量

审稿时长

22.9 weeks

期刊介绍： Computer Speech & Language publishes reports of original research related to the recognition, understanding, production, coding and mining of speech and language. The speech and language sciences have a long history, but it is only relatively recently that large-scale implementation of and experimentation with complex models of speech and language processing has become feasible. Such research is often carried out somewhat separately by practitioners of artificial intelligence, computer science, electronic engineering, information retrieval, linguistics, phonetics, or psychology.