{"title":"A systematic assessment of sentiment analysis models on iraqi dialect-based texts","authors":"Hafedh Hameed Hussein, Amir Lakizadeh","doi":"10.1016/j.sasc.2025.200203","DOIUrl":null,"url":null,"abstract":"<div><div>Social media allows individuals, groups, and companies to openly express their opinions, creating a rich resource for trend assessments through sentiment analysis. Sentiment Analysis (SA) uses natural language processing (NLP) to interpret these opinions from text. However, Arabic sentiment analysis faces challenges due to dialect variations, limited resources, and hidden sentiment words. This study proposes hybrid models combining Convolutional Neural Networks with Long Short-Term Memory called as CNN-LSTM, CNN with Gated Recurrent Unit called as CNN-GRU. and AraBERT, a deep transformer model, to enhance Iraqi sentiment analysis. These models were evaluated against various machine learning and deep learning models. For feature extraction, we utilized Continuous Bag of Words (CBOW) for deep learning models and BERT for the AraBERT model, while TF-IDF was used for machine learning models. According to the experimental results, the AraBERT model has been able to achieve superior performance and significantly improve the accuracy of sentiment analysis in case of Iraqi dialect-based texts.</div></div>","PeriodicalId":101205,"journal":{"name":"Systems and Soft Computing","volume":"7 ","pages":"Article 200203"},"PeriodicalIF":0.0000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Systems and Soft Computing","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772941925000213","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Social media allows individuals, groups, and companies to openly express their opinions, creating a rich resource for trend assessments through sentiment analysis. Sentiment Analysis (SA) uses natural language processing (NLP) to interpret these opinions from text. However, Arabic sentiment analysis faces challenges due to dialect variations, limited resources, and hidden sentiment words. This study proposes hybrid models combining Convolutional Neural Networks with Long Short-Term Memory called as CNN-LSTM, CNN with Gated Recurrent Unit called as CNN-GRU. and AraBERT, a deep transformer model, to enhance Iraqi sentiment analysis. These models were evaluated against various machine learning and deep learning models. For feature extraction, we utilized Continuous Bag of Words (CBOW) for deep learning models and BERT for the AraBERT model, while TF-IDF was used for machine learning models. According to the experimental results, the AraBERT model has been able to achieve superior performance and significantly improve the accuracy of sentiment analysis in case of Iraqi dialect-based texts.