Aspect-Based Sentiment Analysis for Afaan Oromoo Movie Reviews Using Machine Learning Techniques

IF 2.4 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Obsa Gelchu Horsa, K. K. Tune
Journal: Applied Computational Intelligence and Soft Computing (volume listed as "22 4")
Published: 2023-12-07 (Journal Article)
DOI: 10.1155/2023/3462691 (https://doi.org/10.1155/2023/3462691)
Citations: 0

Abstract

Aspect-based sentiment analysis (ABSA) is a subfield of natural language processing that splits text into aspects and then extracts the sentiment polarity as positive, negative, or neutral. ABSA has been widely investigated and developed for many resource-rich languages such as English and French. However, little work has been done on indigenous African languages like Afaan Oromoo at either the document or the sentence level. In this paper, ABSA for Afaan Oromoo movie reviews was investigated and developed. To this end, 2800 Afaan Oromoo movie reviews were collected from YouTube using the YouTube Data API. After data preprocessing, predetermined aspects of the Afaan Oromoo movies were extracted and labeled as positive or negative by domain experts. For implementation, several machine learning algorithms, namely random forest, logistic regression, SVM, and multinomial naïve Bayes, were applied in combination with bag-of-words (BoW) and TF-IDF features. The system was evaluated using accuracy, precision, recall, and F1-score. Random forest reached 88% accuracy with both BoW and TF-IDF. SVM reached 88% with BoW and 87% with TF-IDF. Logistic regression reached 87% with both BoW and TF-IDF. Multinomial naïve Bayes reached 88% with both BoW and TF-IDF. To improve these performance metrics, different hyperparameter tuning settings were applied, and the implementation results show that the best values of the models' evaluation metrics were obtained with these settings.
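
The feature representations and evaluation metrics named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not state which library, tokenizer, or TF-IDF variant was used, so whitespace tokenization and the unsmoothed textbook weight tf × log(N / df) are assumptions, and the function names (`bow`, `tfidf`, `binary_metrics`) and the placeholder documents are hypothetical.

```python
import math
from collections import Counter


def bow(docs):
    """Bag-of-words: one term-count Counter per document.

    Assumption: lowercase + whitespace tokenization (the paper's
    preprocessing is not described in the abstract).
    """
    return [Counter(doc.lower().split()) for doc in docs]


def tfidf(docs):
    """TF-IDF weights using the textbook idf = log(N / df).

    Assumption: no smoothing or normalization; libraries often apply both.
    """
    counts = bow(docs)
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for c in counts for term in c)
    return [
        {term: tf * math.log(n_docs / df[term]) for term, tf in c.items()}
        for c in counts
    ]


def binary_metrics(y_true, y_pred, positive="positive"):
    """Accuracy, precision, recall, and F1 for one designated positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    accuracy = sum(t == p for t, p in pairs) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Placeholder tokens only; these are NOT reviews from the paper's dataset.
    docs = ["good acting good story", "bad story"]
    print(bow(docs))
    print(tfidf(docs))
    print(binary_metrics(
        ["positive", "positive", "positive", "negative"],
        ["positive", "positive", "negative", "negative"],
    ))
```

In a setup like the one the abstract describes, vectors of this kind would be fed to each classifier, and the metrics would be computed on held-out reviews. A term that occurs in every document gets weight zero under this unsmoothed idf, which is one reason library implementations usually add smoothing.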
Source journal
Applied Computational Intelligence and Soft Computing (Computer Science, Artificial Intelligence)
CiteScore: 6.10
Self-citation rate: 3.40%
Articles published: 59
Review time: 21 weeks
About the journal: Applied Computational Intelligence and Soft Computing focuses on the disciplines of computer science, engineering, and mathematics. Its scope includes developing applications across the natural and social sciences that employ the technologies of computational intelligence and soft computing. Although computational intelligence and soft computing are established fields, their new applications are still in development and can be regarded as an emerging field, which is the focus of this journal.