Sentiment Analysis on Bengali Movie Reviews using Multinomial Naïve Bayes

Rifat Rahman, Md. Abdul Masud, Raonak Jahan Mimi, Mst. Nusrat Sultana Dina
{"title":"Sentiment Analysis on Bengali Movie Reviews using Multinomial Naïve Bayes","authors":"Rifat Rahman, Md. Abdul Masud, Raonak Jahan Mimi, Mst. Nusrat Sultana Dina","doi":"10.1109/ICCIT54785.2021.9689787","DOIUrl":null,"url":null,"abstract":"Opinion mining of consumers has become one of the undeniable approaches for finding gaps in businesses’ marketing strategies. For today’s gently growing film industry of Bangladesh, sentiment mining of viewers feedback on their specified work has become inevitably a dire need for the production company and also for the audiences to take decision about watching a film. With the availability of such a great extent of e-content on film reviews, it becomes handy to analyze the viewers’ sentiment on any film. Because of the lack of structured research work on Bengali movie review sentiment analysis, we are interested to focus this kind of research work. We collect 3000 Bengali movie reviews and extract TF-IDF features by using uni-gram, bi-gram, and tri-gram models. We use several machine learning classifiers for performing classification solutions on extracted TF-IDF features of the corpus. Experimental results show that the Multinomial Naïve Bayes classifier provides the highest accuracy, 86%, on uni-gram features of the validation data.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 24th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT54785.2021.9689787","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Opinion mining of consumers has become one of the undeniable approaches for finding gaps in businesses’ marketing strategies. For today’s gently growing film industry of Bangladesh, sentiment mining of viewers feedback on their specified work has become inevitably a dire need for the production company and also for the audiences to take decision about watching a film. With the availability of such a great extent of e-content on film reviews, it becomes handy to analyze the viewers’ sentiment on any film. Because of the lack of structured research work on Bengali movie review sentiment analysis, we are interested to focus this kind of research work. We collect 3000 Bengali movie reviews and extract TF-IDF features by using uni-gram, bi-gram, and tri-gram models. We use several machine learning classifiers for performing classification solutions on extracted TF-IDF features of the corpus. Experimental results show that the Multinomial Naïve Bayes classifier provides the highest accuracy, 86%, on uni-gram features of the validation data.
基于多项式Naïve贝叶斯的孟加拉语影评情感分析
消费者意见挖掘已成为企业营销策略中发现差距的不可否认的方法之一。对于今天缓慢发展的孟加拉国电影工业来说,挖掘观众对他们特定作品的反馈已经不可避免地成为制作公司的迫切需要,也成为观众决定是否观看电影的迫切需要。随着影评电子内容的普及,分析观众对任何一部电影的看法都变得很方便。由于孟加拉语影评情感分析缺乏结构化的研究工作,我们有兴趣关注这类研究工作。我们收集了3000篇孟加拉语电影评论,并使用一元图、双元图和三元图模型提取TF-IDF特征。我们使用几个机器学习分类器对提取的语料库的TF-IDF特征执行分类解决方案。实验结果表明,多项式Naïve贝叶斯分类器对验证数据的单图特征的准确率最高,达到86%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信