ML-SPD: Machine Learning based Sentiment Polarity Detection

Jelena Graovac, M. Radović, Berna Altinel Girgin
{"title":"ML-SPD: Machine Learning based Sentiment Polarity Detection","authors":"Jelena Graovac, M. Radović, Berna Altinel Girgin","doi":"10.1109/INISTA49547.2020.9194633","DOIUrl":null,"url":null,"abstract":"Internet revolution creates very important trends in people's life like news-portals, online-education, home-offices, online shopping, social media, etc. Without any controversy, social media is one of the most important outcomes of the Web. Today, social media is more than a communication channel where people have the opportunity to express their feelings, write their comments on microblogging sites, discussion groups, review sites, etc. These common habits have resulted in two important consequences: 1) Accumulation of very huge data on online platforms, 2) The requirement of automatic systems to classify these accumulated big data by subjective and sentimentally. In many cases, Sentiment Polarity Detection (SPD) in text may be an urgent requirement, rather than identifying the subject of the text. For instance, positively or negatively labeled product reviews may give sufficient summary information to readers about the review. In this study, to solve SPD problem we explore different text representation models in conjunction with state-of-the-art traditional Machine Learning techniques: Support Vector Machines (SVM), Neural Networks (NN), Nave Bayes (NB), and combination of NB and SVM classifier (NBSVM). We perform experiments on three publicly available benchmark movie review datasets in different languages: CornellPD in English, HUMIR in Turkish and SerbSPD-2C in Serbian. Experimental results confirm that the presented techniques achieve improvements over the previously published techniques applied to movie reviews datasets in Turkish and English. Developed software package “ML-SPD” is made publicly available to the research community so it can serve as a good baseline for future research.","PeriodicalId":124632,"journal":{"name":"2020 International Conference on INnovations in Intelligent SysTems and Applications (INISTA)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on INnovations in Intelligent SysTems and Applications (INISTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INISTA49547.2020.9194633","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Internet revolution creates very important trends in people's life like news-portals, online-education, home-offices, online shopping, social media, etc. Without any controversy, social media is one of the most important outcomes of the Web. Today, social media is more than a communication channel where people have the opportunity to express their feelings, write their comments on microblogging sites, discussion groups, review sites, etc. These common habits have resulted in two important consequences: 1) Accumulation of very huge data on online platforms, 2) The requirement of automatic systems to classify these accumulated big data by subjective and sentimentally. In many cases, Sentiment Polarity Detection (SPD) in text may be an urgent requirement, rather than identifying the subject of the text. For instance, positively or negatively labeled product reviews may give sufficient summary information to readers about the review. In this study, to solve SPD problem we explore different text representation models in conjunction with state-of-the-art traditional Machine Learning techniques: Support Vector Machines (SVM), Neural Networks (NN), Nave Bayes (NB), and combination of NB and SVM classifier (NBSVM). We perform experiments on three publicly available benchmark movie review datasets in different languages: CornellPD in English, HUMIR in Turkish and SerbSPD-2C in Serbian. Experimental results confirm that the presented techniques achieve improvements over the previously published techniques applied to movie reviews datasets in Turkish and English. Developed software package “ML-SPD” is made publicly available to the research community so it can serve as a good baseline for future research.
ML-SPD:基于机器学习的情感极性检测
互联网革命在人们的生活中创造了非常重要的趋势,如新闻门户、在线教育、家庭办公、网上购物、社交媒体等。毫无疑问,社交媒体是网络最重要的成果之一。今天,社交媒体不仅仅是一个沟通渠道,人们有机会表达自己的感受,在微博网站、讨论组、评论网站等上发表评论。这些共同的习惯导致了两个重要的后果:1)在网络平台上积累了非常庞大的数据,2)要求自动系统对这些积累的大数据进行主观和感性的分类。在许多情况下,文本中的情感极性检测(SPD)可能是一个迫切的需求,而不是识别文本的主题。例如,正面或负面标签的产品评论可能会给读者提供足够的总结信息。在本研究中,为了解决SPD问题,我们探索了不同的文本表示模型,并结合了最先进的传统机器学习技术:支持向量机(SVM)、神经网络(NN)、朴素贝叶斯(NB)以及NB和支持向量机分类器的组合(NBSVM)。我们在三个公开可用的不同语言的基准电影评论数据集上进行实验:英语的CornellPD,土耳其语的HUMIR和塞尔维亚语的SerbSPD-2C。实验结果证实,与先前发表的应用于土耳其语和英语电影评论数据集的技术相比,本文提出的技术取得了改进。开发的软件包“ML-SPD”公开提供给研究社区,因此它可以作为未来研究的良好基线。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信