Aspect-Based Sentiment Analysis in Beauty Product Reviews Using TF-IDF and SVM Algorithm

Nadira Putri Arthamevia, Adiwijaya, Mahendra Dwifebri Purbolaksono
{"title":"Aspect-Based Sentiment Analysis in Beauty Product Reviews Using TF-IDF and SVM Algorithm","authors":"Nadira Putri Arthamevia, Adiwijaya, Mahendra Dwifebri Purbolaksono","doi":"10.1109/ICoICT52021.2021.9527489","DOIUrl":null,"url":null,"abstract":"Product reviews are essential in e-commerce as they can help potential buyers make decisions prior to making purchases and help sellers get the measure of their products. A product can have thousands of reviews, making it burdensome for potential buyers and sellers to draw a conclusion from those abundant reviews. This research built a system that applies Aspect-based Sentiment Analysis (ABSA) with a dataset from product reviews on the Female Daily website. The system was built using TF-IDF as its feature extraction method combined with word bigram and word bigram. The Support Vector Machine (SVM) algorithm is used to classify the sentiments. This experiment results indicate that the preprocessing stage, especially the stemming and stopwords removal process are greatly affects the accuracy results. The choice of word N-gram is also crucial, where this research shows that the word unigram gives a higher accuracy than the word bigram. The final results of this research show that TF-IDF combined with word unigram and SVM with a linear kernel brings out the best accuracy, that is to say, 88.35%.","PeriodicalId":191671,"journal":{"name":"2021 9th International Conference on Information and Communication Technology (ICoICT)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 9th International Conference on Information and Communication Technology (ICoICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICoICT52021.2021.9527489","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Product reviews are essential in e-commerce as they can help potential buyers make decisions prior to making purchases and help sellers get the measure of their products. A product can have thousands of reviews, making it burdensome for potential buyers and sellers to draw a conclusion from those abundant reviews. This research built a system that applies Aspect-based Sentiment Analysis (ABSA) with a dataset from product reviews on the Female Daily website. The system was built using TF-IDF as its feature extraction method combined with word bigram and word bigram. The Support Vector Machine (SVM) algorithm is used to classify the sentiments. This experiment results indicate that the preprocessing stage, especially the stemming and stopwords removal process are greatly affects the accuracy results. The choice of word N-gram is also crucial, where this research shows that the word unigram gives a higher accuracy than the word bigram. The final results of this research show that TF-IDF combined with word unigram and SVM with a linear kernel brings out the best accuracy, that is to say, 88.35%.
基于TF-IDF和SVM算法的美容产品评论情感分析
产品评论在电子商务中是必不可少的,因为它们可以帮助潜在买家在购买之前做出决定,并帮助卖家获得产品的衡量标准。一个产品可能有成千上万的评论,这使得潜在的买家和卖家很难从这些大量的评论中得出结论。这项研究建立了一个应用基于方面的情感分析(ABSA)的系统,该系统使用的数据集来自《女性日报》网站上的产品评论。采用TF-IDF作为特征提取方法,结合词重图和词重图构建了该系统。使用支持向量机算法对情感进行分类。实验结果表明,预处理阶段,特别是词干提取和停词去除过程对结果的准确性有很大影响。单词N-gram的选择也很重要,这项研究表明,单词unigram比单词bigram具有更高的准确性。本研究的最终结果表明,TF-IDF结合词元图和带线性核的SVM得到的准确率最好,为88.35%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信