Hybrid Feature Vector Space based Ensemble Machine Learning Approach for Sentiment Analysis on Amazon Product Reviews

Md. Nazmul Islam, Mahmudul Hasan
{"title":"Hybrid Feature Vector Space based Ensemble Machine Learning Approach for Sentiment Analysis on Amazon Product Reviews","authors":"Md. Nazmul Islam, Mahmudul Hasan","doi":"10.1109/ICCIT54785.2021.9689876","DOIUrl":null,"url":null,"abstract":"In recent era, people are getting more attracted to micro blogs and social media to share their daily activities and express feelings and opinions. Machine learning based sentiment analysis becomes immensely popular to judge the feelings about a particular content on how positive or negative their feelings and opinions are before taking important decisions. In this paper, we propose an effective and combined machine learning approach with an enhanced hybrid feature vector space of latent concepts and external information features. The latent concepts are prepared by a supervised machine learning approach, and the external information features, estimating the quality of the information shared in the documents, are classified by the unsupervised rule-based learning approach. A Random Forest ensemble method has been utilized to build a classifier model, and some standard performance measures such as accuracy, precision, recall, f1-score and Cohen’s Kappa value have been taken into account to analyze the performance. The novelty of this paper lies in the hybridization of feature vector space of latent concepts and external information features along with the Random Forest ensemble classifier. Based on the analyses, the proposed approach outperforms its counterparts as well as provides better outcomes against other solo latent concept-oriented approaches.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 24th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT54785.2021.9689876","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In recent era, people are getting more attracted to micro blogs and social media to share their daily activities and express feelings and opinions. Machine learning based sentiment analysis becomes immensely popular to judge the feelings about a particular content on how positive or negative their feelings and opinions are before taking important decisions. In this paper, we propose an effective and combined machine learning approach with an enhanced hybrid feature vector space of latent concepts and external information features. The latent concepts are prepared by a supervised machine learning approach, and the external information features, estimating the quality of the information shared in the documents, are classified by the unsupervised rule-based learning approach. A Random Forest ensemble method has been utilized to build a classifier model, and some standard performance measures such as accuracy, precision, recall, f1-score and Cohen’s Kappa value have been taken into account to analyze the performance. The novelty of this paper lies in the hybridization of feature vector space of latent concepts and external information features along with the Random Forest ensemble classifier. Based on the analyses, the proposed approach outperforms its counterparts as well as provides better outcomes against other solo latent concept-oriented approaches.
基于混合特征向量空间的Amazon产品评论情感分析集成机器学习方法
在最近的时代,人们越来越多地被微博和社交媒体所吸引,分享他们的日常活动,表达感受和观点。基于机器学习的情感分析变得非常流行,它可以在做出重要决定之前判断对特定内容的感受,判断他们的感受和观点是积极的还是消极的。在本文中,我们提出了一种有效的组合机器学习方法,该方法具有增强的潜在概念和外部信息特征的混合特征向量空间。潜在概念通过监督机器学习方法准备,外部信息特征,估计文档中共享信息的质量,通过基于无监督规则的学习方法进行分类。采用随机森林集成方法构建分类器模型,并考虑准确率、精密度、召回率、f1-score和Cohen’s Kappa值等标准性能指标进行性能分析。本文的新颖之处在于将潜在概念的特征向量空间与外部信息特征结合随机森林集成分类器进行杂交。基于分析,所提出的方法优于其同行,并且与其他单独潜在概念导向的方法相比提供了更好的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信