A robust hybrid approach with product context-aware learning and explainable AI for sentiment analysis in Amazon user reviews

IF 3.7 4区 管理学 Q2 BUSINESS
Ehtesham Hashmi, Sule Yildirim Yayilgan
{"title":"A robust hybrid approach with product context-aware learning and explainable AI for sentiment analysis in Amazon user reviews","authors":"Ehtesham Hashmi, Sule Yildirim Yayilgan","doi":"10.1007/s10660-024-09896-5","DOIUrl":null,"url":null,"abstract":"<p>In the ever-changing world of business, gaining valuable insights from customer perspectives is crucial. Consumer evaluations are crucial performance indicators for businesses seeking to enhance their impact. Cyberspace is expanding with an increasing volume of reviews, making it challenging to extract relevant information for desired products. This research explores sentiment analysis for Amazon product reviews in the domain of communication technology, utilizing four publicly available datasets. Sentiment analysis is frequently employed to support E-Commerce platforms in monitoring customer feedback on their products and striving to understand customer needs and preferences. Acknowledging that solely relying on user reviews is insufficient to achieve the best performance, we enhance our approach by incorporating additional context from product titles and headlines for a more comprehensive understanding of the learning algorithm. This paper utilizes three distinct embedding methods, including TF-IDF, Word2Vec, and FastText. FastText outperformed other embeddings when stacked with XGBoost and CatBoost, resulting in the FastXCatStack model. This model achieved accuracy scores of 0.93, 0.93, and 0.94 on mobile electronics, major appliances, and personal care appliances datasets respectively, and linear SVM showed an accuracy score of 0.91 on software reviews when combined with FastText. This research study also provides a comprehensive analysis of deep learning-based models, including approaches like LSTM, GRU, and convolutional neural networks as well as transformer-based models such as BERT, RoBERTa, and XLNET. In the concluding phase, interpretability modeling was applied using Local Interpretable Model-Agnostic Explanations and Latent Dirichlet Allocation to gain deeper insights into the model’s decision-making process.</p>","PeriodicalId":47264,"journal":{"name":"Electronic Commerce Research","volume":"54 1","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electronic Commerce Research","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1007/s10660-024-09896-5","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BUSINESS","Score":null,"Total":0}
引用次数: 0

Abstract

In the ever-changing world of business, gaining valuable insights from customer perspectives is crucial. Consumer evaluations are crucial performance indicators for businesses seeking to enhance their impact. Cyberspace is expanding with an increasing volume of reviews, making it challenging to extract relevant information for desired products. This research explores sentiment analysis for Amazon product reviews in the domain of communication technology, utilizing four publicly available datasets. Sentiment analysis is frequently employed to support E-Commerce platforms in monitoring customer feedback on their products and striving to understand customer needs and preferences. Acknowledging that solely relying on user reviews is insufficient to achieve the best performance, we enhance our approach by incorporating additional context from product titles and headlines for a more comprehensive understanding of the learning algorithm. This paper utilizes three distinct embedding methods, including TF-IDF, Word2Vec, and FastText. FastText outperformed other embeddings when stacked with XGBoost and CatBoost, resulting in the FastXCatStack model. This model achieved accuracy scores of 0.93, 0.93, and 0.94 on mobile electronics, major appliances, and personal care appliances datasets respectively, and linear SVM showed an accuracy score of 0.91 on software reviews when combined with FastText. This research study also provides a comprehensive analysis of deep learning-based models, including approaches like LSTM, GRU, and convolutional neural networks as well as transformer-based models such as BERT, RoBERTa, and XLNET. In the concluding phase, interpretability modeling was applied using Local Interpretable Model-Agnostic Explanations and Latent Dirichlet Allocation to gain deeper insights into the model’s decision-making process.

Abstract Image

采用产品情境感知学习和可解释人工智能的稳健混合方法,对亚马逊用户评论进行情感分析
在瞬息万变的商业世界中,从客户角度获得有价值的见解至关重要。消费者的评价是企业提升影响力的关键绩效指标。网络空间正在不断扩大,评论数量日益增多,这给提取所需产品的相关信息带来了挑战。本研究利用四个公开可用的数据集,探讨了通信技术领域亚马逊产品评论的情感分析。情感分析经常被用于支持电子商务平台监测客户对其产品的反馈,并努力了解客户的需求和偏好。我们认识到,仅仅依靠用户评论不足以实现最佳性能,因此我们通过从产品标题和标题中获取更多上下文来增强我们的方法,从而更全面地了解学习算法。本文采用了三种不同的嵌入方法,包括 TF-IDF、Word2Vec 和 FastText。当 FastText 与 XGBoost 和 CatBoost 叠加时,FastXCatStack 模型的表现优于其他嵌入方法。该模型在移动电子产品、主要电器和个人护理电器数据集上的准确率分别达到了 0.93、0.93 和 0.94,而线性 SVM 与 FastText 结合后,在软件评论上的准确率达到了 0.91。本研究还全面分析了基于深度学习的模型,包括 LSTM、GRU 和卷积神经网络等方法,以及 BERT、RoBERTa 和 XLNET 等基于转换器的模型。在最后阶段,利用本地可解释模型诊断解释和潜在德里希勒分配(Latent Dirichlet Allocation)应用了可解释性建模,以深入了解模型的决策过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.50
自引率
12.80%
发文量
99
期刊介绍: The Internet and the World Wide Web have brought a fundamental change in the way that individuals access data, information and services. Individuals have access to vast amounts of data, to experts and services that are not limited in time or space. This has forced business to change the way in which they conduct their commercial transactions with their end customers and with other businesses, resulting in the development of a global market through the Internet. The emergence of the Internet and electronic commerce raises many new research issues. The Electronic Commerce Research journal will serve as a forum for stimulating and disseminating research into all facets of electronic commerce - from research into core enabling technologies to work on assessing and understanding the implications of these technologies on societies, economies, businesses and individuals. The journal concentrates on theoretical as well as empirical research that leads to better understanding of electronic commerce and its implications. Topics covered by the journal include, but are not restricted to the following subjects as they relate to the Internet and electronic commerce: Dissemination of services through the Internet;Intelligent agents technologies and their impact;The global impact of electronic commerce;The economics of electronic commerce;Fraud reduction on the Internet;Mobile electronic commerce;Virtual electronic commerce systems;Application of computer and communication technologies to electronic commerce;Electronic market mechanisms and their impact;Auctioning over the Internet;Business models of Internet based companies;Service creation and provisioning;The job market created by the Internet and electronic commerce;Security, privacy, authorization and authentication of users and transactions on the Internet;Electronic data interc hange over the Internet;Electronic payment systems and electronic funds transfer;The impact of electronic commerce on organizational structures and processes;Supply chain management through the Internet;Marketing on the Internet;User adaptive advertisement;Standards in electronic commerce and their analysis;Metrics, measurement and prediction of user activity;On-line stock markets and financial trading;User devices for accessing the Internet and conducting electronic transactions;Efficient search techniques and engines on the WWW;Web based languages (e.g., HTML, XML, VRML, Java);Multimedia storage and distribution;Internet;Collaborative learning, gaming and work;Presentation page design techniques and tools;Virtual reality on the net and 3D visualization;Browsers and user interfaces;Web site management techniques and tools;Managing middleware to support electronic commerce;Web based education, and training;Electronic journals and publishing on the Internet;Legal issues, taxation and property rights;Modeling and design of networks to support Internet applications;Modeling, design and sizing of web site servers;Reliability of intensive on-line applications;Pervasive devices and pervasive computing in electronic commerce;Workflow for electronic commerce applications;Coordination technologies for electronic commerce;Personalization and mass customization technologies;Marketing and customer relationship management in electronic commerce;Service creation and provisioning. Audience: Academics and professionals involved in electronic commerce research and the application and use of the Internet. Managers, consultants, decision-makers and developers who value the use of electronic com merce research results. Special Issues: Electronic Commerce Research publishes from time to time a special issue of the devoted to a single subject area. If interested in serving as a guest editor for a special issue, please contact the Editor-in-Chief J. Christopher Westland at westland@uic.edu with a proposal for the special issue. Officially cited as: Electron Commer Res
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信