SENTIMENT ANALYSIS TECHNOLOGY FOR USER FEEDBACK SUPPORT IN E-COMMERCE SYSTEMS BASED ON MACHINE LEARNING

IF 0.2 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
S. Tchynetskyi, B. Polishchuk, V. Vysotska
{"title":"SENTIMENT ANALYSIS TECHNOLOGY FOR USER FEEDBACK SUPPORT IN E-COMMERCE SYSTEMS BASED ON MACHINE LEARNING","authors":"S. Tchynetskyi, B. Polishchuk, V. Vysotska","doi":"10.15588/1607-3274-2023-3-11","DOIUrl":null,"url":null,"abstract":"Context. The interaction between a company and its target audience has been studied for centuries. From the very beginning of commercial relations, the relationship between the service provider and the recipient has been valued almost above all else. Trade is built on trust and respect. The image of an entrepreneur is often more important than the product he sells. For hundreds of years, the relationship between the merchant and the buyer, the entrepreneur and the client has not lost its importance, and in the era of mass digitalization, the quality of the relationship between the company and the target audience of different sizes and professional feedback support with clients often start the success of e-business. To provide these additional tools and information technologies to help businessmen monitor e-business development opportunities in a specific location, as well as establish feedback with users through social networks and mass media. Obtaining such tools will significantly expand the vision of market opportunities for ebusiness, it will clarify which of them make sense to invest in, and which ones are not worth paying time for. Also see what idea has the future and what business model needs to be implemented/maintained/developed for the rapid development of territorial/interregional e-business. It will also help to understand which levers have the greatest effect for business changes: what not to touch, and what policies to change to ensure high speed in the implementation of the plan based on the analysis of relevant research results, for example, to receive: direct feedback from customers, the dynamics of changes in overall satisfaction or interest of the target audience and advantages/disadvantages from users using NLP analysis; support for the development of e-business in relation to the location of their enterprise and the best directions; – graphs of business development (improvement/deterioration) depending on the content of comments.
 Objective of the study is to develop information technology to support the development of e-business by analyzing business locations, processing feedback from users, analyzing and classifying customer feedback in real time from social networks: Twitter, Reddit, Facebook and others using deep learning and Natural methods. Language Processing of Ukrainian-speaking and Englishspeaking texts.
 Method. NLP-methods were used to analyze the opinions of users and customers. Among the methods of implementing the main functions of English-language news classification, the following machine learning methods are used: naive Bayesian classifier, logistic regression, and the method of support vectors. The Naive Bayes algorithm was used to classify Ukrainian-language user feedback, as it performs well on small amounts of data, is easy to train and operate, and works well with text data. Naive Bayes classifier is a very good option for our system and considering that the number of responses in the dataset is smaller compared to the averages.
 Results. A machine learning model was developed for the analysis and classification of Ukrainian- and English-language reviews from users of e-commerce systems.
 Conclusions. The created model shows excellent classification results on test data. The overall accuracy of the sentimental model for the analysis of Ukrainian-language content is quite satisfactory, 92.3%. The logistic regression method coped best with the task of analyzing the impact of English-language news on the financial market, which showed an accuracy of 75.67%. This is certainly not the desired result, but it is the largest indicator of all considered. The support vector method (SVM) coped somewhat worse with the task, which showed an accuracy of 72.78%, which is a slightly worse result than the one obtained thanks to the logistic regression method. And the naïve Bayesian classifier method did the worst with the task, which achieved an accuracy of 71.13%, which is less than the two previous methods.","PeriodicalId":43783,"journal":{"name":"Radio Electronics Computer Science Control","volume":"32 1","pages":"0"},"PeriodicalIF":0.2000,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radio Electronics Computer Science Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15588/1607-3274-2023-3-11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

Context. The interaction between a company and its target audience has been studied for centuries. From the very beginning of commercial relations, the relationship between the service provider and the recipient has been valued almost above all else. Trade is built on trust and respect. The image of an entrepreneur is often more important than the product he sells. For hundreds of years, the relationship between the merchant and the buyer, the entrepreneur and the client has not lost its importance, and in the era of mass digitalization, the quality of the relationship between the company and the target audience of different sizes and professional feedback support with clients often start the success of e-business. To provide these additional tools and information technologies to help businessmen monitor e-business development opportunities in a specific location, as well as establish feedback with users through social networks and mass media. Obtaining such tools will significantly expand the vision of market opportunities for ebusiness, it will clarify which of them make sense to invest in, and which ones are not worth paying time for. Also see what idea has the future and what business model needs to be implemented/maintained/developed for the rapid development of territorial/interregional e-business. It will also help to understand which levers have the greatest effect for business changes: what not to touch, and what policies to change to ensure high speed in the implementation of the plan based on the analysis of relevant research results, for example, to receive: direct feedback from customers, the dynamics of changes in overall satisfaction or interest of the target audience and advantages/disadvantages from users using NLP analysis; support for the development of e-business in relation to the location of their enterprise and the best directions; – graphs of business development (improvement/deterioration) depending on the content of comments. Objective of the study is to develop information technology to support the development of e-business by analyzing business locations, processing feedback from users, analyzing and classifying customer feedback in real time from social networks: Twitter, Reddit, Facebook and others using deep learning and Natural methods. Language Processing of Ukrainian-speaking and Englishspeaking texts. Method. NLP-methods were used to analyze the opinions of users and customers. Among the methods of implementing the main functions of English-language news classification, the following machine learning methods are used: naive Bayesian classifier, logistic regression, and the method of support vectors. The Naive Bayes algorithm was used to classify Ukrainian-language user feedback, as it performs well on small amounts of data, is easy to train and operate, and works well with text data. Naive Bayes classifier is a very good option for our system and considering that the number of responses in the dataset is smaller compared to the averages. Results. A machine learning model was developed for the analysis and classification of Ukrainian- and English-language reviews from users of e-commerce systems. Conclusions. The created model shows excellent classification results on test data. The overall accuracy of the sentimental model for the analysis of Ukrainian-language content is quite satisfactory, 92.3%. The logistic regression method coped best with the task of analyzing the impact of English-language news on the financial market, which showed an accuracy of 75.67%. This is certainly not the desired result, but it is the largest indicator of all considered. The support vector method (SVM) coped somewhat worse with the task, which showed an accuracy of 72.78%, which is a slightly worse result than the one obtained thanks to the logistic regression method. And the naïve Bayesian classifier method did the worst with the task, which achieved an accuracy of 71.13%, which is less than the two previous methods.
基于机器学习的电子商务系统用户反馈支持情感分析技术
上下文。几个世纪以来,人们一直在研究公司与其目标受众之间的互动。从商业关系的一开始,服务提供者和接受者之间的关系就被重视得几乎高于一切。贸易建立在信任和尊重的基础上。企业家的形象往往比他销售的产品更重要。几百年来,商家与买家、企业家与客户之间的关系并没有失去其重要性,而在大规模数字化时代,企业与不同规模的目标受众之间的关系质量以及与客户的专业反馈支持往往是电子商务成功的开始。提供这些额外的工具和资讯科技,协助商界人士在特定地点留意电子商务的发展机会,并透过社交网络和大众传媒与用户建立反馈。获得这样的工具将大大扩大电子商务市场机会的视野,它将澄清哪些是有意义的投资,哪些是不值得花时间。同时,您还可以了解到,为了地域/跨区域电子商务的快速发展,需要实施/维护/发展什么样的商业模式。这也将有助于了解哪些杠杆对业务变化的影响最大:什么是不触及的,以及根据相关研究结果的分析,改变哪些政策以确保计划的高速实施,例如,接收:客户的直接反馈,目标受众总体满意度或兴趣的变化动态以及使用NLP分析的用户的优势/劣势;就企业所在地及最佳方向,支援电子商务的发展;-根据评论内容的业务发展(改善/恶化)图表。 本研究的目的是利用深度学习和自然方法,分析商业地点、处理用户反馈、分析和分类来自Twitter、Reddit、Facebook等社交网络的实时客户反馈,从而发展信息技术,以支持电子商务的发展。乌克兰语和英语文本的语言处理。 方法。使用nlp方法分析用户和顾客的意见。在实现英语新闻分类的主要功能的方法中,使用了以下机器学习方法:朴素贝叶斯分类器、逻辑回归和支持向量法。使用朴素贝叶斯算法对乌克兰语用户反馈进行分类,因为它在少量数据上表现良好,易于训练和操作,并且对文本数据也很好。朴素贝叶斯分类器对我们的系统来说是一个非常好的选择,考虑到数据集中的响应数量比平均值要少。 结果。开发了一个机器学习模型,用于对电子商务系统用户的乌克兰语和英语评论进行分析和分类。 结论。所建立的模型在测试数据上显示出良好的分类效果。分析乌克兰语内容的情感模型的总体准确性相当令人满意,为92.3%。逻辑回归方法最适合分析英语新闻对金融市场的影响,准确率为75.67%。这当然不是期望的结果,但它是所有考虑的最大指标。支持向量法(SVM)的准确率为72.78%,略低于逻辑回归法。而naïve贝叶斯分类器方法在该任务中表现最差,准确率为71.13%,低于前两种方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Radio Electronics Computer Science Control
Radio Electronics Computer Science Control COMPUTER SCIENCE, HARDWARE & ARCHITECTURE-
自引率
20.00%
发文量
66
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信