Md. Iqbal Hossain, Maqsudur Rahman, M. T. Ahmed, Md. Saifur Rahman, A. Z. M. T. Islam
{"title":"基于机器学习算法的孟加拉语产品评论评级预测","authors":"Md. Iqbal Hossain, Maqsudur Rahman, M. T. Ahmed, Md. Saifur Rahman, A. Z. M. T. Islam","doi":"10.1109/AIMS52415.2021.9466022","DOIUrl":null,"url":null,"abstract":"The only way to provide feedback about a product is through reviews. When a new shopper proceeds to an online shop to purchase a product but does not have adequate time to study the reviews provided by other shoppers to get an opinion about the product, the shopper determines whether to buy the product or not on the number rating. Through reviews, shoppers can acquaint everyone about the product's quality and the manufacturer can enhance their products and business by interpreting that review. However, manufacturers demand a number review more than a text review for business analysis. This paper represents a machine learning based model for predicting the number rating from written text for Bangla product review. This study performs on a dataset collected manually from Daraz.com.bd, a Bangladeshi leading e-commerce shop. We have implemented Support Vector Machine (SVM), Random Forest, XGBoost, and Logistic Regression with Term Frequency-Inverse Document Frequency (TF-IDF) Vectorizer on our collected dataset and record all the performance metrics like accuracy, precision, recall and f1-score. From these above four algorithms, SVM showed more outstanding results than others in terms of performance metrics. SVM achieved 90% accuracy on the applied dataset. The other SVM performance metrics are 0.90, 0.92, and 0.91 for precision, recall and f1-score respectively.","PeriodicalId":299121,"journal":{"name":"2021 International Conference on Artificial Intelligence and Mechatronics Systems (AIMS)","volume":"473 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Rating Prediction of Product Reviews of Bangla Language using Machine Learning Algorithms\",\"authors\":\"Md. Iqbal Hossain, Maqsudur Rahman, M. T. Ahmed, Md. Saifur Rahman, A. Z. M. T. Islam\",\"doi\":\"10.1109/AIMS52415.2021.9466022\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The only way to provide feedback about a product is through reviews. When a new shopper proceeds to an online shop to purchase a product but does not have adequate time to study the reviews provided by other shoppers to get an opinion about the product, the shopper determines whether to buy the product or not on the number rating. Through reviews, shoppers can acquaint everyone about the product's quality and the manufacturer can enhance their products and business by interpreting that review. However, manufacturers demand a number review more than a text review for business analysis. This paper represents a machine learning based model for predicting the number rating from written text for Bangla product review. This study performs on a dataset collected manually from Daraz.com.bd, a Bangladeshi leading e-commerce shop. We have implemented Support Vector Machine (SVM), Random Forest, XGBoost, and Logistic Regression with Term Frequency-Inverse Document Frequency (TF-IDF) Vectorizer on our collected dataset and record all the performance metrics like accuracy, precision, recall and f1-score. From these above four algorithms, SVM showed more outstanding results than others in terms of performance metrics. SVM achieved 90% accuracy on the applied dataset. The other SVM performance metrics are 0.90, 0.92, and 0.91 for precision, recall and f1-score respectively.\",\"PeriodicalId\":299121,\"journal\":{\"name\":\"2021 International Conference on Artificial Intelligence and Mechatronics Systems (AIMS)\",\"volume\":\"473 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Artificial Intelligence and Mechatronics Systems (AIMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIMS52415.2021.9466022\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Artificial Intelligence and Mechatronics Systems (AIMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIMS52415.2021.9466022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
提供产品反馈的唯一途径就是通过评论。当一个新购物者进入网上商店购买产品,但没有足够的时间研究其他购物者提供的评论以获得对产品的意见时,购物者根据数字评级决定是否购买该产品。通过评论,购物者可以让每个人都了解产品的质量,制造商可以通过解释评论来提高他们的产品和业务。然而,对于业务分析,制造商更需要数字审查而不是文本审查。本文提出了一种基于机器学习的模型,用于从孟加拉产品评论的书面文本中预测数字评级。这项研究的数据集是从Daraz.com.bd手工收集的,这是一家孟加拉国领先的电子商务商店。我们在收集的数据集上实现了支持向量机(SVM)、随机森林(Random Forest)、XGBoost和具有Term Frequency- inverse Document Frequency (TF-IDF)矢量器的逻辑回归,并记录了所有的性能指标,如准确性、精密度、召回率和f1-score。在以上四种算法中,SVM在性能指标方面表现出比其他算法更突出的结果。SVM在应用数据集上的准确率达到90%。其他支持向量机性能指标分别为0.90,0.92和0.91的精度,召回率和f1-score。
Rating Prediction of Product Reviews of Bangla Language using Machine Learning Algorithms
The only way to provide feedback about a product is through reviews. When a new shopper proceeds to an online shop to purchase a product but does not have adequate time to study the reviews provided by other shoppers to get an opinion about the product, the shopper determines whether to buy the product or not on the number rating. Through reviews, shoppers can acquaint everyone about the product's quality and the manufacturer can enhance their products and business by interpreting that review. However, manufacturers demand a number review more than a text review for business analysis. This paper represents a machine learning based model for predicting the number rating from written text for Bangla product review. This study performs on a dataset collected manually from Daraz.com.bd, a Bangladeshi leading e-commerce shop. We have implemented Support Vector Machine (SVM), Random Forest, XGBoost, and Logistic Regression with Term Frequency-Inverse Document Frequency (TF-IDF) Vectorizer on our collected dataset and record all the performance metrics like accuracy, precision, recall and f1-score. From these above four algorithms, SVM showed more outstanding results than others in terms of performance metrics. SVM achieved 90% accuracy on the applied dataset. The other SVM performance metrics are 0.90, 0.92, and 0.91 for precision, recall and f1-score respectively.