Md. Iqbal Hossain, Maqsudur Rahman, T. Ahmed, A. Z. M. T. Islam
{"title":"Forecast the Rating of Online Products from Customer Text Review based on Machine Learning Algorithms","authors":"Md. Iqbal Hossain, Maqsudur Rahman, T. Ahmed, A. Z. M. T. Islam","doi":"10.1109/ICICT4SD50815.2021.9396822","DOIUrl":null,"url":null,"abstract":"An online product's rating is an essential metric to understand the acceptability of that product to users. Shoppers use the rating to measure the quality and excellence of the online product. It helps an online shopper to decide to buy a product or not. It also helps the producer to further modification of that product during reproduction. Sometimes people buy a product online and give a text review of buying products but apathy towards giving a number rating, commonly a star rating. But producers need to know the rating of products for analysis of their business. Producers can use this rating for business analysis and can drive better revenue to their business. We have used some supervised machine learning approach to predict rating based on customer text review and compared the results between Random Forest Classifier, XGBoost and Logistic Regression algorithm with TF-IDF Vectorizer from extensive and series of experiments. We applied the algorithms mentioned above on the dataset named “GrammarandProductReviews” provided by Datafiniti. We analyzed the performance of each algorithm through the accuracy, precision, recall and f1-score. From the study, it is observed that Random Forest algorithm gained accuracy of 94% and precision, recall and f1-scores are 0.94, 0.94, and 0.94 respectively, which showed best compared to others.","PeriodicalId":239251,"journal":{"name":"2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICT4SD50815.2021.9396822","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
An online product's rating is an essential metric to understand the acceptability of that product to users. Shoppers use the rating to measure the quality and excellence of the online product. It helps an online shopper to decide to buy a product or not. It also helps the producer to further modification of that product during reproduction. Sometimes people buy a product online and give a text review of buying products but apathy towards giving a number rating, commonly a star rating. But producers need to know the rating of products for analysis of their business. Producers can use this rating for business analysis and can drive better revenue to their business. We have used some supervised machine learning approach to predict rating based on customer text review and compared the results between Random Forest Classifier, XGBoost and Logistic Regression algorithm with TF-IDF Vectorizer from extensive and series of experiments. We applied the algorithms mentioned above on the dataset named “GrammarandProductReviews” provided by Datafiniti. We analyzed the performance of each algorithm through the accuracy, precision, recall and f1-score. From the study, it is observed that Random Forest algorithm gained accuracy of 94% and precision, recall and f1-scores are 0.94, 0.94, and 0.94 respectively, which showed best compared to others.