Fake Review Detection using Naive Bayesian Classifier

2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS) Pub Date : 2023-06-14 DOI:10.1109/ICSCSS57650.2023.10169838

P. Kalaivani, V.Dinesh Raj, R. Madhavan, A. P. Naveen Kumar

{"title":"Fake Review Detection using Naive Bayesian Classifier","authors":"P. Kalaivani, V.Dinesh Raj, R. Madhavan, A. P. Naveen Kumar","doi":"10.1109/ICSCSS57650.2023.10169838","DOIUrl":null,"url":null,"abstract":"The issue of fake reviews is becoming an increasingly prevalent one on the internet. The purpose of these reviews is to deliberately deceive potential customers and influence their purchasing decisions. Businesses and customers alike are looking for ways to spot and filter out these fake reviews as a result. The Naive Bayes algorithm is one effective method for identifying fake reviews. A well-known machine learning algorithm for classification tasks is Naive Bayes. It is based on the probability theorem of Bayes, which enables us to determine the probability of an event with some evidence. The Naive Bayes algorithm can be trained on a dataset of reviews that are known to be real or fake in the context of fake reviews. The characteristics of genuine and fake reviews are then learned with the algorithm by utilizing this training data. After the algorithm has been trained, it can use the characteristics of new reviews to determine whether they are genuine or fake. The fact that Naive Bayes is a relatively straightforward algorithm that can be trained quickly and easily is one advantage of using it to detect fake reviews. Additionally, it works well with text data, which is the format used by the majority of reviews. Having said that, it’s critical to keep in mind that Naive Bayes isn’t perfect and may not be able to spot all fake reviews. Cleaning and normalizing data, dealing with missing data, and dealing with outliers are all potential obstacles in data pre-processing.","PeriodicalId":217957,"journal":{"name":"2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSCSS57650.2023.10169838","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The issue of fake reviews is becoming an increasingly prevalent one on the internet. The purpose of these reviews is to deliberately deceive potential customers and influence their purchasing decisions. Businesses and customers alike are looking for ways to spot and filter out these fake reviews as a result. The Naive Bayes algorithm is one effective method for identifying fake reviews. A well-known machine learning algorithm for classification tasks is Naive Bayes. It is based on the probability theorem of Bayes, which enables us to determine the probability of an event with some evidence. The Naive Bayes algorithm can be trained on a dataset of reviews that are known to be real or fake in the context of fake reviews. The characteristics of genuine and fake reviews are then learned with the algorithm by utilizing this training data. After the algorithm has been trained, it can use the characteristics of new reviews to determine whether they are genuine or fake. The fact that Naive Bayes is a relatively straightforward algorithm that can be trained quickly and easily is one advantage of using it to detect fake reviews. Additionally, it works well with text data, which is the format used by the majority of reviews. Having said that, it’s critical to keep in mind that Naive Bayes isn’t perfect and may not be able to spot all fake reviews. Cleaning and normalizing data, dealing with missing data, and dealing with outliers are all potential obstacles in data pre-processing.

查看原文本刊更多论文

基于朴素贝叶斯分类器的虚假评论检测

虚假评论问题在互联网上变得越来越普遍。这些评论的目的是故意欺骗潜在客户，影响他们的购买决定。因此，商家和消费者都在寻找发现和过滤这些虚假评论的方法。朴素贝叶斯算法是识别虚假评论的有效方法之一。朴素贝叶斯是一个著名的分类机器学习算法。它基于贝叶斯概率定理，它使我们能够通过一些证据来确定事件发生的概率。朴素贝叶斯算法可以在虚假评论的背景下，在已知是真实或虚假的评论数据集上进行训练。然后利用这些训练数据，用算法学习真假评论的特征。算法经过训练后，它可以使用新评论的特征来确定它们是真的还是假的。事实上，朴素贝叶斯是一种相对简单的算法，可以快速轻松地训练，这是使用它来检测虚假评论的一个优势。此外，它可以很好地处理文本数据，这是大多数评论使用的格式。话虽如此，但要记住，朴素贝叶斯算法并不完美，可能无法识别所有虚假评论。数据的清理和规范化、缺失数据的处理以及异常值的处理都是数据预处理中的潜在障碍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS)

自引率

0.00%

发文量