Deceptive review detection using GAN enhanced by GPT structure and score of reviews

2023 28th International Computer Conference, Computer Society of Iran (CSICC) Pub Date : 2023-01-25 DOI:10.1109/CSICC58665.2023.10105368

Maryam Tamimi, Mostafa Salehi, S. Najari

{"title":"Deceptive review detection using GAN enhanced by GPT structure and score of reviews","authors":"Maryam Tamimi, Mostafa Salehi, S. Najari","doi":"10.1109/CSICC58665.2023.10105368","DOIUrl":null,"url":null,"abstract":"These days, in online E-commerce platforms, the most of users utilize the purchase experience of other users reported in the reviews to make a right decision. As much as the positive effect of this phenomena, there would be a motivation to produce deceptive reviews toward some malicious goals. Therefore, the detection and deletion of these deceptive reviews from online platforms will be an essential task to keep them safe. Different approaches from Machine Learning (ML)-based methods to the recent Neural Networks (NN)-based ones have been proposed to detect deceptive reviews. Here, the lack of sufficient labeled data is an essential barrier, to obviate that Generative Adversarial Networks (GAN) have been utilized to generate some data with a distribution close to original ones. In this regard and along with the successful results of Generative Pre-trained Transformers (GPT) in textual tasks, it have also been used besides GAN framework to detect deceptive reviews. Despite a lot of efforts on this matter, there is no efficient study for considering metadata or behavioral features besides powerful generative models. In this paper, we have proposed a new approach called Score_Gpt2ganto consider the scores of reviews as a regularization concept besides the GPT-based GAN approach. Evaluation results in comparison between different methods have shown an increase in the accuracy of 1.4% on the TripAdvisor dataset and 3.8% on the YelpZip dataset by our proposed method.","PeriodicalId":127277,"journal":{"name":"2023 28th International Computer Conference, Computer Society of Iran (CSICC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 28th International Computer Conference, Computer Society of Iran (CSICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSICC58665.2023.10105368","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

These days, in online E-commerce platforms, the most of users utilize the purchase experience of other users reported in the reviews to make a right decision. As much as the positive effect of this phenomena, there would be a motivation to produce deceptive reviews toward some malicious goals. Therefore, the detection and deletion of these deceptive reviews from online platforms will be an essential task to keep them safe. Different approaches from Machine Learning (ML)-based methods to the recent Neural Networks (NN)-based ones have been proposed to detect deceptive reviews. Here, the lack of sufficient labeled data is an essential barrier, to obviate that Generative Adversarial Networks (GAN) have been utilized to generate some data with a distribution close to original ones. In this regard and along with the successful results of Generative Pre-trained Transformers (GPT) in textual tasks, it have also been used besides GAN framework to detect deceptive reviews. Despite a lot of efforts on this matter, there is no efficient study for considering metadata or behavioral features besides powerful generative models. In this paper, we have proposed a new approach called Score_Gpt2ganto consider the scores of reviews as a regularization concept besides the GPT-based GAN approach. Evaluation results in comparison between different methods have shown an increase in the accuracy of 1.4% on the TripAdvisor dataset and 3.8% on the YelpZip dataset by our proposed method.

查看原文本刊更多论文

基于GPT结构和评论分数增强的GAN欺骗性评论检测

如今，在在线电子商务平台上，大多数用户利用评论中其他用户的购买体验来做出正确的决定。尽管这种现象有积极的影响，但也会有动机为某些恶意目标制作欺骗性评论。因此，从网络平台上发现和删除这些虚假评论将是确保其安全的重要任务。从基于机器学习(ML)的方法到最近基于神经网络(NN)的方法，已经提出了不同的方法来检测欺骗性评论。在这里，缺乏足够的标记数据是一个重要的障碍，以避免生成对抗网络(GAN)被用来生成一些具有接近原始分布的数据。在这方面，随着生成预训练变形器(GPT)在文本任务中的成功结果，除了GAN框架外，它还被用于检测欺骗性评论。尽管在这个问题上做了很多努力，但除了强大的生成模型之外，还没有有效的研究来考虑元数据或行为特征。在本文中，我们提出了一种名为score_gpt2gan的新方法，将评论的分数作为基于gpt的GAN方法之外的正则化概念来考虑。对比不同方法的评估结果表明，我们提出的方法在TripAdvisor数据集上的准确率提高了1.4%，在YelpZip数据集上的准确率提高了3.8%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 28th International Computer Conference, Computer Society of Iran (CSICC)

自引率

0.00%

发文量