Semi-Supervised Fake Reviews Detection based on AspamGAN

Day 2 Tue, March 22, 2022 Pub Date : 2022-03-30 DOI:10.36548/jaicn.2022.1.002

Chen Jing-Yu, Wang Ya-jun

{"title":"Semi-Supervised Fake Reviews Detection based on AspamGAN","authors":"Chen Jing-Yu, Wang Ya-jun","doi":"10.36548/jaicn.2022.1.002","DOIUrl":null,"url":null,"abstract":"With the popularization of social software and e-business in recent years, more and more consumers like to share their consumption experiences on social networks and refer to other consumers' reviews and opinions when making consumption decisions. Online reviews have become an essential part of browsing on websites such as shopping, and people's reliance on informative reviews have contributed to the rise of fake reviews. The traditional classification method is affected by the label dataset, which is not only time-consuming, laborious, and subjective, but also the extraction of artificial features also affects the classification accuracy. Due to the relative length of the online text, the possibility of the classifier losing important information increases, this weakens the model’s detection capability. To solve this aforementioned problem, a semi-supervised Generative Adversarial Network (AspamGAN) fake reviews detection method incorporating an attention mechanism is proposed. Using labeled and unlabeled data to correctly learn input distributions, the features required for classification are automatically discovered using deep neural networks, providing better prediction accuracy for online reviews. The approach includes attention mechanisms in the classifier to obtain an adequate semantic representation and relies on a limited dataset of labeled data to detect false reviews, and is applied on the TripAdvisor dataset. Experimental results show that the proposed algorithm outperforms state-of-the-art semi-supervised fake review detection techniques when the label dataset is limited.","PeriodicalId":10940,"journal":{"name":"Day 2 Tue, March 22, 2022","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 2 Tue, March 22, 2022","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36548/jaicn.2022.1.002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

Abstract

With the popularization of social software and e-business in recent years, more and more consumers like to share their consumption experiences on social networks and refer to other consumers' reviews and opinions when making consumption decisions. Online reviews have become an essential part of browsing on websites such as shopping, and people's reliance on informative reviews have contributed to the rise of fake reviews. The traditional classification method is affected by the label dataset, which is not only time-consuming, laborious, and subjective, but also the extraction of artificial features also affects the classification accuracy. Due to the relative length of the online text, the possibility of the classifier losing important information increases, this weakens the model’s detection capability. To solve this aforementioned problem, a semi-supervised Generative Adversarial Network (AspamGAN) fake reviews detection method incorporating an attention mechanism is proposed. Using labeled and unlabeled data to correctly learn input distributions, the features required for classification are automatically discovered using deep neural networks, providing better prediction accuracy for online reviews. The approach includes attention mechanisms in the classifier to obtain an adequate semantic representation and relies on a limited dataset of labeled data to detect false reviews, and is applied on the TripAdvisor dataset. Experimental results show that the proposed algorithm outperforms state-of-the-art semi-supervised fake review detection techniques when the label dataset is limited.

查看原文本刊更多论文

基于AspamGAN的半监督虚假评论检测

近年来，随着社交软件和电子商务的普及，越来越多的消费者喜欢在社交网络上分享自己的消费经历，并在做出消费决策时参考其他消费者的评论和意见。在线评论已经成为购物等网站浏览的重要组成部分，人们对信息丰富的评论的依赖导致了虚假评论的增多。传统的分类方法受标签数据的影响，不仅耗时、费力、主观，而且人工特征的提取也会影响分类的准确性。由于在线文本的相对长度，分类器丢失重要信息的可能性增加，这削弱了模型的检测能力。为了解决上述问题，提出了一种结合注意机制的半监督生成式对抗网络(AspamGAN)虚假评论检测方法。使用标记和未标记的数据正确学习输入分布，使用深度神经网络自动发现分类所需的特征，为在线评论提供更好的预测精度。该方法包括分类器中的注意机制，以获得足够的语义表示，并依赖于有限的标记数据集来检测虚假评论，并应用于TripAdvisor数据集。实验结果表明，在标签数据有限的情况下，该算法优于目前最先进的半监督虚假评论检测技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Day 2 Tue, March 22, 2022

自引率

0.00%

发文量