Handling Imbalance in Fraudulent Reviewer Detection based on Expectation Maximization and KL Divergence
Wen Zhang, Guan-Shi Qin, Qiang Wang
Proceedings. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, published 2021-12-14
DOI: 10.1145/3498851.3498989 (https://doi.org/10.1145/3498851.3498989)
Citation count: 1
Abstract
Online review fraud and review manipulation hurt the profits of stakeholders and undermine the value of online reviews. Detecting online review fraud and fraudulent reviewers effectively is therefore critical to the development of e-commerce. Existing studies propose various techniques to detect fraudulent reviewers, but most of them do not handle the data imbalance problem in fraudulent reviewer detection. To fill this research gap, this paper proposes a novel approach, called EMKL, that detects fraudulent reviewers while handling data imbalance, based on Expectation Maximization (EM) and Kullback–Leibler (KL) divergence. We first use the EM algorithm to model the latent topic distributions of reviewers over the review features. We then use the KL divergence to measure the similarity between reviewers based on their topic distributions, and use these similarities to detect fraudulent reviewers. An experiment on the Yelp dataset shows that EMKL performs well in detecting fraudulent reviewers and outperforms state-of-the-art techniques.
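The similarity step can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not specify how the topic distributions are estimated, whether the divergence is symmetrised, or how a reviewer is finally flagged. The sketch assumes each reviewer is already represented by a discrete topic distribution (as an EM step might produce), uses a symmetrised KL divergence as an assumption, and ranks reviewers by their mean divergence from the others. The reviewer names, the numbers, and the helper functions `kl_divergence`, `symmetric_kl`, and `mean_divergence` are all hypothetical.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions given as equal-length lists."""
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    total = 0.0
    for pi, qi in zip(p, q):
        if pi > 0.0:
            # eps guards against log(0) when q puts zero mass on a topic
            total += pi * math.log(pi / max(qi, eps))
    return total


def symmetric_kl(p, q):
    """Symmetrised KL divergence (an assumption; the abstract does not
    state which form of the divergence EMKL uses)."""
    return 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))


# Hypothetical topic distributions for three reviewers (made-up numbers).
reviewers = {
    "r1": [0.70, 0.20, 0.10],
    "r2": [0.65, 0.25, 0.10],
    "r3": [0.05, 0.15, 0.80],
}


def mean_divergence(name):
    """Average symmetrised divergence between one reviewer and all others."""
    others = [dist for key, dist in reviewers.items() if key != name]
    return sum(symmetric_kl(reviewers[name], o) for o in others) / len(others)


# Illustrative ranking rule only: the reviewer whose topic distribution
# differs most from everyone else's is treated as the most atypical.
scores = {name: mean_divergence(name) for name in reviewers}
most_atypical = max(scores, key=scores.get)
```

Because the comparison is made between distributions rather than by counting labelled examples per class, a rule of this kind does not need balanced training data. Whether that is the mechanism EMKL relies on is not stated in the abstract.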