{"title":"Detecting the Internet Water Army via comprehensive behavioral features using large-scale E-commerce reviews","authors":"B. Guo, Hao Wang, Zhaojun Yu, Yu Sun","doi":"10.1109/CITS.2017.8035320","DOIUrl":null,"url":null,"abstract":"Online reviews play a crucial role in helping consumers to make purchase decisions. However, a severe problem Internet Water Army (a large amount of paid posters who write inauthentic reviews) emerge in many E-commerce websites recently which dramatically undermines the value of user reviews. Although the word Internet Water Army originated from China, some other countries also suffered from this problem. Many organized underground paid poster groups found it extremely profitable to mislead the consumers by writing fake reviews. It had become more and more challenging to accurately detect the water army who could alter their writing style. In this paper, we design a comprehensive set of features to compare paid posters against normal users on different dimensions. Then we build an ensemble detection model of seven different algorithms. Our model has reached 0.726 in AUC measure and 0.683 in F1 measure on JD dataset, 0.926 in AUC measure and 0.871 in F1 measure on Amazon dataset, which outperforms previous studies. Our work provides some practical solutions and guidance to this severe problem for the whole E-commerce industry.","PeriodicalId":314150,"journal":{"name":"2017 International Conference on Computer, Information and Telecommunication Systems (CITS)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computer, Information and Telecommunication Systems (CITS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CITS.2017.8035320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Online reviews play a crucial role in helping consumers to make purchase decisions. However, a severe problem Internet Water Army (a large amount of paid posters who write inauthentic reviews) emerge in many E-commerce websites recently which dramatically undermines the value of user reviews. Although the word Internet Water Army originated from China, some other countries also suffered from this problem. Many organized underground paid poster groups found it extremely profitable to mislead the consumers by writing fake reviews. It had become more and more challenging to accurately detect the water army who could alter their writing style. In this paper, we design a comprehensive set of features to compare paid posters against normal users on different dimensions. Then we build an ensemble detection model of seven different algorithms. Our model has reached 0.726 in AUC measure and 0.683 in F1 measure on JD dataset, 0.926 in AUC measure and 0.871 in F1 measure on Amazon dataset, which outperforms previous studies. Our work provides some practical solutions and guidance to this severe problem for the whole E-commerce industry.