{"title":"A Research on Cross-Language Fake Reviews Identification Based on ERNIE and SGAN","authors":"Yuhang Zhang, Jiatai Wu, Min Zhang, Tao Liu","doi":"10.1109/ISCTIS58954.2023.10213159","DOIUrl":null,"url":null,"abstract":"With the rise of social media, the prevalence of fake reviews has surged, causing significant harm to the e-commerce industry's competitive landscape. To tackle the issue of limited publicly available datasets and the challenge of identifying fake reviews, a cross-lingual review dataset is created by amalgamating existing publicly available datasets. A fake review recognition model is devised based on the ERNIE2.0 pretrained language model and a semi-supervised generative adversarial network. Initially, ERNIE is employed to extract high-quality linguistic representations of the review data. Next, a generator in a semi-supervised generative adversarial network is utilized to generate noisy data that has a similar distribution to that of the genuine review text data. Finally, the identification of fake reviews is executed in a discriminator. Experimental validation is conducted using the cross-lingual dataset created, and the results indicate that the method achieves a remarkable 81.43% accuracy in identifying fake reviews with only a small amount of labeled data, thereby affirming its effectiveness.","PeriodicalId":334790,"journal":{"name":"2023 3rd International Symposium on Computer Technology and Information Science (ISCTIS)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Symposium on Computer Technology and Information Science (ISCTIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCTIS58954.2023.10213159","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the rise of social media, the prevalence of fake reviews has surged, causing significant harm to the e-commerce industry's competitive landscape. To tackle the issue of limited publicly available datasets and the challenge of identifying fake reviews, a cross-lingual review dataset is created by amalgamating existing publicly available datasets. A fake review recognition model is devised based on the ERNIE2.0 pretrained language model and a semi-supervised generative adversarial network. Initially, ERNIE is employed to extract high-quality linguistic representations of the review data. Next, a generator in a semi-supervised generative adversarial network is utilized to generate noisy data that has a similar distribution to that of the genuine review text data. Finally, the identification of fake reviews is executed in a discriminator. Experimental validation is conducted using the cross-lingual dataset created, and the results indicate that the method achieves a remarkable 81.43% accuracy in identifying fake reviews with only a small amount of labeled data, thereby affirming its effectiveness.