{"title":"基于ERNIE和SGAN的跨语言虚假评论识别研究","authors":"Yuhang Zhang, Jiatai Wu, Min Zhang, Tao Liu","doi":"10.1109/ISCTIS58954.2023.10213159","DOIUrl":null,"url":null,"abstract":"With the rise of social media, the prevalence of fake reviews has surged, causing significant harm to the e-commerce industry's competitive landscape. To tackle the issue of limited publicly available datasets and the challenge of identifying fake reviews, a cross-lingual review dataset is created by amalgamating existing publicly available datasets. A fake review recognition model is devised based on the ERNIE2.0 pretrained language model and a semi-supervised generative adversarial network. Initially, ERNIE is employed to extract high-quality linguistic representations of the review data. Next, a generator in a semi-supervised generative adversarial network is utilized to generate noisy data that has a similar distribution to that of the genuine review text data. Finally, the identification of fake reviews is executed in a discriminator. Experimental validation is conducted using the cross-lingual dataset created, and the results indicate that the method achieves a remarkable 81.43% accuracy in identifying fake reviews with only a small amount of labeled data, thereby affirming its effectiveness.","PeriodicalId":334790,"journal":{"name":"2023 3rd International Symposium on Computer Technology and Information Science (ISCTIS)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Research on Cross-Language Fake Reviews Identification Based on ERNIE and SGAN\",\"authors\":\"Yuhang Zhang, Jiatai Wu, Min Zhang, Tao Liu\",\"doi\":\"10.1109/ISCTIS58954.2023.10213159\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rise of social media, the prevalence of fake reviews has surged, causing significant harm to the e-commerce industry's competitive landscape. To tackle the issue of limited publicly available datasets and the challenge of identifying fake reviews, a cross-lingual review dataset is created by amalgamating existing publicly available datasets. A fake review recognition model is devised based on the ERNIE2.0 pretrained language model and a semi-supervised generative adversarial network. Initially, ERNIE is employed to extract high-quality linguistic representations of the review data. Next, a generator in a semi-supervised generative adversarial network is utilized to generate noisy data that has a similar distribution to that of the genuine review text data. Finally, the identification of fake reviews is executed in a discriminator. Experimental validation is conducted using the cross-lingual dataset created, and the results indicate that the method achieves a remarkable 81.43% accuracy in identifying fake reviews with only a small amount of labeled data, thereby affirming its effectiveness.\",\"PeriodicalId\":334790,\"journal\":{\"name\":\"2023 3rd International Symposium on Computer Technology and Information Science (ISCTIS)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 3rd International Symposium on Computer Technology and Information Science (ISCTIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCTIS58954.2023.10213159\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Symposium on Computer Technology and Information Science (ISCTIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCTIS58954.2023.10213159","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Research on Cross-Language Fake Reviews Identification Based on ERNIE and SGAN
With the rise of social media, the prevalence of fake reviews has surged, causing significant harm to the e-commerce industry's competitive landscape. To tackle the issue of limited publicly available datasets and the challenge of identifying fake reviews, a cross-lingual review dataset is created by amalgamating existing publicly available datasets. A fake review recognition model is devised based on the ERNIE2.0 pretrained language model and a semi-supervised generative adversarial network. Initially, ERNIE is employed to extract high-quality linguistic representations of the review data. Next, a generator in a semi-supervised generative adversarial network is utilized to generate noisy data that has a similar distribution to that of the genuine review text data. Finally, the identification of fake reviews is executed in a discriminator. Experimental validation is conducted using the cross-lingual dataset created, and the results indicate that the method achieves a remarkable 81.43% accuracy in identifying fake reviews with only a small amount of labeled data, thereby affirming its effectiveness.