Ning Cao, Shujuan Ji, Fuzhen Zhuang, Dickson K. W. Chiu, Yajie Guo, Maoguo Gong
{"title":"An unsupervised domain adaptation method for cross-domain deceptive reviews detection","authors":"Ning Cao, Shujuan Ji, Fuzhen Zhuang, Dickson K. W. Chiu, Yajie Guo, Maoguo Gong","doi":"10.1007/s10489-025-06825-3","DOIUrl":null,"url":null,"abstract":"<div><p>Deceptive reviews seriously affect the interests of consumers, honest sellers, and e-commerce platforms. As e-commerce platforms often involve multiple domains (i.e., different products or services), in-domain deceptive review detection models trained and tested on a specific dataset may not perform well on other domains. Moreover, obtaining annotated data for so many individual domains is unrealistic. Cross-domain deceptive review detection aims to leverage labeled source domain data to improve the model’s performance on unlabeled target domain data. However, existing cross-domain deceptive review detection methods require labels for target domain data or do not consider domain-specific information. To further advance research, this paper proposes an unsupervised domain adaptation method for detecting cross-domain deceptive reviews. First, we propose a multiple mask views generation method to enhance domain-specific information to obtain different mask views of reviews. Secondly, the BERT and mask attention mechanisms are used sequentially to obtain contextual representations of the mask views and the original view of reviews. Thirdly, to maintain the consistency between the mask views and the original view of reviews, we use the intra-domain Kullback-Leibler divergence to constrain their learning process. Moreover, we use inter-domain dynamic maximum mean discrepancy and conditional maximum mean discrepancy to reduce differences between the distribution of source and target domains. Three sets of experiments on two datasets show that our method is superior to the baselines. In particular, the impact of domain differences on domain adaptability is further analyzed according to the quantified metric named domain distance defined in this paper.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 14","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06825-3","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Deceptive reviews seriously affect the interests of consumers, honest sellers, and e-commerce platforms. As e-commerce platforms often involve multiple domains (i.e., different products or services), in-domain deceptive review detection models trained and tested on a specific dataset may not perform well on other domains. Moreover, obtaining annotated data for so many individual domains is unrealistic. Cross-domain deceptive review detection aims to leverage labeled source domain data to improve the model’s performance on unlabeled target domain data. However, existing cross-domain deceptive review detection methods require labels for target domain data or do not consider domain-specific information. To further advance research, this paper proposes an unsupervised domain adaptation method for detecting cross-domain deceptive reviews. First, we propose a multiple mask views generation method to enhance domain-specific information to obtain different mask views of reviews. Secondly, the BERT and mask attention mechanisms are used sequentially to obtain contextual representations of the mask views and the original view of reviews. Thirdly, to maintain the consistency between the mask views and the original view of reviews, we use the intra-domain Kullback-Leibler divergence to constrain their learning process. Moreover, we use inter-domain dynamic maximum mean discrepancy and conditional maximum mean discrepancy to reduce differences between the distribution of source and target domains. Three sets of experiments on two datasets show that our method is superior to the baselines. In particular, the impact of domain differences on domain adaptability is further analyzed according to the quantified metric named domain distance defined in this paper.
期刊介绍:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.