使用大型语言模型检测真实的和计算机生成的客户评论的双阶段框架

Decision Analytics Journal Pub Date : 2025-05-02 DOI:10.1016/j.dajour.2025.100581

Dina Nawara , Rasha Kashef

{"title":"使用大型语言模型检测真实的和计算机生成的客户评论的双阶段框架","authors":"Dina Nawara , Rasha Kashef","doi":"10.1016/j.dajour.2025.100581","DOIUrl":null,"url":null,"abstract":"<div><div>Customer reviews are crucial in potential buyers’ decision-making process. However, on online platforms, the credibility of these reviews is often undermined by fake reviews, which can mislead users. With advancements in large language models (LLMs), the review landscape has transformed, making it more common to encounter computer-generated reviews created using state-of-the-art language models rather than genuine user feedback. This evolution poses significant challenges in distinguishing authentic reviews from artificially generated ones. To address these challenges, we propose a novel dual-phase framework that first generates high-diversity synthetic reviews using advanced LLMs to learn their patterns, and then it leverages this knowledge to enhance fake reviews detection. Our methodology involves two key phases. In the first phase, we generate computer-generated reviews by leveraging advanced methods, including generative transformers, trained on genuine user reviews. In the second phase; traditional and deep learning based classifiers, are incorporated as detection models which classify reviews as either authentic or computer-generated. Evaluated on a benchmark Amazon review dataset, our framework demonstrate (1) the efficacy of our approach in generating diverse and contextually relevant human-based and computerized-based reviews and (2) the robustness of our system in classifying and verifying the authenticity of reviews.</div></div>","PeriodicalId":100357,"journal":{"name":"Decision Analytics Journal","volume":"15 ","pages":"Article 100581"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A dual-phase framework for detecting authentic and computer-generated customer reviews using large language models\",\"authors\":\"Dina Nawara , Rasha Kashef\",\"doi\":\"10.1016/j.dajour.2025.100581\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Customer reviews are crucial in potential buyers’ decision-making process. However, on online platforms, the credibility of these reviews is often undermined by fake reviews, which can mislead users. With advancements in large language models (LLMs), the review landscape has transformed, making it more common to encounter computer-generated reviews created using state-of-the-art language models rather than genuine user feedback. This evolution poses significant challenges in distinguishing authentic reviews from artificially generated ones. To address these challenges, we propose a novel dual-phase framework that first generates high-diversity synthetic reviews using advanced LLMs to learn their patterns, and then it leverages this knowledge to enhance fake reviews detection. Our methodology involves two key phases. In the first phase, we generate computer-generated reviews by leveraging advanced methods, including generative transformers, trained on genuine user reviews. In the second phase; traditional and deep learning based classifiers, are incorporated as detection models which classify reviews as either authentic or computer-generated. Evaluated on a benchmark Amazon review dataset, our framework demonstrate (1) the efficacy of our approach in generating diverse and contextually relevant human-based and computerized-based reviews and (2) the robustness of our system in classifying and verifying the authenticity of reviews.</div></div>\",\"PeriodicalId\":100357,\"journal\":{\"name\":\"Decision Analytics Journal\",\"volume\":\"15 \",\"pages\":\"Article 100581\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-05-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Decision Analytics Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772662225000372\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision Analytics Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772662225000372","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

客户评论在潜在买家的决策过程中是至关重要的。然而，在网络平台上，这些评论的可信度往往受到虚假评论的影响，这些评论可能会误导用户。随着大型语言模型（llm）的进步，评论领域已经发生了变化，使得使用最先进的语言模型而不是真正的用户反馈创建的计算机生成的评论变得更加常见。这种演变在区分真实评论和人工生成的评论方面提出了重大挑战。为了应对这些挑战，我们提出了一个新的双阶段框架，首先使用高级llm来学习其模式生成高多样性的合成评论，然后利用这些知识来增强虚假评论检测。我们的方法包括两个关键阶段。在第一阶段，我们利用先进的方法生成计算机生成的评论，包括生成转换器，训练真实的用户评论。在第二阶段；传统的和基于深度学习的分类器被合并为检测模型，将评论分类为真实的或计算机生成的。在一个基准的亚马逊评论数据集上进行评估，我们的框架证明了(1)我们的方法在生成多样化和上下文相关的基于人的和基于计算机的评论方面的有效性，以及(2)我们的系统在分类和验证评论真实性方面的鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A dual-phase framework for detecting authentic and computer-generated customer reviews using large language models

Customer reviews are crucial in potential buyers’ decision-making process. However, on online platforms, the credibility of these reviews is often undermined by fake reviews, which can mislead users. With advancements in large language models (LLMs), the review landscape has transformed, making it more common to encounter computer-generated reviews created using state-of-the-art language models rather than genuine user feedback. This evolution poses significant challenges in distinguishing authentic reviews from artificially generated ones. To address these challenges, we propose a novel dual-phase framework that first generates high-diversity synthetic reviews using advanced LLMs to learn their patterns, and then it leverages this knowledge to enhance fake reviews detection. Our methodology involves two key phases. In the first phase, we generate computer-generated reviews by leveraging advanced methods, including generative transformers, trained on genuine user reviews. In the second phase; traditional and deep learning based classifiers, are incorporated as detection models which classify reviews as either authentic or computer-generated. Evaluated on a benchmark Amazon review dataset, our framework demonstrate (1) the efficacy of our approach in generating diverse and contextually relevant human-based and computerized-based reviews and (2) the robustness of our system in classifying and verifying the authenticity of reviews.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Decision Analytics Journal

CiteScore

3.90

自引率

0.00%

发文量