利用预训练的 DeBERTa，以 "帝王蝶 "范式进行优化，实现高性能虚假评论检测。

IF 3.9 2区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

Scientific Reports Pub Date : 2025-03-03 DOI:10.1038/s41598-025-89453-8

S Geetha, E Elakiya, R Sujithra Kanmani, Manas Kamal Das

{"title":"利用预训练的 DeBERTa，以 \"帝王蝶 \"范式进行优化，实现高性能虚假评论检测。","authors":"S Geetha, E Elakiya, R Sujithra Kanmani, Manas Kamal Das","doi":"10.1038/s41598-025-89453-8","DOIUrl":null,"url":null,"abstract":"In this era of internet, e-commerce has grown tremendously and the customers are increasingly relying on reviews for product information. As these reviews influence the purchasing ability of the future customer, it can give a positive or negative impact on the businesses. The effectiveness of online reviews is compromised by fake reviews that provide false information about the product. Fake reviews can not only impact the reputation of the businesses but also involve financial losses. Thus, detection of fake reviews is essential to solve the problem for maintaining the integrity of online reviews. Existing Machine learning models often struggle with deep contextual understanding. Scaling machine learning models while maintaining accuracy and efficiency becomes increasingly challenging as the volume of online reviews continues to grow. Hence, this research work introduces a novel MBO-DeBERTa, a deep neural network with Monarch Butterfly Optimizer. The proposed model improves the capacity to differentiate between overlapping characteristics of fake and authentic reviews. MBO-DeBERTa attained a classification accuracy of 98% for detecting the fake reviews. The proposed framework is tested on three different datasets such as Amazon, Fake Review and Deceptive Opinion Spam containing 21000,40000 and 1600 reviews respectively which are publicly available in Kaggle. The proposed model also detects adversarial attacks using the Fast Gradient Sign Method (FGSM) and thereby evaluating its resistance to such attacks and noise. The proposed model was also tested on the unseen data of Myntra and Amazon verified customer reviews and our model works efficiently for real world data. Thus the results show that the suggested model outperforms the current models showing increased accuracy, precision, recall, F1 score and reduced loss rate.","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"7445"},"PeriodicalIF":3.9000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11876299/pdf/","citationCount":"0","resultStr":"{\"title\":\"High performance fake review detection using pretrained DeBERTa optimized with Monarch Butterfly paradigm.\",\"authors\":\"S Geetha, E Elakiya, R Sujithra Kanmani, Manas Kamal Das\",\"doi\":\"10.1038/s41598-025-89453-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this era of internet, e-commerce has grown tremendously and the customers are increasingly relying on reviews for product information. As these reviews influence the purchasing ability of the future customer, it can give a positive or negative impact on the businesses. The effectiveness of online reviews is compromised by fake reviews that provide false information about the product. Fake reviews can not only impact the reputation of the businesses but also involve financial losses. Thus, detection of fake reviews is essential to solve the problem for maintaining the integrity of online reviews. Existing Machine learning models often struggle with deep contextual understanding. Scaling machine learning models while maintaining accuracy and efficiency becomes increasingly challenging as the volume of online reviews continues to grow. Hence, this research work introduces a novel MBO-DeBERTa, a deep neural network with Monarch Butterfly Optimizer. The proposed model improves the capacity to differentiate between overlapping characteristics of fake and authentic reviews. MBO-DeBERTa attained a classification accuracy of 98% for detecting the fake reviews. The proposed framework is tested on three different datasets such as Amazon, Fake Review and Deceptive Opinion Spam containing 21000,40000 and 1600 reviews respectively which are publicly available in Kaggle. The proposed model also detects adversarial attacks using the Fast Gradient Sign Method (FGSM) and thereby evaluating its resistance to such attacks and noise. The proposed model was also tested on the unseen data of Myntra and Amazon verified customer reviews and our model works efficiently for real world data. Thus the results show that the suggested model outperforms the current models showing increased accuracy, precision, recall, F1 score and reduced loss rate.\",\"PeriodicalId\":21811,\"journal\":{\"name\":\"Scientific Reports\",\"volume\":\"15 1\",\"pages\":\"7445\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11876299/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Reports\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41598-025-89453-8\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-89453-8","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

在这个互联网时代，电子商务发展迅猛，客户越来越依赖评论来获取产品信息。由于这些评论会影响未来顾客的购买能力，因此会对企业产生积极或消极的影响。提供虚假产品信息的虚假评论会损害在线评论的有效性。虚假评论不仅会影响企业声誉，还会造成经济损失。因此，检测虚假评论对于解决维护在线评论完整性的问题至关重要。现有的机器学习模型往往难以深入理解上下文。随着在线评论数量的不断增长，在保持准确性和效率的同时扩展机器学习模型变得越来越具有挑战性。因此，这项研究工作引入了一种新颖的 MBO-DeBERTa，这是一种带有 "帝王蝶优化器"（Monarch Butterfly Optimizer）的深度神经网络。所提出的模型提高了区分真假评论重叠特征的能力。MBO-DeBERTa 检测虚假评论的分类准确率达到 98%。提议的框架在三个不同的数据集上进行了测试，如亚马逊、虚假评论和欺骗性意见垃圾邮件，分别包含 21000、40000 和 1600 条评论。提出的模型还使用快速梯度符号法（FGSM）检测对抗性攻击，从而评估其对此类攻击和噪声的抵抗能力。我们还在 Myntra 和亚马逊验证过的客户评论的未知数据上测试了所提出的模型，结果表明我们的模型在真实世界的数据中也能有效工作。结果表明，建议的模型优于当前的模型，准确率、精确度、召回率、F1 分数均有所提高，损失率也有所降低。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

High performance fake review detection using pretrained DeBERTa optimized with Monarch Butterfly paradigm.

查看原文本刊更多论文

High performance fake review detection using pretrained DeBERTa optimized with Monarch Butterfly paradigm.

In this era of internet, e-commerce has grown tremendously and the customers are increasingly relying on reviews for product information. As these reviews influence the purchasing ability of the future customer, it can give a positive or negative impact on the businesses. The effectiveness of online reviews is compromised by fake reviews that provide false information about the product. Fake reviews can not only impact the reputation of the businesses but also involve financial losses. Thus, detection of fake reviews is essential to solve the problem for maintaining the integrity of online reviews. Existing Machine learning models often struggle with deep contextual understanding. Scaling machine learning models while maintaining accuracy and efficiency becomes increasingly challenging as the volume of online reviews continues to grow. Hence, this research work introduces a novel MBO-DeBERTa, a deep neural network with Monarch Butterfly Optimizer. The proposed model improves the capacity to differentiate between overlapping characteristics of fake and authentic reviews. MBO-DeBERTa attained a classification accuracy of 98% for detecting the fake reviews. The proposed framework is tested on three different datasets such as Amazon, Fake Review and Deceptive Opinion Spam containing 21000,40000 and 1600 reviews respectively which are publicly available in Kaggle. The proposed model also detects adversarial attacks using the Fast Gradient Sign Method (FGSM) and thereby evaluating its resistance to such attacks and noise. The proposed model was also tested on the unseen data of Myntra and Amazon verified customer reviews and our model works efficiently for real world data. Thus the results show that the suggested model outperforms the current models showing increased accuracy, precision, recall, F1 score and reduced loss rate.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Scientific Reports Natural Science Disciplines-

CiteScore

7.50

自引率

4.30%

发文量

19567

审稿时长

3.9 months

期刊介绍： We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections. Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021). •Engineering Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live. •Physical sciences Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics. •Earth and environmental sciences Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems. •Biological sciences Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants. •Health sciences The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.