使用位置LSTM / K-L散度集成方法检测虚假Yelp评论

Christopher G. Harris
{"title":"使用位置LSTM / K-L散度集成方法检测虚假Yelp评论","authors":"Christopher G. Harris","doi":"10.1109/ICISIT54091.2022.9872788","DOIUrl":null,"url":null,"abstract":"Online reviews of products and services often have a significant impact on future purchases. Unfortunately, this invites opportunities for fraud –to provide fake reviews to improve a company’s reputation in the eyes of consumers or to disparage the reputation of a competitor. We combine two methods to detect fraud in Yelp restaurant reviews. First, we develop and apply a bi-directional Long-Short Term Memory (LSTM), a type of recurrent neural network, to take advantage of the positional relevancy of comments within reviews. LSTMs are well-suited to examine linguistic features in text and evaluating different regions of text within the review enhances our model’s accuracy. To this component, we apply a Kullback-Leibler (K-L) divergence technique to examine the discrepancy in the term rankings between real and fake Yelp reviews. This ensemble is used to discriminate fake reviews from real ones, achieving an Average Precision (AP) of 0.5402 and an Area Under the Curve (AOC) of 0.866. which is an improvement over other state-of-the-art techniques on the same YelpNYC dataset.","PeriodicalId":214014,"journal":{"name":"2022 1st International Conference on Information System & Information Technology (ICISIT)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Detecting Fake Yelp Reviews Using a Positional LSTM / K-L Divergence Ensemble Approach\",\"authors\":\"Christopher G. Harris\",\"doi\":\"10.1109/ICISIT54091.2022.9872788\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Online reviews of products and services often have a significant impact on future purchases. Unfortunately, this invites opportunities for fraud –to provide fake reviews to improve a company’s reputation in the eyes of consumers or to disparage the reputation of a competitor. We combine two methods to detect fraud in Yelp restaurant reviews. First, we develop and apply a bi-directional Long-Short Term Memory (LSTM), a type of recurrent neural network, to take advantage of the positional relevancy of comments within reviews. LSTMs are well-suited to examine linguistic features in text and evaluating different regions of text within the review enhances our model’s accuracy. To this component, we apply a Kullback-Leibler (K-L) divergence technique to examine the discrepancy in the term rankings between real and fake Yelp reviews. This ensemble is used to discriminate fake reviews from real ones, achieving an Average Precision (AP) of 0.5402 and an Area Under the Curve (AOC) of 0.866. which is an improvement over other state-of-the-art techniques on the same YelpNYC dataset.\",\"PeriodicalId\":214014,\"journal\":{\"name\":\"2022 1st International Conference on Information System & Information Technology (ICISIT)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 1st International Conference on Information System & Information Technology (ICISIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICISIT54091.2022.9872788\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 1st International Conference on Information System & Information Technology (ICISIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISIT54091.2022.9872788","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

网上对产品和服务的评论通常会对未来的购买产生重大影响。不幸的是,这为欺诈提供了机会——提供虚假评论,以提高公司在消费者心目中的声誉,或贬低竞争对手的声誉。我们结合了两种方法来检测Yelp餐厅评论中的欺诈行为。首先,我们开发并应用了一种双向长短期记忆(LSTM),一种循环神经网络,以利用评论中评论的位置相关性。lstm非常适合检查文本中的语言特征,并且在评论中评估文本的不同区域可以提高我们模型的准确性。对于这个组件,我们应用Kullback-Leibler (K-L)发散技术来检查真实和虚假Yelp评论之间的术语排名差异。该集合用于区分虚假评论和真实评论,实现平均精度(AP)为0.5402,曲线下面积(AOC)为0.866。这是对相同YelpNYC数据集的其他最先进技术的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Detecting Fake Yelp Reviews Using a Positional LSTM / K-L Divergence Ensemble Approach
Online reviews of products and services often have a significant impact on future purchases. Unfortunately, this invites opportunities for fraud –to provide fake reviews to improve a company’s reputation in the eyes of consumers or to disparage the reputation of a competitor. We combine two methods to detect fraud in Yelp restaurant reviews. First, we develop and apply a bi-directional Long-Short Term Memory (LSTM), a type of recurrent neural network, to take advantage of the positional relevancy of comments within reviews. LSTMs are well-suited to examine linguistic features in text and evaluating different regions of text within the review enhances our model’s accuracy. To this component, we apply a Kullback-Leibler (K-L) divergence technique to examine the discrepancy in the term rankings between real and fake Yelp reviews. This ensemble is used to discriminate fake reviews from real ones, achieving an Average Precision (AP) of 0.5402 and an Area Under the Curve (AOC) of 0.866. which is an improvement over other state-of-the-art techniques on the same YelpNYC dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信