使用监督机器学习的虚假评论过滤系统

Deepanshu Jain, Sayam Kumar, Yashika Goyal
{"title":"使用监督机器学习的虚假评论过滤系统","authors":"Deepanshu Jain, Sayam Kumar, Yashika Goyal","doi":"10.1109/ICDSIS55133.2022.9915878","DOIUrl":null,"url":null,"abstract":"As the surge in internet users is expanding prominently, the role of the online reviewing system is also rising. For companies, the legitimacy of internet evaluations is critical, as it can directly impact their reputation and profitability. It plays an indispensable role in influencing people’s perceptions of a product or service. This research projects light on the best technique to identify and filter out authentic reviews while proposing a flexible and user-friendly website. The website will have a tremendous sway on customers and will assist them in making a better judgment about a product/service. The website is deployed with the designed supervised learning model. Firstly, the user will have to enter the URL of the website where the product is located. After which, the dataset is extracted from the given URL using Python tools for Web Scraping. The data is then analyzed and dissected using Natural Language Processing techniques to extract sound features from it. Ultimately, different Machine Learning Models are further trained on the dataset. The experimental results of this research reveal that the model performs at an accuracy of 89.12% on the datasets. The major objective of this research is to provide a fake review filtering system that will provide users with more reliable review information and eliminate revenue loss of the companies at an exponential rate.","PeriodicalId":178360,"journal":{"name":"2022 IEEE International Conference on Data Science and Information System (ICDSIS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Fake Reviews Filtering System Using Supervised Machine Learning\",\"authors\":\"Deepanshu Jain, Sayam Kumar, Yashika Goyal\",\"doi\":\"10.1109/ICDSIS55133.2022.9915878\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the surge in internet users is expanding prominently, the role of the online reviewing system is also rising. For companies, the legitimacy of internet evaluations is critical, as it can directly impact their reputation and profitability. It plays an indispensable role in influencing people’s perceptions of a product or service. This research projects light on the best technique to identify and filter out authentic reviews while proposing a flexible and user-friendly website. The website will have a tremendous sway on customers and will assist them in making a better judgment about a product/service. The website is deployed with the designed supervised learning model. Firstly, the user will have to enter the URL of the website where the product is located. After which, the dataset is extracted from the given URL using Python tools for Web Scraping. The data is then analyzed and dissected using Natural Language Processing techniques to extract sound features from it. Ultimately, different Machine Learning Models are further trained on the dataset. The experimental results of this research reveal that the model performs at an accuracy of 89.12% on the datasets. The major objective of this research is to provide a fake review filtering system that will provide users with more reliable review information and eliminate revenue loss of the companies at an exponential rate.\",\"PeriodicalId\":178360,\"journal\":{\"name\":\"2022 IEEE International Conference on Data Science and Information System (ICDSIS)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Data Science and Information System (ICDSIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSIS55133.2022.9915878\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Data Science and Information System (ICDSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSIS55133.2022.9915878","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

随着网民数量的显著增长,网络评论系统的作用也在不断提升。对于企业来说,互联网评价的合法性至关重要,因为它可以直接影响企业的声誉和盈利能力。它在影响人们对产品或服务的看法方面起着不可或缺的作用。这项研究项目在提出一个灵活和用户友好的网站的同时,揭示了识别和过滤真实评论的最佳技术。网站将对客户产生巨大的影响,并将帮助他们对产品/服务做出更好的判断。网站部署了设计好的监督学习模型。首先,用户必须输入产品所在网站的URL。之后,使用用于Web抓取的Python工具从给定的URL提取数据集。然后使用自然语言处理技术对数据进行分析和剖析,从中提取声音特征。最终,在数据集上进一步训练不同的机器学习模型。本研究的实验结果表明,该模型在数据集上的准确率为89.12%。本研究的主要目的是提供一个虚假评论过滤系统,为用户提供更可靠的评论信息,并以指数速度消除公司的收入损失。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Fake Reviews Filtering System Using Supervised Machine Learning
As the surge in internet users is expanding prominently, the role of the online reviewing system is also rising. For companies, the legitimacy of internet evaluations is critical, as it can directly impact their reputation and profitability. It plays an indispensable role in influencing people’s perceptions of a product or service. This research projects light on the best technique to identify and filter out authentic reviews while proposing a flexible and user-friendly website. The website will have a tremendous sway on customers and will assist them in making a better judgment about a product/service. The website is deployed with the designed supervised learning model. Firstly, the user will have to enter the URL of the website where the product is located. After which, the dataset is extracted from the given URL using Python tools for Web Scraping. The data is then analyzed and dissected using Natural Language Processing techniques to extract sound features from it. Ultimately, different Machine Learning Models are further trained on the dataset. The experimental results of this research reveal that the model performs at an accuracy of 89.12% on the datasets. The major objective of this research is to provide a fake review filtering system that will provide users with more reliable review information and eliminate revenue loss of the companies at an exponential rate.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信