Development of Machine Learning Based Fraudulent Website Detection Scheme

Shyh-Wei Chen, Po-Hsiang Chen, Ching-Tsorng Tsai, Chia-Hui Liu
{"title":"Development of Machine Learning Based Fraudulent Website Detection Scheme","authors":"Shyh-Wei Chen, Po-Hsiang Chen, Ching-Tsorng Tsai, Chia-Hui Liu","doi":"10.1109/ICKII55100.2022.9983523","DOIUrl":null,"url":null,"abstract":"The development of mobile computing and e-commerce has greatly changed traditional transactions and grown online shopping. People are buying and selling goods on websites or social platforms. However, there are many malicious and counterfeit products on fraudulent websites to deceive consumers and make high improper profits. Due to the obvious increase in the number of such fraudulent websites, it is difficult to identify and detect these websites by manual inspection. In order to solve this problem, we propose an intelligent detection mechanism by using a machine learning approach to classify fraudulent websites. We use data set containing 300 legitimate websites, 300 fraudulent websites, and 15 features for training. In two machine learning algorithms, Random Forest and Deep Neural Networks, we divided the training set and the test set in a ratio of 8:2. Finally, the prediction is compared with the previous research results. The experimental results show that the RF accuracy of the random forest algorithm is 99.3% which is better than other deep neural networks algorithms.","PeriodicalId":352222,"journal":{"name":"2022 IEEE 5th International Conference on Knowledge Innovation and Invention (ICKII )","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 5th International Conference on Knowledge Innovation and Invention (ICKII )","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICKII55100.2022.9983523","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The development of mobile computing and e-commerce has greatly changed traditional transactions and grown online shopping. People are buying and selling goods on websites or social platforms. However, there are many malicious and counterfeit products on fraudulent websites to deceive consumers and make high improper profits. Due to the obvious increase in the number of such fraudulent websites, it is difficult to identify and detect these websites by manual inspection. In order to solve this problem, we propose an intelligent detection mechanism by using a machine learning approach to classify fraudulent websites. We use data set containing 300 legitimate websites, 300 fraudulent websites, and 15 features for training. In two machine learning algorithms, Random Forest and Deep Neural Networks, we divided the training set and the test set in a ratio of 8:2. Finally, the prediction is compared with the previous research results. The experimental results show that the RF accuracy of the random forest algorithm is 99.3% which is better than other deep neural networks algorithms.
基于机器学习的欺诈网站检测方案的开发
移动计算和电子商务的发展极大地改变了传统的交易方式,促进了网上购物的发展。人们在网站或社交平台上买卖商品。然而,欺诈网站上有许多恶意假冒产品欺骗消费者,赚取高额不正当利润。由于此类欺诈网站的数量明显增加,很难通过人工检查来识别和检测这些网站。为了解决这个问题,我们提出了一种智能检测机制,利用机器学习的方法对欺诈网站进行分类。我们使用包含300个合法网站、300个欺诈网站和15个特征的数据集进行训练。在随机森林(Random Forest)和深度神经网络(Deep Neural Networks)这两种机器学习算法中,我们将训练集和测试集按8:2的比例进行划分。最后,将预测结果与前人的研究结果进行了比较。实验结果表明,随机森林算法的RF准确率为99.3%,优于其他深度神经网络算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信