Development of Machine Learning Based Fraudulent Website Detection Scheme

2022 IEEE 5th International Conference on Knowledge Innovation and Invention (ICKII ) Pub Date : 2022-07-22 DOI:10.1109/ICKII55100.2022.9983523

Shyh-Wei Chen, Po-Hsiang Chen, Ching-Tsorng Tsai, Chia-Hui Liu

{"title":"Development of Machine Learning Based Fraudulent Website Detection Scheme","authors":"Shyh-Wei Chen, Po-Hsiang Chen, Ching-Tsorng Tsai, Chia-Hui Liu","doi":"10.1109/ICKII55100.2022.9983523","DOIUrl":null,"url":null,"abstract":"The development of mobile computing and e-commerce has greatly changed traditional transactions and grown online shopping. People are buying and selling goods on websites or social platforms. However, there are many malicious and counterfeit products on fraudulent websites to deceive consumers and make high improper profits. Due to the obvious increase in the number of such fraudulent websites, it is difficult to identify and detect these websites by manual inspection. In order to solve this problem, we propose an intelligent detection mechanism by using a machine learning approach to classify fraudulent websites. We use data set containing 300 legitimate websites, 300 fraudulent websites, and 15 features for training. In two machine learning algorithms, Random Forest and Deep Neural Networks, we divided the training set and the test set in a ratio of 8:2. Finally, the prediction is compared with the previous research results. The experimental results show that the RF accuracy of the random forest algorithm is 99.3% which is better than other deep neural networks algorithms.","PeriodicalId":352222,"journal":{"name":"2022 IEEE 5th International Conference on Knowledge Innovation and Invention (ICKII )","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 5th International Conference on Knowledge Innovation and Invention (ICKII )","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICKII55100.2022.9983523","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The development of mobile computing and e-commerce has greatly changed traditional transactions and grown online shopping. People are buying and selling goods on websites or social platforms. However, there are many malicious and counterfeit products on fraudulent websites to deceive consumers and make high improper profits. Due to the obvious increase in the number of such fraudulent websites, it is difficult to identify and detect these websites by manual inspection. In order to solve this problem, we propose an intelligent detection mechanism by using a machine learning approach to classify fraudulent websites. We use data set containing 300 legitimate websites, 300 fraudulent websites, and 15 features for training. In two machine learning algorithms, Random Forest and Deep Neural Networks, we divided the training set and the test set in a ratio of 8:2. Finally, the prediction is compared with the previous research results. The experimental results show that the RF accuracy of the random forest algorithm is 99.3% which is better than other deep neural networks algorithms.

查看原文本刊更多论文

基于机器学习的欺诈网站检测方案的开发

移动计算和电子商务的发展极大地改变了传统的交易方式，促进了网上购物的发展。人们在网站或社交平台上买卖商品。然而，欺诈网站上有许多恶意假冒产品欺骗消费者，赚取高额不正当利润。由于此类欺诈网站的数量明显增加，很难通过人工检查来识别和检测这些网站。为了解决这个问题，我们提出了一种智能检测机制，利用机器学习的方法对欺诈网站进行分类。我们使用包含300个合法网站、300个欺诈网站和15个特征的数据集进行训练。在随机森林(Random Forest)和深度神经网络(Deep Neural Networks)这两种机器学习算法中，我们将训练集和测试集按8:2的比例进行划分。最后，将预测结果与前人的研究结果进行了比较。实验结果表明，随机森林算法的RF准确率为99.3%，优于其他深度神经网络算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE 5th International Conference on Knowledge Innovation and Invention (ICKII )

自引率

0.00%

发文量