Identification and Analysis of Phishing Website based on Machine Learning Methods

M. H. Alkawaz, Stephanie Joanne Steven, Omar Farook Mohammad, Md Gapar Md Johar
{"title":"Identification and Analysis of Phishing Website based on Machine Learning Methods","authors":"M. H. Alkawaz, Stephanie Joanne Steven, Omar Farook Mohammad, Md Gapar Md Johar","doi":"10.1109/iscaie54458.2022.9794467","DOIUrl":null,"url":null,"abstract":"People are increasingly sharing their details online as internet usage grows. Therefore, fraudsters have access to a massive amount of information and financial activities. The attackers create web pages that seem like reputable sites and transmit the malevolent content to victims to get them to provide subtle information. Prevailing phishing security measures are inadequate for detecting new phishing assaults. To accomplish this aim, objective to meet for this research is to analyses and compare phishing website and legitimate by analyzing the data collected from open-source platforms through a survey. Another objective for this research is to propose a method to detect fake sites using Decision Tree and Random Forest approaches. Microsoft Form has been utilized to carry out the survey with 30 participants. Majority of the participants have poor awareness and phishing attack and does not obverse the features of interface before accessing the search browser. With the data collection, this survey supports the purpose of identifying the best phishing website detection where Decision Tree and Random Forest were trained and tested. In achieving high number of feature importance detection and accuracy rate, the result demonstrates that Random Forest has the best performance in phishing website detection compared to Decision Tree.","PeriodicalId":395670,"journal":{"name":"2022 IEEE 12th Symposium on Computer Applications & Industrial Electronics (ISCAIE)","volume":"331 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 12th Symposium on Computer Applications & Industrial Electronics (ISCAIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iscaie54458.2022.9794467","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

People are increasingly sharing their details online as internet usage grows. Therefore, fraudsters have access to a massive amount of information and financial activities. The attackers create web pages that seem like reputable sites and transmit the malevolent content to victims to get them to provide subtle information. Prevailing phishing security measures are inadequate for detecting new phishing assaults. To accomplish this aim, objective to meet for this research is to analyses and compare phishing website and legitimate by analyzing the data collected from open-source platforms through a survey. Another objective for this research is to propose a method to detect fake sites using Decision Tree and Random Forest approaches. Microsoft Form has been utilized to carry out the survey with 30 participants. Majority of the participants have poor awareness and phishing attack and does not obverse the features of interface before accessing the search browser. With the data collection, this survey supports the purpose of identifying the best phishing website detection where Decision Tree and Random Forest were trained and tested. In achieving high number of feature importance detection and accuracy rate, the result demonstrates that Random Forest has the best performance in phishing website detection compared to Decision Tree.
基于机器学习方法的钓鱼网站识别与分析
随着互联网使用量的增长,人们越来越多地在网上分享自己的个人信息。因此,欺诈者可以获得大量的信息和金融活动。攻击者创建看起来像信誉良好的网站的网页,并将恶意内容传输给受害者,让他们提供微妙的信息。现行的网络钓鱼安全措施不足以检测新的网络钓鱼攻击。为了实现这一目标,本研究的目的是通过调查分析从开源平台收集的数据,分析和比较网络钓鱼网站和合法网站。本研究的另一个目的是提出一种使用决策树和随机森林方法检测假站点的方法。本次调查采用微软表单进行,共有30名参与者。大多数参与者对网络钓鱼攻击的认知较差,并且在访问搜索浏览器之前没有针对界面的特性。通过数据收集,本调查支持确定最佳网络钓鱼网站检测的目的,其中决策树和随机森林进行了训练和测试。在获得较高的特征重要性检测次数和准确率的情况下,结果表明随机森林在网络钓鱼网站检测方面的性能优于决策树。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信