Homoglyph Attack Detection Model Using Machine Learning and Hash Function

J. Sens. Actuator Networks Pub Date : 2022-09-16 DOI:10.3390/jsan11030054

A. Almuhaideb, N. Aslam, Almaha Alabdullatif, Sarah Altamimi, Shooq Alothman, Amnah Alhussain, Waad Aldosari, Shikah J. Alsunaidi, K. Alissa

{"title":"Homoglyph Attack Detection Model Using Machine Learning and Hash Function","authors":"A. Almuhaideb, N. Aslam, Almaha Alabdullatif, Sarah Altamimi, Shooq Alothman, Amnah Alhussain, Waad Aldosari, Shikah J. Alsunaidi, K. Alissa","doi":"10.3390/jsan11030054","DOIUrl":null,"url":null,"abstract":"Phishing is still a major security threat in cyberspace. In phishing, attackers steal critical information from victims by presenting a spoofing/fake site that appears to be a visual clone of a legitimate site. Several Unicode characters are visually identical to ASCII characters. This similarity in characters is generally known as homoglyphs. Malicious adversaries utilize homoglyphs in URLs and DNS domains to target organizations. To reduce the risks caused by phishing attacks, effective ways of detecting phishing websites are urgently required. This paper proposes a homoglyph attack detection model that combines a hash function and machine learning. There are two phases to the model approach. The machine was being trained during the development phase. The deployment phase involved deploying the model with a Java interface and testing the outcomes through actual user interaction. The results are more accurate when the URL is hashed, as any little changes to the URL can be recognized. The homoglyph detector can be developed as a stand-alone software that is used as the initial step in requesting a webpage as it enhances browser security and protects websites from phishing attempts. To verify the effectiveness, we compared the proposed model on several criteria to existing phishing detection methods. By using the hash function, the proposed security features increase the overall security of the homoglyph attack detection in terms of accuracy, integrity, and availability. The experiment results showed that the model can detect phishing sites with an accuracy of 99.8% using Random Forest, and the hash function improves the accuracy of homoglyph attack detection.","PeriodicalId":288992,"journal":{"name":"J. Sens. Actuator Networks","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Sens. Actuator Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/jsan11030054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Phishing is still a major security threat in cyberspace. In phishing, attackers steal critical information from victims by presenting a spoofing/fake site that appears to be a visual clone of a legitimate site. Several Unicode characters are visually identical to ASCII characters. This similarity in characters is generally known as homoglyphs. Malicious adversaries utilize homoglyphs in URLs and DNS domains to target organizations. To reduce the risks caused by phishing attacks, effective ways of detecting phishing websites are urgently required. This paper proposes a homoglyph attack detection model that combines a hash function and machine learning. There are two phases to the model approach. The machine was being trained during the development phase. The deployment phase involved deploying the model with a Java interface and testing the outcomes through actual user interaction. The results are more accurate when the URL is hashed, as any little changes to the URL can be recognized. The homoglyph detector can be developed as a stand-alone software that is used as the initial step in requesting a webpage as it enhances browser security and protects websites from phishing attempts. To verify the effectiveness, we compared the proposed model on several criteria to existing phishing detection methods. By using the hash function, the proposed security features increase the overall security of the homoglyph attack detection in terms of accuracy, integrity, and availability. The experiment results showed that the model can detect phishing sites with an accuracy of 99.8% using Random Forest, and the hash function improves the accuracy of homoglyph attack detection.

查看原文本刊更多论文

基于机器学习和哈希函数的同音攻击检测模型

网络钓鱼仍然是网络空间的主要安全威胁。在网络钓鱼中，攻击者通过呈现一个看起来是合法网站的视觉克隆的欺骗/假网站来窃取受害者的关键信息。有几个Unicode字符在视觉上与ASCII字符相同。这种字符上的相似性通常被称为同形异义字。恶意攻击者利用url和DNS域中的同音异义字来攻击组织。为了降低网络钓鱼攻击带来的风险，迫切需要有效的检测网络钓鱼网站的方法。本文提出了一种结合哈希函数和机器学习的同音攻击检测模型。建模方法有两个阶段。这台机器在开发阶段正在接受训练。部署阶段涉及使用Java接口部署模型，并通过实际用户交互测试结果。当对URL进行散列处理时，结果更加准确，因为对URL的任何微小更改都可以被识别出来。同音字型检测器可以开发成一个独立的软件，作为请求网页的第一步，因为它增强了浏览器的安全性，并保护网站免受网络钓鱼的企图。为了验证该模型的有效性，我们将该模型与现有的网络钓鱼检测方法在几个标准上进行了比较。通过使用哈希函数，所提出的安全特性在准确性、完整性和可用性方面提高了同音攻击检测的整体安全性。实验结果表明，该模型使用随机森林检测钓鱼网站的准确率达到99.8%，并且哈希函数提高了同音字形攻击检测的准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

J. Sens. Actuator Networks

自引率

0.00%

发文量