Generative adversarial network-based phishing URL detection with variational autoencoder and transformer

IAES International Journal of Artificial Intelligence (IJ-AI) Pub Date : 2024-06-01 DOI:10.11591/ijai.v13.i2.pp2165-2172

Jishnu Kaitholikkal Sasi, Arthi Balakrishnan

{"title":"Generative adversarial network-based phishing URL detection with variational autoencoder and transformer","authors":"Jishnu Kaitholikkal Sasi, Arthi Balakrishnan","doi":"10.11591/ijai.v13.i2.pp2165-2172","DOIUrl":null,"url":null,"abstract":"Phishing attacks pose a constant threat to online security, necessitating the development of efficient tools for identifying malicious URLs. In this article, we propose a novel approach to detect phishing URLs employing a generative adversarial network (GAN) with a variational autoencoder (VAE) as the generator and a transformer model with self-attention as the discriminator. The VAE generator is trained to produce synthetic URLs. In contrast, the transformer discriminator uses its self-attention mechanism to focus on the different parts of the input URLs to extract crucial features. Our model uses adversarial training to distinguish between legitimate and phishing URLs. We evaluate the effectiveness of the proposed method using a large set of one million URLs that incorporate both authentic and phishing URLs. Experimental results show that our model is effective, with an impressive accuracy of 97.75%, outperforming the baseline models. This study significantly improves online security by offering a novel and highly accurate phishing URL detection method.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":"59 15","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IAES International Journal of Artificial Intelligence (IJ-AI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11591/ijai.v13.i2.pp2165-2172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Phishing attacks pose a constant threat to online security, necessitating the development of efficient tools for identifying malicious URLs. In this article, we propose a novel approach to detect phishing URLs employing a generative adversarial network (GAN) with a variational autoencoder (VAE) as the generator and a transformer model with self-attention as the discriminator. The VAE generator is trained to produce synthetic URLs. In contrast, the transformer discriminator uses its self-attention mechanism to focus on the different parts of the input URLs to extract crucial features. Our model uses adversarial training to distinguish between legitimate and phishing URLs. We evaluate the effectiveness of the proposed method using a large set of one million URLs that incorporate both authentic and phishing URLs. Experimental results show that our model is effective, with an impressive accuracy of 97.75%, outperforming the baseline models. This study significantly improves online security by offering a novel and highly accurate phishing URL detection method.

查看原文本刊更多论文

利用变异自动编码器和变换器进行基于生成对抗网络的钓鱼 URL 检测

网络钓鱼攻击对网络安全构成持续威胁，因此有必要开发高效的工具来识别恶意 URL。在本文中，我们提出了一种检测网络钓鱼 URL 的新方法，该方法采用了一种生成式对抗网络 (GAN)，以变异自动编码器 (VAE) 作为生成器，以具有自我关注功能的变压器模型作为判别器。VAE 生成器经过训练，可以生成合成 URL。与此相反，变换器判别器利用其自我注意机制，关注输入 URL 的不同部分，以提取关键特征。我们的模型利用对抗训练来区分合法 URL 和网络钓鱼 URL。我们使用包含真实网址和网络钓鱼网址的 100 万个大型网址集来评估所提出方法的有效性。实验结果表明，我们的模型非常有效，准确率高达 97.75%，优于基线模型。这项研究提供了一种新颖、高度准确的网络钓鱼 URL 检测方法，从而大大提高了网络安全性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IAES International Journal of Artificial Intelligence (IJ-AI)

自引率

0.00%

发文量