Latent semantic analysis and keyword extraction for phishing classification

2010 IEEE International Conference on Intelligence and Security Informatics Pub Date : 2010-05-23 DOI:10.1109/ISI.2010.5484762

G. L'Huillier, A. Hevia, R. Weber, Sebastián A. Ríos

引用次数: 51

Abstract

Phishing email fraud has been considered as one of the main cyber-threats over the last years. Its development has been closely related to social engineering techniques, where different fraud strategies are used to deceit a naïve email user. In this work, a latent semantic analysis and text mining methodology is proposed for the characterisation of such strategies, and further classification using supervised learning algorithms. Results obtained showed that the feature set obtained in this work is competitive against previous phishing feature extraction methodologies, achieving promising results over different benchmark machine learning classification techniques.

查看原文本刊更多论文

网络钓鱼分类的潜在语义分析与关键词提取

近年来，网络钓鱼电子邮件诈骗被认为是主要的网络威胁之一。它的发展与社会工程技术密切相关，其中使用不同的欺诈策略来欺骗naïve电子邮件用户。在这项工作中，提出了一种潜在语义分析和文本挖掘方法来描述这些策略，并使用监督学习算法进行进一步分类。结果表明，本文所获得的特征集与以前的网络钓鱼特征提取方法相比具有竞争力，在不同的基准机器学习分类技术上取得了令人满意的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 IEEE International Conference on Intelligence and Security Informatics

自引率

0.00%

发文量