Fresh-Phish:一个自动检测网络钓鱼网站的框架

2017 IEEE International Conference on Information Reuse and Integration (IRI) Pub Date : 2017-08-01 DOI:10.1109/IRI.2017.40

H. Shirazi, Kyle Haefner, I. Ray

{"title":"Fresh-Phish:一个自动检测网络钓鱼网站的框架","authors":"H. Shirazi, Kyle Haefner, I. Ray","doi":"10.1109/IRI.2017.40","DOIUrl":null,"url":null,"abstract":"Denizens of the Internet are coming under a barrage of phishing attacks of increasing frequency and sophistication. Emails accompanied by authentic looking websites are ensnaring users who, unwittingly, hand over their credentials compromising both their privacy and security. Methods such as the blacklisting of these phishing websites become untenable and cannot keep pace with the explosion of fake sites. Detection of nefarious websites must become automated and be able to adapt to this ever evolving form of social engineering. We develop a framework, called \"\"Fresh-Phish\"\", for creating current machine learning data for phishing websites. Using 30 different website features that we query using python, we build a large labeled dataset and analyze several machine learning classifiers against this dataset to determine which is the most accurate. We analyze not just the accuracy of the technique, but also how long it takes to train the model.","PeriodicalId":254330,"journal":{"name":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"Fresh-Phish: A Framework for Auto-Detection of Phishing Websites\",\"authors\":\"H. Shirazi, Kyle Haefner, I. Ray\",\"doi\":\"10.1109/IRI.2017.40\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Denizens of the Internet are coming under a barrage of phishing attacks of increasing frequency and sophistication. Emails accompanied by authentic looking websites are ensnaring users who, unwittingly, hand over their credentials compromising both their privacy and security. Methods such as the blacklisting of these phishing websites become untenable and cannot keep pace with the explosion of fake sites. Detection of nefarious websites must become automated and be able to adapt to this ever evolving form of social engineering. We develop a framework, called \\\"\\\"Fresh-Phish\\\"\\\", for creating current machine learning data for phishing websites. Using 30 different website features that we query using python, we build a large labeled dataset and analyze several machine learning classifiers against this dataset to determine which is the most accurate. We analyze not just the accuracy of the technique, but also how long it takes to train the model.\",\"PeriodicalId\":254330,\"journal\":{\"name\":\"2017 IEEE International Conference on Information Reuse and Integration (IRI)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE International Conference on Information Reuse and Integration (IRI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRI.2017.40\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Information Reuse and Integration (IRI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2017.40","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 21

摘要

互联网用户正在遭受越来越频繁和复杂的网络钓鱼攻击。带有貌似真实的网站的电子邮件会诱使用户在不知情的情况下交出他们的凭证，从而损害他们的隐私和安全。将这些钓鱼网站列入黑名单等方法变得站不住脚，无法跟上虚假网站的爆炸式增长。恶意网站的检测必须自动化，并能够适应这种不断发展的社会工程形式。我们开发了一个名为“Fresh-Phish”的框架，用于为网络钓鱼网站创建当前的机器学习数据。使用python查询的30个不同的网站功能，我们构建了一个大型标记数据集，并针对该数据集分析几个机器学习分类器，以确定哪个最准确。我们不仅分析了技术的准确性，还分析了训练模型所需的时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Fresh-Phish: A Framework for Auto-Detection of Phishing Websites

Denizens of the Internet are coming under a barrage of phishing attacks of increasing frequency and sophistication. Emails accompanied by authentic looking websites are ensnaring users who, unwittingly, hand over their credentials compromising both their privacy and security. Methods such as the blacklisting of these phishing websites become untenable and cannot keep pace with the explosion of fake sites. Detection of nefarious websites must become automated and be able to adapt to this ever evolving form of social engineering. We develop a framework, called ""Fresh-Phish"", for creating current machine learning data for phishing websites. Using 30 different website features that we query using python, we build a large labeled dataset and analyze several machine learning classifiers against this dataset to determine which is the most accurate. We analyze not just the accuracy of the technique, but also how long it takes to train the model.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 IEEE International Conference on Information Reuse and Integration (IRI)

自引率

0.00%

发文量