S. Zaman, Shekh Minhaz Uddin Deep, Zul Kawsar, Md. Ashaduzzaman, Ahmed Iqbal Pritom
{"title":"Phishing Website Detection Using Effective Classifiers and Feature Selection Techniques","authors":"S. Zaman, Shekh Minhaz Uddin Deep, Zul Kawsar, Md. Ashaduzzaman, Ahmed Iqbal Pritom","doi":"10.1109/ICIET48527.2019.9290554","DOIUrl":null,"url":null,"abstract":"Phishing is a relatively new form of network assault where a web page illegally invokes current users to request financial or personal data or passwords. This act jeopardizes the privacy of many users and consequently, ongoing research has been carried out to find detection tools and to develop existing solutions. Classifiers based on machine learning can be used to detect phishing websites effectively and therefore, various machine learning classification algorithms i.e. Naive Bayes, J48 and HNB are implemented and compared through this research. In addition, performance of a classifier combining HNB and J48 was also closely observed as a solution to the stated problem. The study proposes a novel manual feature selection approach and presents a comparative study with Filter method feature selection techniques. The dataset used in this research is collected from UCI machine learning repository, has 2670 instances and 30 attributes of website structure. The empirical result indicated that the Address bar based feature group achieved the highest accuracy in detecting phishing website. In addition, two top algorithms, HNB and J48, were developed for an integrated multi-classified process. The findings have shown that combining techniques results in 96.25% accuracy in the identification of phishing websites for all apps.","PeriodicalId":427838,"journal":{"name":"2019 2nd International Conference on Innovation in Engineering and Technology (ICIET)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 2nd International Conference on Innovation in Engineering and Technology (ICIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIET48527.2019.9290554","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Phishing is a relatively new form of network assault where a web page illegally invokes current users to request financial or personal data or passwords. This act jeopardizes the privacy of many users and consequently, ongoing research has been carried out to find detection tools and to develop existing solutions. Classifiers based on machine learning can be used to detect phishing websites effectively and therefore, various machine learning classification algorithms i.e. Naive Bayes, J48 and HNB are implemented and compared through this research. In addition, performance of a classifier combining HNB and J48 was also closely observed as a solution to the stated problem. The study proposes a novel manual feature selection approach and presents a comparative study with Filter method feature selection techniques. The dataset used in this research is collected from UCI machine learning repository, has 2670 instances and 30 attributes of website structure. The empirical result indicated that the Address bar based feature group achieved the highest accuracy in detecting phishing website. In addition, two top algorithms, HNB and J48, were developed for an integrated multi-classified process. The findings have shown that combining techniques results in 96.25% accuracy in the identification of phishing websites for all apps.