Hina Rashid , Hannan Bin Liaqat , Muhammad Usman Sana , Tayybah Kiren , Hanen Karamti , Imran Ashraf
{"title":"Framework for detecting phishing crimes on Twitter using selective features and machine learning","authors":"Hina Rashid , Hannan Bin Liaqat , Muhammad Usman Sana , Tayybah Kiren , Hanen Karamti , Imran Ashraf","doi":"10.1016/j.compeleceng.2025.110363","DOIUrl":null,"url":null,"abstract":"<div><div>Socially aware information technology (SIT) plays a preferential role in facilitating the users for different tasks. Social media phishing is an escalating cybersecurity threat, where attackers employ deceptive tricks to steal personal data. Phishing detection in real-time is crucial, and highly dependent upon the selection of the most relevant features. Exiting literature often depends upon manual or random feature selection leading to inefficiencies in classification results. This research introduces a hybrid machine learning approach to phishing detection based on three feature selection methods Relief, Chi-square, and extra tree classifier for determining the most important features. Five classifiers including Naïve Bayes, support vector machine, decision tree, random forest (RF), and logistic regression are assessed based on accuracy, precision, recall, F1 score, and area under the curve (AUC). Experimental results indicate that RF obtains the highest accuracy of 95.56% and an AUC of 99.00%, better than other models and previous works. The results demonstrate the efficiency of the proposed method in improving phishing detection on social media.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"124 ","pages":"Article 110363"},"PeriodicalIF":4.0000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625003064","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Socially aware information technology (SIT) plays a preferential role in facilitating the users for different tasks. Social media phishing is an escalating cybersecurity threat, where attackers employ deceptive tricks to steal personal data. Phishing detection in real-time is crucial, and highly dependent upon the selection of the most relevant features. Exiting literature often depends upon manual or random feature selection leading to inefficiencies in classification results. This research introduces a hybrid machine learning approach to phishing detection based on three feature selection methods Relief, Chi-square, and extra tree classifier for determining the most important features. Five classifiers including Naïve Bayes, support vector machine, decision tree, random forest (RF), and logistic regression are assessed based on accuracy, precision, recall, F1 score, and area under the curve (AUC). Experimental results indicate that RF obtains the highest accuracy of 95.56% and an AUC of 99.00%, better than other models and previous works. The results demonstrate the efficiency of the proposed method in improving phishing detection on social media.
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.