Junaidi Muhammad, Affan Bin Hasan, Muhammad Farrukh
{"title":"Classification and Prediction of Spam Emails Based on AI Enabling Models Using Deep and Machine Learning Techniques","authors":"Junaidi Muhammad, Affan Bin Hasan, Muhammad Farrukh","doi":"10.1109/ICETECC56662.2022.10069229","DOIUrl":null,"url":null,"abstract":"The increasing volume of unwanted/unsolicited bulk emails, also known as “SPAM,” is a devastating issue that provokes a multitude of problems in communication systems. Over the past few years, the work on spam classification has been tremendously enhanced to a greater extent. In this paper, we present an approach that encompasses machine and deep neural networks such as Gaussian Naive Bayes (GNB), Convolution Neural Networks (CNN) network, Long Short Term Memory (LSTM) network, and a customized model developed with the combination of CNN and LSTM to classify and predict the widely used open source spam assassin dataset that contains around 6000 real email samples. The models are trained and tested, and the results are presented in the paper. Overall, CNN-LSTM attained a prediction score of 98.68% on the spam dataset.","PeriodicalId":364463,"journal":{"name":"2022 International Conference on Emerging Technologies in Electronics, Computing and Communication (ICETECC)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Emerging Technologies in Electronics, Computing and Communication (ICETECC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETECC56662.2022.10069229","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The increasing volume of unwanted/unsolicited bulk emails, also known as “SPAM,” is a devastating issue that provokes a multitude of problems in communication systems. Over the past few years, the work on spam classification has been tremendously enhanced to a greater extent. In this paper, we present an approach that encompasses machine and deep neural networks such as Gaussian Naive Bayes (GNB), Convolution Neural Networks (CNN) network, Long Short Term Memory (LSTM) network, and a customized model developed with the combination of CNN and LSTM to classify and predict the widely used open source spam assassin dataset that contains around 6000 real email samples. The models are trained and tested, and the results are presented in the paper. Overall, CNN-LSTM attained a prediction score of 98.68% on the spam dataset.