Junaidi Muhammad, Affan Bin Hasan, Muhammad Farrukh
{"title":"基于深度和机器学习技术的人工智能支持模型的垃圾邮件分类和预测","authors":"Junaidi Muhammad, Affan Bin Hasan, Muhammad Farrukh","doi":"10.1109/ICETECC56662.2022.10069229","DOIUrl":null,"url":null,"abstract":"The increasing volume of unwanted/unsolicited bulk emails, also known as “SPAM,” is a devastating issue that provokes a multitude of problems in communication systems. Over the past few years, the work on spam classification has been tremendously enhanced to a greater extent. In this paper, we present an approach that encompasses machine and deep neural networks such as Gaussian Naive Bayes (GNB), Convolution Neural Networks (CNN) network, Long Short Term Memory (LSTM) network, and a customized model developed with the combination of CNN and LSTM to classify and predict the widely used open source spam assassin dataset that contains around 6000 real email samples. The models are trained and tested, and the results are presented in the paper. Overall, CNN-LSTM attained a prediction score of 98.68% on the spam dataset.","PeriodicalId":364463,"journal":{"name":"2022 International Conference on Emerging Technologies in Electronics, Computing and Communication (ICETECC)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classification and Prediction of Spam Emails Based on AI Enabling Models Using Deep and Machine Learning Techniques\",\"authors\":\"Junaidi Muhammad, Affan Bin Hasan, Muhammad Farrukh\",\"doi\":\"10.1109/ICETECC56662.2022.10069229\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The increasing volume of unwanted/unsolicited bulk emails, also known as “SPAM,” is a devastating issue that provokes a multitude of problems in communication systems. Over the past few years, the work on spam classification has been tremendously enhanced to a greater extent. In this paper, we present an approach that encompasses machine and deep neural networks such as Gaussian Naive Bayes (GNB), Convolution Neural Networks (CNN) network, Long Short Term Memory (LSTM) network, and a customized model developed with the combination of CNN and LSTM to classify and predict the widely used open source spam assassin dataset that contains around 6000 real email samples. The models are trained and tested, and the results are presented in the paper. Overall, CNN-LSTM attained a prediction score of 98.68% on the spam dataset.\",\"PeriodicalId\":364463,\"journal\":{\"name\":\"2022 International Conference on Emerging Technologies in Electronics, Computing and Communication (ICETECC)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Emerging Technologies in Electronics, Computing and Communication (ICETECC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICETECC56662.2022.10069229\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Emerging Technologies in Electronics, Computing and Communication (ICETECC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETECC56662.2022.10069229","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classification and Prediction of Spam Emails Based on AI Enabling Models Using Deep and Machine Learning Techniques
The increasing volume of unwanted/unsolicited bulk emails, also known as “SPAM,” is a devastating issue that provokes a multitude of problems in communication systems. Over the past few years, the work on spam classification has been tremendously enhanced to a greater extent. In this paper, we present an approach that encompasses machine and deep neural networks such as Gaussian Naive Bayes (GNB), Convolution Neural Networks (CNN) network, Long Short Term Memory (LSTM) network, and a customized model developed with the combination of CNN and LSTM to classify and predict the widely used open source spam assassin dataset that contains around 6000 real email samples. The models are trained and tested, and the results are presented in the paper. Overall, CNN-LSTM attained a prediction score of 98.68% on the spam dataset.