Classification and Prediction of Spam Emails Based on AI Enabling Models Using Deep and Machine Learning Techniques

2022 International Conference on Emerging Technologies in Electronics, Computing and Communication (ICETECC) Pub Date : 2022-12-07 DOI:10.1109/ICETECC56662.2022.10069229

Junaidi Muhammad, Affan Bin Hasan, Muhammad Farrukh

引用次数: 0

Abstract

The increasing volume of unwanted/unsolicited bulk emails, also known as “SPAM,” is a devastating issue that provokes a multitude of problems in communication systems. Over the past few years, the work on spam classification has been tremendously enhanced to a greater extent. In this paper, we present an approach that encompasses machine and deep neural networks such as Gaussian Naive Bayes (GNB), Convolution Neural Networks (CNN) network, Long Short Term Memory (LSTM) network, and a customized model developed with the combination of CNN and LSTM to classify and predict the widely used open source spam assassin dataset that contains around 6000 real email samples. The models are trained and tested, and the results are presented in the paper. Overall, CNN-LSTM attained a prediction score of 98.68% on the spam dataset.

查看原文本刊更多论文

基于深度和机器学习技术的人工智能支持模型的垃圾邮件分类和预测

不需要的/未经请求的批量电子邮件(也称为“SPAM”)数量的增加是一个毁灭性的问题，它在通信系统中引发了许多问题。在过去的几年里，垃圾邮件分类的工作在很大程度上得到了极大的加强。在本文中，我们提出了一种包含机器和深度神经网络(如高斯朴素贝叶斯(GNB)，卷积神经网络(CNN)网络，长短期记忆(LSTM)网络)的方法，以及一种结合CNN和LSTM开发的定制模型，用于分类和预测广泛使用的开源垃圾邮件刺客数据集，该数据集包含大约6000个真实电子邮件样本。本文对模型进行了训练和测试，并给出了结果。总体而言，CNN-LSTM在垃圾邮件数据集上的预测得分为98.68%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 International Conference on Emerging Technologies in Electronics, Computing and Communication (ICETECC)

自引率

0.00%

发文量