利用机器学习和深度学习技术对垃圾邮件进行分类

IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal on Information Technologies and Security Pub Date : 2024-06-01 DOI:10.59035/fpko7430

Bandar Alshawi, Amr Munshi, Majid Alotaibi, Ryan Alturki, Nasser Allheeib

{"title":"利用机器学习和深度学习技术对垃圾邮件进行分类","authors":"Bandar Alshawi, Amr Munshi, Majid Alotaibi, Ryan Alturki, Nasser Allheeib","doi":"10.59035/fpko7430","DOIUrl":null,"url":null,"abstract":"Abstract: The Internet and social media networks usage has increased nowadays and become a prominent medium of communicating. Email is one of the professional reliable methods of communication. Automatic classifications of spam emails have become an area of interest. In order to detect spam emails, this study utilizes a dataset, including spam and non-spam emails. Various techniques are applied to obtain higher accuracy using machine learning techniques. NLP is also utilized for improvising accuracy using embeddings. For that, this work utilizes the BERT model, to achieve satisfactory detection of spam emails. Further, the results are compared with state-of-the-art methods, including, KNN, LSTM and Bi-LSTM. The results obtained by Bi-LSTM and LSTM were 97.94% and 86.02%, respectively. The presented methodology is promising in detecting spam emails due to the higher accuracy achieved.","PeriodicalId":42317,"journal":{"name":"International Journal on Information Technologies and Security","volume":null,"pages":null},"PeriodicalIF":1.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classification of SPAM mail utilizing machine learning and deep learning techniques\",\"authors\":\"Bandar Alshawi, Amr Munshi, Majid Alotaibi, Ryan Alturki, Nasser Allheeib\",\"doi\":\"10.59035/fpko7430\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract: The Internet and social media networks usage has increased nowadays and become a prominent medium of communicating. Email is one of the professional reliable methods of communication. Automatic classifications of spam emails have become an area of interest. In order to detect spam emails, this study utilizes a dataset, including spam and non-spam emails. Various techniques are applied to obtain higher accuracy using machine learning techniques. NLP is also utilized for improvising accuracy using embeddings. For that, this work utilizes the BERT model, to achieve satisfactory detection of spam emails. Further, the results are compared with state-of-the-art methods, including, KNN, LSTM and Bi-LSTM. The results obtained by Bi-LSTM and LSTM were 97.94% and 86.02%, respectively. The presented methodology is promising in detecting spam emails due to the higher accuracy achieved.\",\"PeriodicalId\":42317,\"journal\":{\"name\":\"International Journal on Information Technologies and Security\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal on Information Technologies and Security\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.59035/fpko7430\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal on Information Technologies and Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.59035/fpko7430","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

摘要：如今，互联网和社交媒体网络的使用率越来越高，已成为一种重要的沟通媒介。电子邮件是专业可靠的通信方式之一。对垃圾邮件进行自动分类已成为一个备受关注的领域。为了检测垃圾邮件，本研究使用了一个数据集，其中包括垃圾邮件和非垃圾邮件。为了获得更高的准确率，我们使用了机器学习技术来应用各种技术。为了提高准确率，还使用了嵌入式 NLP。为此，这项工作采用了 BERT 模型，以达到令人满意的垃圾邮件检测效果。此外，还将结果与 KNN、LSTM 和 Bi-LSTM 等最先进的方法进行了比较。Bi-LSTM 和 LSTM 的检测结果分别为 97.94% 和 86.02%。由于所达到的准确率较高，因此所介绍的方法在检测垃圾邮件方面前景广阔。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Classification of SPAM mail utilizing machine learning and deep learning techniques

Abstract: The Internet and social media networks usage has increased nowadays and become a prominent medium of communicating. Email is one of the professional reliable methods of communication. Automatic classifications of spam emails have become an area of interest. In order to detect spam emails, this study utilizes a dataset, including spam and non-spam emails. Various techniques are applied to obtain higher accuracy using machine learning techniques. NLP is also utilized for improvising accuracy using embeddings. For that, this work utilizes the BERT model, to achieve satisfactory detection of spam emails. Further, the results are compared with state-of-the-art methods, including, KNN, LSTM and Bi-LSTM. The results obtained by Bi-LSTM and LSTM were 97.94% and 86.02%, respectively. The presented methodology is promising in detecting spam emails due to the higher accuracy achieved.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal on Information Technologies and Security COMPUTER SCIENCE, INFORMATION SYSTEMS-

自引率

66.70%

发文量