Spam SMS Classification Using Machine Learning

2023 32nd International Conference on Computer Communications and Networks (ICCCN) Pub Date : 2023-07-01 DOI:10.1109/ICCCN58024.2023.10230203

N. Majd, Mandar Shivaji Hanchate

引用次数: 0

Abstract

Over the past few years, the use of emails and text messages has drastically increased. Short Message Service (SMS) on cellphone providers and related apps, like Whatsapp, is one of the best and fastest ways to communicate among users. SMSs are used and sent globally for personal and business purposes. However, alongside safe SMSs, the users may receive fraudulent Spam SMSs, which could cause security issues and inconvenient for the users. Numerous Spam messages are being sent daily for both personal and professional benefits. Accurately identifying Spam SMS is a challenge. The objective of this research is to build a model utilizing machine learning and deep learning to understand the semantics of SMSs and classify them to either Spam or non-Spam (Ham). We used a pre-trained BERT model and combined it with several machine learning and deep learning models. The results indicated that BERT+SVC and BERT+BiLSTM performed the best with 99.10% and 99.19% accuracies respectively on the test dataset.

查看原文本刊更多论文

垃圾短信分类使用机器学习

在过去的几年里，电子邮件和短信的使用急剧增加。手机提供商和Whatsapp等相关应用程序提供的短信服务(SMS)是用户之间最好、最快的沟通方式之一。短信可以在全球范围内使用和发送，用于个人和商业目的。但是，在收到安全短信的同时，用户可能会收到诈骗的垃圾短信，这可能会给用户带来安全问题和不便。每天都有大量的垃圾邮件被发送给个人和职业利益。准确识别垃圾短信是一个挑战。本研究的目的是建立一个利用机器学习和深度学习的模型来理解短信的语义，并将其分类为垃圾邮件或非垃圾邮件(Ham)。我们使用了一个预训练的BERT模型，并将其与几个机器学习和深度学习模型相结合。结果表明，BERT+SVC和BERT+BiLSTM在测试数据集上表现最好，准确率分别为99.10%和99.19%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 32nd International Conference on Computer Communications and Networks (ICCCN)

自引率

0.00%

发文量