Spam SMS Classification Using Machine Learning

N. Majd, Mandar Shivaji Hanchate
{"title":"Spam SMS Classification Using Machine Learning","authors":"N. Majd, Mandar Shivaji Hanchate","doi":"10.1109/ICCCN58024.2023.10230203","DOIUrl":null,"url":null,"abstract":"Over the past few years, the use of emails and text messages has drastically increased. Short Message Service (SMS) on cellphone providers and related apps, like Whatsapp, is one of the best and fastest ways to communicate among users. SMSs are used and sent globally for personal and business purposes. However, alongside safe SMSs, the users may receive fraudulent Spam SMSs, which could cause security issues and inconvenient for the users. Numerous Spam messages are being sent daily for both personal and professional benefits. Accurately identifying Spam SMS is a challenge. The objective of this research is to build a model utilizing machine learning and deep learning to understand the semantics of SMSs and classify them to either Spam or non-Spam (Ham). We used a pre-trained BERT model and combined it with several machine learning and deep learning models. The results indicated that BERT+SVC and BERT+BiLSTM performed the best with 99.10% and 99.19% accuracies respectively on the test dataset.","PeriodicalId":132030,"journal":{"name":"2023 32nd International Conference on Computer Communications and Networks (ICCCN)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 32nd International Conference on Computer Communications and Networks (ICCCN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCN58024.2023.10230203","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Over the past few years, the use of emails and text messages has drastically increased. Short Message Service (SMS) on cellphone providers and related apps, like Whatsapp, is one of the best and fastest ways to communicate among users. SMSs are used and sent globally for personal and business purposes. However, alongside safe SMSs, the users may receive fraudulent Spam SMSs, which could cause security issues and inconvenient for the users. Numerous Spam messages are being sent daily for both personal and professional benefits. Accurately identifying Spam SMS is a challenge. The objective of this research is to build a model utilizing machine learning and deep learning to understand the semantics of SMSs and classify them to either Spam or non-Spam (Ham). We used a pre-trained BERT model and combined it with several machine learning and deep learning models. The results indicated that BERT+SVC and BERT+BiLSTM performed the best with 99.10% and 99.19% accuracies respectively on the test dataset.
垃圾短信分类使用机器学习
在过去的几年里,电子邮件和短信的使用急剧增加。手机提供商和Whatsapp等相关应用程序提供的短信服务(SMS)是用户之间最好、最快的沟通方式之一。短信可以在全球范围内使用和发送,用于个人和商业目的。但是,在收到安全短信的同时,用户可能会收到诈骗的垃圾短信,这可能会给用户带来安全问题和不便。每天都有大量的垃圾邮件被发送给个人和职业利益。准确识别垃圾短信是一个挑战。本研究的目的是建立一个利用机器学习和深度学习的模型来理解短信的语义,并将其分类为垃圾邮件或非垃圾邮件(Ham)。我们使用了一个预训练的BERT模型,并将其与几个机器学习和深度学习模型相结合。结果表明,BERT+SVC和BERT+BiLSTM在测试数据集上表现最好,准确率分别为99.10%和99.19%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信