Supervised Learning based E-mail/ SMS Spam Classifier

Satendra Kumar, Raj Kumar, A. Saini
{"title":"Supervised Learning based E-mail/ SMS Spam Classifier","authors":"Satendra Kumar, Raj Kumar, A. Saini","doi":"10.2174/0126662558279046240126051302","DOIUrl":null,"url":null,"abstract":"\n\nOne of the challenging problems facing the modern Internet is spam,\nwhich can annoy individual customers and wreak financial havoc on businesses. Spam communications target customers without their permission and clog their mailboxes. They consume\nmore time and organizational resources when checking for and deleting spam. Even though\nmost web users openly dislike spam, enough are willing to accept lucrative deals that spam remains a real problem. While most web users are well aware of their hatred of spam, the fact\nthat enough of them still click on commercial offers means spammers can still make money\nfrom them. While most customers know what to do, they need clear instructions on avoiding\nand deleting spam. No matter what you do to eliminate spam, you won't succeed. Filtering is\nthe most straightforward and practical technique in spam-blocking strategies.\n\n\n\nWe present procedures for identifying emails as spam or ham based on text classification. Different methods of e-mail organization preprocessing are interrelated, for example, applying stop word exclusion, stemming, including reduction and highlight selection strategies to\nextract buzzwords from each quality, and finally, using unique classifiers to Quarantine messages as spam or ham.\n\n\n\nThe Nave Bayes classifier is a good choice. Some classifiers, such as Simple Logistic\nand Adaboost, perform well. However, the Support Vector Machine Classifier (SVC) outperforms it. Therefore, the SVC makes decisions based on each case's comparisons and perspectives.\n\n\n\nMany spam separation studies have focused on recent classifier-related challenges. Machine Learning (ML) for spam detection is an important area of modern research. Today,\nspam detection using ML is an important area of research. Examine the adequacy of the proposed work and recognize the application of multiple learning estimates to extract spam from\nemails. Similarly, estimates have also been scrutinized.\n","PeriodicalId":506582,"journal":{"name":"Recent Advances in Computer Science and Communications","volume":" 57","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Recent Advances in Computer Science and Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/0126662558279046240126051302","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

One of the challenging problems facing the modern Internet is spam, which can annoy individual customers and wreak financial havoc on businesses. Spam communications target customers without their permission and clog their mailboxes. They consume more time and organizational resources when checking for and deleting spam. Even though most web users openly dislike spam, enough are willing to accept lucrative deals that spam remains a real problem. While most web users are well aware of their hatred of spam, the fact that enough of them still click on commercial offers means spammers can still make money from them. While most customers know what to do, they need clear instructions on avoiding and deleting spam. No matter what you do to eliminate spam, you won't succeed. Filtering is the most straightforward and practical technique in spam-blocking strategies. We present procedures for identifying emails as spam or ham based on text classification. Different methods of e-mail organization preprocessing are interrelated, for example, applying stop word exclusion, stemming, including reduction and highlight selection strategies to extract buzzwords from each quality, and finally, using unique classifiers to Quarantine messages as spam or ham. The Nave Bayes classifier is a good choice. Some classifiers, such as Simple Logistic and Adaboost, perform well. However, the Support Vector Machine Classifier (SVC) outperforms it. Therefore, the SVC makes decisions based on each case's comparisons and perspectives. Many spam separation studies have focused on recent classifier-related challenges. Machine Learning (ML) for spam detection is an important area of modern research. Today, spam detection using ML is an important area of research. Examine the adequacy of the proposed work and recognize the application of multiple learning estimates to extract spam from emails. Similarly, estimates have also been scrutinized.
基于监督学习的电子邮件/短信垃圾邮件分类器
垃圾邮件是现代互联网面临的挑战性问题之一,它不仅会惹恼个人客户,还会给企业造成经济损失。垃圾邮件在未经客户许可的情况下以客户为目标,堵塞了他们的邮箱。在检查和删除垃圾邮件时,它们会消耗更多的时间和组织资源。尽管大多数网络用户公开表示不喜欢垃圾邮件,但他们愿意接受有利可图的交易,因此垃圾邮件仍然是一个现实问题。虽然大多数网络用户都很清楚自己憎恨垃圾邮件,但他们中仍有足够多的人点击商业广告,这意味着垃圾邮件发送者仍能从中牟利。虽然大多数客户知道该怎么做,但他们需要明确的说明来避免和删除垃圾邮件。无论你用什么方法来消除垃圾邮件,都不会成功。在垃圾邮件拦截策略中,过滤是最直接、最实用的技术。我们介绍了基于文本分类识别垃圾邮件或火腿肠邮件的程序。不同的电子邮件组织预处理方法是相互关联的,例如,应用停止词排除、词干处理、包括缩减和高亮选择策略,从每个质量中提取流行词,最后,使用独特的分类器将邮件隔离为垃圾邮件或火腿肠。一些分类器,如 Simple Logistic 和 Adaboost,表现也不错。不过,支持向量机分类器(SVC)的表现要优于它。因此,支持向量机分类器根据每个案例的比较和视角做出决策。用于垃圾邮件检测的机器学习(ML)是现代研究的一个重要领域。如今,使用 ML 进行垃圾邮件检测是一个重要的研究领域。检查提议的工作是否充分,并认识到应用多种学习估计值从邮件中提取垃圾信息的重要性。同样,也对估计值进行了仔细研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信