A Comparative Approach to Naïve Bayes Classifier and Support Vector Machine for Email Spam Classification

Thae Ma Ma, K. Yamamori, A. Thida
{"title":"A Comparative Approach to Naïve Bayes Classifier and Support Vector Machine for Email Spam Classification","authors":"Thae Ma Ma, K. Yamamori, A. Thida","doi":"10.1109/GCCE50665.2020.9291921","DOIUrl":null,"url":null,"abstract":"Spam or unsolicited emails that are used by spammers can cause huge loss to both the email users and the email server. Therefore, in order to detect spam emails not to enter into our mailbox, a developed email spam classification system is required. This paper proposes two popular machine learning methods, Naïve Bayes Classifier and Support Vector Machine, to classify the emails into spam or ham based on the body or content of the emails. In Naïve Bayes Classifier, independent words are considered as features. Support Vector Machine can be used to represent an email in vector space in which each feature means one dimension. Finally, two methods are compared in terms of precision, recall, F-measure performance metrics with the aim of finding the best method.","PeriodicalId":179456,"journal":{"name":"2020 IEEE 9th Global Conference on Consumer Electronics (GCCE)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 9th Global Conference on Consumer Electronics (GCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GCCE50665.2020.9291921","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

Spam or unsolicited emails that are used by spammers can cause huge loss to both the email users and the email server. Therefore, in order to detect spam emails not to enter into our mailbox, a developed email spam classification system is required. This paper proposes two popular machine learning methods, Naïve Bayes Classifier and Support Vector Machine, to classify the emails into spam or ham based on the body or content of the emails. In Naïve Bayes Classifier, independent words are considered as features. Support Vector Machine can be used to represent an email in vector space in which each feature means one dimension. Finally, two methods are compared in terms of precision, recall, F-measure performance metrics with the aim of finding the best method.
Naïve贝叶斯分类器与支持向量机在垃圾邮件分类中的比较
垃圾邮件发送者使用的垃圾邮件或未经请求的电子邮件会给电子邮件用户和电子邮件服务器造成巨大损失。因此,为了检测不进入我们邮箱的垃圾邮件,需要一个成熟的垃圾邮件分类系统。本文提出了两种流行的机器学习方法,Naïve贝叶斯分类器和支持向量机,根据邮件的正文或内容将邮件分类为垃圾邮件或火腿。在Naïve贝叶斯分类器中,独立的词被认为是特征。支持向量机可以用来表示向量空间中的电子邮件,其中每个特征表示一个维度。最后,对两种方法在精度、召回率、F-measure性能指标方面进行了比较,以期找到最佳方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信