Title: Novel approach: Naïve Bayes with Vector space model for spam classification
Authors: S. Vahora, Mosin Hasan, R. Lakhani
Venue: 2011 Nirma University International Conference on Engineering
Published: 2011-12-01
DOI: 10.1109/NUICONE.2011.6153245
Citations: 4
Abstract
Legitimate mail often ends up in the spam folder of a mailbox. The mail server classifies mail correctly about 90% of the time, but it sometimes fails because spammers are becoming more technically sophisticated. In this paper, we propose a novel approach that combines a vector space model with Naïve Bayes to classify spam mail. Naïve Bayes is already used for spam classification, but binding it to a personalized word vector increases the accuracy of the system, because each user tends to receive only particular types of message. We obtained nearly 85% accuracy in spam classification. We used personalized mail classification instead of standard global classification because people who frequently visit sites on a particular subject (e.g. pornographic sites) tend to receive spam related only to that subject (pornography), and hence personalization shows improved results.
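
The abstract does not give the feature weighting, smoothing, or training procedure, so the following is only a minimal sketch of how a vector space model might be combined with Naïve Bayes. It assumes term-frequency vectors, a multinomial Naïve Bayes model with add-one (Laplace) smoothing, and a classifier trained on a single user's labelled mail to stand in for the personalized word vector. The function names (`tokenize`, `train`, `classify`) and the toy messages are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def train(messages):
    """Build a model from (text, label) pairs drawn from one user's mailbox.

    Returns per-class message counts, per-class term counts, and the vocabulary.
    """
    doc_counts = Counter()
    term_counts = {}
    vocab = set()
    for text, label in messages:
        doc_counts[label] += 1
        vector = Counter(tokenize(text))  # term-frequency vector for this message
        term_counts.setdefault(label, Counter()).update(vector)
        vocab.update(vector)
    return doc_counts, term_counts, vocab


def classify(text, model):
    """Return the label with the highest log-posterior under multinomial Naive Bayes."""
    doc_counts, term_counts, vocab = model
    total_docs = sum(doc_counts.values())
    vector = Counter(tokenize(text))
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        score = math.log(doc_counts[label] / total_docs)  # class prior
        class_total = sum(term_counts[label].values())
        for term, freq in vector.items():
            if term not in vocab:
                continue  # ignore terms never seen in this user's mail
            # Add-one smoothing (an assumption, not stated in the abstract)
            p = (term_counts[label][term] + 1) / (class_total + len(vocab))
            score += freq * math.log(p)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical per-user training data
training = [
    ("cheap pills buy now", "spam"),
    ("win money now click here", "spam"),
    ("meeting agenda for monday", "ham"),
    ("lunch on monday with the project team", "ham"),
]
model = train(training)
print(classify("buy cheap pills", model))
print(classify("project meeting on monday", model))
```

Because the model is trained per user, the vocabulary and term counts reflect only the kinds of message that user actually receives, which is the intuition behind the personalization argument in the abstract.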