{"title":"Email Representation using Noncharacteristic Information and its Application","authors":"Pei-yu Liu, Jing Zhao, Zhen-fang Zhu","doi":"10.4156/JCIT.VOL5.ISSUE8.19","DOIUrl":null,"url":null,"abstract":"Focusing on the uncertainty of classifying emails based-on email content and the incompleteness of email representation, the paper proposes a new representation using noncharacteristic information. The new approach refers to the whole email, contains feature items extracted from email content, and noncharacteristic items extracted from email header. In the expriment, we adopt Naive Bayes classifier to classify emails, classification results indicate that the new approach overcomes the shortcomings of original content-based filtering and improves the recall and the precision of spam filtering.","PeriodicalId":360193,"journal":{"name":"J. Convergence Inf. Technol.","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Convergence Inf. Technol.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4156/JCIT.VOL5.ISSUE8.19","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Focusing on the uncertainty of classifying emails based-on email content and the incompleteness of email representation, the paper proposes a new representation using noncharacteristic information. The new approach refers to the whole email, contains feature items extracted from email content, and noncharacteristic items extracted from email header. In the expriment, we adopt Naive Bayes classifier to classify emails, classification results indicate that the new approach overcomes the shortcomings of original content-based filtering and improves the recall and the precision of spam filtering.