Spam Detection Based on Nearest Community Classifier

Michal Prilepok, M. Kudelka
Published in: 2015 International Conference on Intelligent Networking and Collaborative Systems
Publication date: 2015-09-02
DOI: 10.1109/INCoS.2015.75 (https://doi.org/10.1109/INCoS.2015.75)
Citations: 3

Abstract

Unwanted email (spam) is a growing problem, not only for users but also for Internet service providers, so the design of new spam-detection algorithms is currently an active research topic. We define two requirements and consider them together. The first is a low rate of falsely detected emails, which affects the algorithm's performance. The second is fast detection of spam, which minimizes the delay in receiving email. In this paper, we focus on the first requirement. To address it, we apply network community analysis: the approach is to find communities, that is, groups of emails of the same kind. We present a new nearest community classifier and apply it to spam detection. The results are very close to those of a Bayesian spam filter. We achieved 93.78% accuracy; the algorithm detects 80.72% of spam emails and 98.01% of non-spam emails.
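The abstract does not say how emails are compared or how communities are found, so the following is only a minimal sketch of the general idea of a nearest community classifier, not the authors' method. It assumes Jaccard similarity over word sets, treats connected components of a thresholded similarity graph as communities, labels each community by majority vote, and assigns a new email to the community with the highest average similarity. The function names, the threshold value, and the toy data are all hypothetical.

```python
from collections import Counter


def tokens(text):
    """Lowercased set of whitespace-separated words (assumed representation)."""
    return set(text.lower().split())


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def find_communities(token_sets, threshold):
    """Group emails into connected components of the similarity graph.

    Two emails are linked when their Jaccard similarity is >= threshold.
    Connected components stand in for a real community detection method.
    """
    parent = list(range(len(token_sets)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(token_sets)):
        for j in range(i + 1, len(token_sets)):
            if jaccard(token_sets[i], token_sets[j]) >= threshold:
                parent[find(i)] = find(j)

    groups = {}
    for i in range(len(token_sets)):
        groups.setdefault(find(i), []).append(i)
    return list(groups.values())


def train(emails, labels, threshold=0.3):
    """Return a list of (majority_label, member_token_sets), one per community."""
    token_sets = [tokens(e) for e in emails]
    model = []
    for members in find_communities(token_sets, threshold):
        majority = Counter(labels[i] for i in members).most_common(1)[0][0]
        model.append((majority, [token_sets[i] for i in members]))
    return model


def classify(model, email):
    """Label an email with the majority label of its nearest community."""
    query = tokens(email)

    def closeness(community):
        _, members = community
        return sum(jaccard(query, m) for m in members) / len(members)

    return max(model, key=closeness)[0]


# Toy data, invented purely for illustration.
emails = [
    "win free money now",
    "free money win prize now",
    "meeting agenda for monday",
    "monday meeting agenda attached",
]
labels = ["spam", "spam", "ham", "ham"]
model = train(emails, labels)
print(classify(model, "claim your free prize money now"))
print(classify(model, "agenda for the monday meeting"))
```

Comparing a new email against communities rather than against every individual training email is what distinguishes this from a plain nearest-neighbour rule; the pairwise graph construction here is quadratic in the number of training emails, which is acceptable only for a small illustration.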