基于迭代模式的垃圾邮件检测新方法

2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE) Pub Date : 2012-12-31 DOI:10.1109/ICCKE.2012.6395399

M. Razmara, B. Asadi, M. Narouei, M. Ahmadi

{"title":"基于迭代模式的垃圾邮件检测新方法","authors":"M. Razmara, B. Asadi, M. Narouei, M. Ahmadi","doi":"10.1109/ICCKE.2012.6395399","DOIUrl":null,"url":null,"abstract":"Spamming is becoming a major threat that negatively impacts the usability of e-mail. Although lots of techniques have been proposed for detecting and blocking spam messages, Spammers still spread spam e-mails for different purposes such as advertising, phishing, adult and other purposes and there is not any complete solution for this problem. In this work we present a novel solution toward spam filtering by using a new set of features for classification models. These features are the sequential unique and closed patterns which are extracted from the content of messages. After applying a term selection method, we show that these features have good performance in classifying spam messages from legitimate messages. The achieved results on 6 different datasets show the effectiveness of our proposed method compared to close similar methods. We outperform the accuracy near +2% compared to related state of arts. In addition our method is resilient against injecting irrelevant and bothersome words.","PeriodicalId":154379,"journal":{"name":"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"A novel approach toward spam detection based on iterative patterns\",\"authors\":\"M. Razmara, B. Asadi, M. Narouei, M. Ahmadi\",\"doi\":\"10.1109/ICCKE.2012.6395399\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Spamming is becoming a major threat that negatively impacts the usability of e-mail. Although lots of techniques have been proposed for detecting and blocking spam messages, Spammers still spread spam e-mails for different purposes such as advertising, phishing, adult and other purposes and there is not any complete solution for this problem. In this work we present a novel solution toward spam filtering by using a new set of features for classification models. These features are the sequential unique and closed patterns which are extracted from the content of messages. After applying a term selection method, we show that these features have good performance in classifying spam messages from legitimate messages. The achieved results on 6 different datasets show the effectiveness of our proposed method compared to close similar methods. We outperform the accuracy near +2% compared to related state of arts. In addition our method is resilient against injecting irrelevant and bothersome words.\",\"PeriodicalId\":154379,\"journal\":{\"name\":\"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCKE.2012.6395399\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE.2012.6395399","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

垃圾邮件正在成为对电子邮件可用性产生负面影响的主要威胁。虽然有许多侦测及阻止滥发讯息的技术被提出，但滥发者仍以不同的目的散布滥发电邮，例如广告、网络钓鱼、成人及其他目的，而这个问题并没有完全的解决办法。在这项工作中，我们通过使用一组新的分类模型特征，提出了一种新的垃圾邮件过滤解决方案。这些特征是从消息内容中提取的顺序的、唯一的和封闭的模式。在应用术语选择方法后，我们证明了这些特征在区分垃圾邮件和合法邮件方面具有良好的性能。在6个不同的数据集上取得的结果表明，与相近的方法相比，我们提出的方法是有效的。与相关技术水平相比，我们的准确率接近+2%。此外，我们的方法对注入无关和麻烦的单词具有弹性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A novel approach toward spam detection based on iterative patterns

Spamming is becoming a major threat that negatively impacts the usability of e-mail. Although lots of techniques have been proposed for detecting and blocking spam messages, Spammers still spread spam e-mails for different purposes such as advertising, phishing, adult and other purposes and there is not any complete solution for this problem. In this work we present a novel solution toward spam filtering by using a new set of features for classification models. These features are the sequential unique and closed patterns which are extracted from the content of messages. After applying a term selection method, we show that these features have good performance in classifying spam messages from legitimate messages. The achieved results on 6 different datasets show the effectiveness of our proposed method compared to close similar methods. We outperform the accuracy near +2% compared to related state of arts. In addition our method is resilient against injecting irrelevant and bothersome words.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)

自引率

0.00%

发文量