基于梯度模糊准则的垃圾邮件分类器(GFGSC)

Int. Arab J. Inf. Technol. Pub Date : 2023-01-01 DOI:10.34028/iajit/20/3/12

Vinoth Narayanan Arumugam Subramaniam, Rajesh Annamalai

{"title":"基于梯度模糊准则的垃圾邮件分类器(GFGSC)","authors":"Vinoth Narayanan Arumugam Subramaniam, Rajesh Annamalai","doi":"10.34028/iajit/20/3/12","DOIUrl":null,"url":null,"abstract":"Spam messages have increased dramatically in recent years even as the number of email clients has grown. Email has already become a valuable way of communicating because it saves time and effort. However, numerous emails contain unwelcome content known as spam as a result of social platforms and advertisements. Despite the fact that many techniques have already been created for spam mails categorization, none of them achieves 100 percent efficiency in analyzing spam messages. So, in this research, we propose a novel Gradient Fuzzy Guideline-based Spam Classifier (GFGSC) for classifying the spam e-mails as spam or non-spam. This research uses four types of datasets and these datasets are pre-processed using normalization. Then the set of data can be extracted using Principal Component Analysis (PCA) and Latent Semantic Analysis (LSA) techniques. The aspects are selected using Information Gain (IG) and Chi-Square (ChS) techniques. And the GFGSC classifier can be used for classifying the data as spam or non-spam with better effectiveness. Finally, the performances are examined and these metrics are matched with the existing approaches. The results are obtained using the MATLAB tool.","PeriodicalId":13624,"journal":{"name":"Int. Arab J. Inf. Technol.","volume":"155 1","pages":"398-406"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel spam classification system for e-mail using a gradient fuzzy guideline-based spam classifier (GFGSC)\",\"authors\":\"Vinoth Narayanan Arumugam Subramaniam, Rajesh Annamalai\",\"doi\":\"10.34028/iajit/20/3/12\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Spam messages have increased dramatically in recent years even as the number of email clients has grown. Email has already become a valuable way of communicating because it saves time and effort. However, numerous emails contain unwelcome content known as spam as a result of social platforms and advertisements. Despite the fact that many techniques have already been created for spam mails categorization, none of them achieves 100 percent efficiency in analyzing spam messages. So, in this research, we propose a novel Gradient Fuzzy Guideline-based Spam Classifier (GFGSC) for classifying the spam e-mails as spam or non-spam. This research uses four types of datasets and these datasets are pre-processed using normalization. Then the set of data can be extracted using Principal Component Analysis (PCA) and Latent Semantic Analysis (LSA) techniques. The aspects are selected using Information Gain (IG) and Chi-Square (ChS) techniques. And the GFGSC classifier can be used for classifying the data as spam or non-spam with better effectiveness. Finally, the performances are examined and these metrics are matched with the existing approaches. The results are obtained using the MATLAB tool.\",\"PeriodicalId\":13624,\"journal\":{\"name\":\"Int. Arab J. Inf. Technol.\",\"volume\":\"155 1\",\"pages\":\"398-406\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. Arab J. Inf. Technol.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.34028/iajit/20/3/12\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. Arab J. Inf. Technol.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34028/iajit/20/3/12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

近年来，随着电子邮件客户端数量的增加，垃圾邮件也急剧增加。电子邮件已经成为一种有价值的沟通方式，因为它节省了时间和精力。然而，由于社交平台和广告，许多电子邮件包含不受欢迎的内容，被称为垃圾邮件。尽管已经创建了许多用于垃圾邮件分类的技术，但没有一种技术能够在分析垃圾邮件时达到100%的效率。因此，在本研究中，我们提出了一种新的基于梯度模糊准则的垃圾邮件分类器(GFGSC)，用于将垃圾邮件分类为垃圾邮件或非垃圾邮件。本研究使用了四种类型的数据集，这些数据集使用归一化进行预处理。然后利用主成分分析(PCA)和潜在语义分析(LSA)技术对数据集进行提取。使用信息增益(IG)和卡方(ChS)技术选择这些方面。GFGSC分类器可以有效地对垃圾邮件和非垃圾邮件进行分类。最后，对性能进行了检查，并将这些指标与现有方法进行了匹配。利用MATLAB工具得到了仿真结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A novel spam classification system for e-mail using a gradient fuzzy guideline-based spam classifier (GFGSC)

Spam messages have increased dramatically in recent years even as the number of email clients has grown. Email has already become a valuable way of communicating because it saves time and effort. However, numerous emails contain unwelcome content known as spam as a result of social platforms and advertisements. Despite the fact that many techniques have already been created for spam mails categorization, none of them achieves 100 percent efficiency in analyzing spam messages. So, in this research, we propose a novel Gradient Fuzzy Guideline-based Spam Classifier (GFGSC) for classifying the spam e-mails as spam or non-spam. This research uses four types of datasets and these datasets are pre-processed using normalization. Then the set of data can be extracted using Principal Component Analysis (PCA) and Latent Semantic Analysis (LSA) techniques. The aspects are selected using Information Gain (IG) and Chi-Square (ChS) techniques. And the GFGSC classifier can be used for classifying the data as spam or non-spam with better effectiveness. Finally, the performances are examined and these metrics are matched with the existing approaches. The results are obtained using the MATLAB tool.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Int. Arab J. Inf. Technol.

自引率

0.00%

发文量