{"title":"A novel spam classification system for e-mail using a gradient fuzzy guideline-based spam classifier (GFGSC)","authors":"Vinoth Narayanan Arumugam Subramaniam, Rajesh Annamalai","doi":"10.34028/iajit/20/3/12","DOIUrl":null,"url":null,"abstract":"Spam messages have increased dramatically in recent years even as the number of email clients has grown. Email has already become a valuable way of communicating because it saves time and effort. However, numerous emails contain unwelcome content known as spam as a result of social platforms and advertisements. Despite the fact that many techniques have already been created for spam mails categorization, none of them achieves 100 percent efficiency in analyzing spam messages. So, in this research, we propose a novel Gradient Fuzzy Guideline-based Spam Classifier (GFGSC) for classifying the spam e-mails as spam or non-spam. This research uses four types of datasets and these datasets are pre-processed using normalization. Then the set of data can be extracted using Principal Component Analysis (PCA) and Latent Semantic Analysis (LSA) techniques. The aspects are selected using Information Gain (IG) and Chi-Square (ChS) techniques. And the GFGSC classifier can be used for classifying the data as spam or non-spam with better effectiveness. Finally, the performances are examined and these metrics are matched with the existing approaches. The results are obtained using the MATLAB tool.","PeriodicalId":13624,"journal":{"name":"Int. Arab J. Inf. Technol.","volume":"155 1","pages":"398-406"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. Arab J. Inf. Technol.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34028/iajit/20/3/12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Spam messages have increased dramatically in recent years even as the number of email clients has grown. Email has already become a valuable way of communicating because it saves time and effort. However, numerous emails contain unwelcome content known as spam as a result of social platforms and advertisements. Despite the fact that many techniques have already been created for spam mails categorization, none of them achieves 100 percent efficiency in analyzing spam messages. So, in this research, we propose a novel Gradient Fuzzy Guideline-based Spam Classifier (GFGSC) for classifying the spam e-mails as spam or non-spam. This research uses four types of datasets and these datasets are pre-processed using normalization. Then the set of data can be extracted using Principal Component Analysis (PCA) and Latent Semantic Analysis (LSA) techniques. The aspects are selected using Information Gain (IG) and Chi-Square (ChS) techniques. And the GFGSC classifier can be used for classifying the data as spam or non-spam with better effectiveness. Finally, the performances are examined and these metrics are matched with the existing approaches. The results are obtained using the MATLAB tool.