Hybrid Model for Email Spam Prediction Using Random Forest for Feature Extraction

Hardik Saini, K. S. Saini
{"title":"Hybrid Model for Email Spam Prediction Using Random Forest for Feature Extraction","authors":"Hardik Saini, K. S. Saini","doi":"10.1109/ICAIA57370.2023.10169126","DOIUrl":null,"url":null,"abstract":"With the advancement in world wide web, the way to communicate among individuals, via internet, is changed and thus, various platforms become popular such as email. Numerous organizations and people make the deployment of email as major sources of communication. This platform is extensively utilized in spite of alternative means, such as electronic messages, and social networks. However, this technology is more prone to malicious activities. The malicious users target this free mail structure and send a huge number of useless messages, for attaining revenues, or stealing personal data or IDs, to harm its users. Thus, there is necessity to discover the methods for detecting the email spam. The spam is detected in email in different phases in which the data is pre-processed, features are extracted, and the mails are classified. This work introduced a new model to predict the email spam. This approach implements the random forest in order to extract the features. Eventually, the spam is predicted using logistic regression model. The proposed model is implemented in python using anaconda.","PeriodicalId":196526,"journal":{"name":"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIA57370.2023.10169126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the advancement in world wide web, the way to communicate among individuals, via internet, is changed and thus, various platforms become popular such as email. Numerous organizations and people make the deployment of email as major sources of communication. This platform is extensively utilized in spite of alternative means, such as electronic messages, and social networks. However, this technology is more prone to malicious activities. The malicious users target this free mail structure and send a huge number of useless messages, for attaining revenues, or stealing personal data or IDs, to harm its users. Thus, there is necessity to discover the methods for detecting the email spam. The spam is detected in email in different phases in which the data is pre-processed, features are extracted, and the mails are classified. This work introduced a new model to predict the email spam. This approach implements the random forest in order to extract the features. Eventually, the spam is predicted using logistic regression model. The proposed model is implemented in python using anaconda.
基于随机森林特征提取的垃圾邮件预测混合模型
随着万维网的进步,人与人之间的交流方式,通过互联网,改变了,因此,各种平台变得流行,如电子邮件。许多组织和个人将电子邮件作为主要的通信来源。尽管有电子消息和社交网络等替代手段,但该平台仍被广泛使用。然而,这种技术更容易受到恶意活动的攻击。恶意用户以这种免费邮件结构为目标,发送大量无用的信息,以获取收入,或窃取个人数据或id,以伤害其用户。因此,有必要研究垃圾邮件的检测方法。垃圾邮件的检测分为数据预处理、特征提取和邮件分类三个阶段。本文提出了一种新的垃圾邮件预测模型。该方法实现了随机森林来提取特征。最后,利用逻辑回归模型对垃圾邮件进行预测。提出的模型在python中使用anaconda实现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信