基于随机森林特征提取的垃圾邮件预测混合模型

2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1) Pub Date : 2023-04-21 DOI:10.1109/ICAIA57370.2023.10169126

Hardik Saini, K. S. Saini

{"title":"基于随机森林特征提取的垃圾邮件预测混合模型","authors":"Hardik Saini, K. S. Saini","doi":"10.1109/ICAIA57370.2023.10169126","DOIUrl":null,"url":null,"abstract":"With the advancement in world wide web, the way to communicate among individuals, via internet, is changed and thus, various platforms become popular such as email. Numerous organizations and people make the deployment of email as major sources of communication. This platform is extensively utilized in spite of alternative means, such as electronic messages, and social networks. However, this technology is more prone to malicious activities. The malicious users target this free mail structure and send a huge number of useless messages, for attaining revenues, or stealing personal data or IDs, to harm its users. Thus, there is necessity to discover the methods for detecting the email spam. The spam is detected in email in different phases in which the data is pre-processed, features are extracted, and the mails are classified. This work introduced a new model to predict the email spam. This approach implements the random forest in order to extract the features. Eventually, the spam is predicted using logistic regression model. The proposed model is implemented in python using anaconda.","PeriodicalId":196526,"journal":{"name":"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hybrid Model for Email Spam Prediction Using Random Forest for Feature Extraction\",\"authors\":\"Hardik Saini, K. S. Saini\",\"doi\":\"10.1109/ICAIA57370.2023.10169126\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advancement in world wide web, the way to communicate among individuals, via internet, is changed and thus, various platforms become popular such as email. Numerous organizations and people make the deployment of email as major sources of communication. This platform is extensively utilized in spite of alternative means, such as electronic messages, and social networks. However, this technology is more prone to malicious activities. The malicious users target this free mail structure and send a huge number of useless messages, for attaining revenues, or stealing personal data or IDs, to harm its users. Thus, there is necessity to discover the methods for detecting the email spam. The spam is detected in email in different phases in which the data is pre-processed, features are extracted, and the mails are classified. This work introduced a new model to predict the email spam. This approach implements the random forest in order to extract the features. Eventually, the spam is predicted using logistic regression model. The proposed model is implemented in python using anaconda.\",\"PeriodicalId\":196526,\"journal\":{\"name\":\"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIA57370.2023.10169126\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIA57370.2023.10169126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着万维网的进步，人与人之间的交流方式，通过互联网，改变了，因此，各种平台变得流行，如电子邮件。许多组织和个人将电子邮件作为主要的通信来源。尽管有电子消息和社交网络等替代手段，但该平台仍被广泛使用。然而，这种技术更容易受到恶意活动的攻击。恶意用户以这种免费邮件结构为目标，发送大量无用的信息，以获取收入，或窃取个人数据或id，以伤害其用户。因此，有必要研究垃圾邮件的检测方法。垃圾邮件的检测分为数据预处理、特征提取和邮件分类三个阶段。本文提出了一种新的垃圾邮件预测模型。该方法实现了随机森林来提取特征。最后，利用逻辑回归模型对垃圾邮件进行预测。提出的模型在python中使用anaconda实现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Hybrid Model for Email Spam Prediction Using Random Forest for Feature Extraction

With the advancement in world wide web, the way to communicate among individuals, via internet, is changed and thus, various platforms become popular such as email. Numerous organizations and people make the deployment of email as major sources of communication. This platform is extensively utilized in spite of alternative means, such as electronic messages, and social networks. However, this technology is more prone to malicious activities. The malicious users target this free mail structure and send a huge number of useless messages, for attaining revenues, or stealing personal data or IDs, to harm its users. Thus, there is necessity to discover the methods for detecting the email spam. The spam is detected in email in different phases in which the data is pre-processed, features are extracted, and the mails are classified. This work introduced a new model to predict the email spam. This approach implements the random forest in order to extract the features. Eventually, the spam is predicted using logistic regression model. The proposed model is implemented in python using anaconda.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)

自引率

0.00%

发文量