{"title":"基于随机森林特征提取的垃圾邮件预测混合模型","authors":"Hardik Saini, K. S. Saini","doi":"10.1109/ICAIA57370.2023.10169126","DOIUrl":null,"url":null,"abstract":"With the advancement in world wide web, the way to communicate among individuals, via internet, is changed and thus, various platforms become popular such as email. Numerous organizations and people make the deployment of email as major sources of communication. This platform is extensively utilized in spite of alternative means, such as electronic messages, and social networks. However, this technology is more prone to malicious activities. The malicious users target this free mail structure and send a huge number of useless messages, for attaining revenues, or stealing personal data or IDs, to harm its users. Thus, there is necessity to discover the methods for detecting the email spam. The spam is detected in email in different phases in which the data is pre-processed, features are extracted, and the mails are classified. This work introduced a new model to predict the email spam. This approach implements the random forest in order to extract the features. Eventually, the spam is predicted using logistic regression model. The proposed model is implemented in python using anaconda.","PeriodicalId":196526,"journal":{"name":"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hybrid Model for Email Spam Prediction Using Random Forest for Feature Extraction\",\"authors\":\"Hardik Saini, K. S. Saini\",\"doi\":\"10.1109/ICAIA57370.2023.10169126\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advancement in world wide web, the way to communicate among individuals, via internet, is changed and thus, various platforms become popular such as email. Numerous organizations and people make the deployment of email as major sources of communication. This platform is extensively utilized in spite of alternative means, such as electronic messages, and social networks. However, this technology is more prone to malicious activities. The malicious users target this free mail structure and send a huge number of useless messages, for attaining revenues, or stealing personal data or IDs, to harm its users. Thus, there is necessity to discover the methods for detecting the email spam. The spam is detected in email in different phases in which the data is pre-processed, features are extracted, and the mails are classified. This work introduced a new model to predict the email spam. This approach implements the random forest in order to extract the features. Eventually, the spam is predicted using logistic regression model. The proposed model is implemented in python using anaconda.\",\"PeriodicalId\":196526,\"journal\":{\"name\":\"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIA57370.2023.10169126\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIA57370.2023.10169126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hybrid Model for Email Spam Prediction Using Random Forest for Feature Extraction
With the advancement in world wide web, the way to communicate among individuals, via internet, is changed and thus, various platforms become popular such as email. Numerous organizations and people make the deployment of email as major sources of communication. This platform is extensively utilized in spite of alternative means, such as electronic messages, and social networks. However, this technology is more prone to malicious activities. The malicious users target this free mail structure and send a huge number of useless messages, for attaining revenues, or stealing personal data or IDs, to harm its users. Thus, there is necessity to discover the methods for detecting the email spam. The spam is detected in email in different phases in which the data is pre-processed, features are extracted, and the mails are classified. This work introduced a new model to predict the email spam. This approach implements the random forest in order to extract the features. Eventually, the spam is predicted using logistic regression model. The proposed model is implemented in python using anaconda.