E-mail Fraud Detection

Arju Kumar, Saurav Kumar, K. Kumar, Dr. Bharat Bhushan Naib
{"title":"E-mail Fraud Detection","authors":"Arju Kumar, Saurav Kumar, K. Kumar, Dr. Bharat Bhushan Naib","doi":"10.35940/ijese.b7797.0811923","DOIUrl":null,"url":null,"abstract":"Spam issues have become worse on social media platforms and apps with the growth of IoT. To solve the problem, researchers have suggested several spam detection techniques. Spam rates are still high despite the use of anti-spam technologies and tactics, especially given the ubiquity of rogue e-mails that lead to dangerous websites. By using up memory or storage space, spam e-mails may cause servers to run slowly. One of the most essential methods for identifying and eliminating spam is filtering e-mails. To this end, various deep learning and machine learning technologies have been used, including Naive Bayes, decision trees, SVM, and random forest. E-mail and Internet of Things spam filters use various machine learning approaches and systems are categorized in this research. Additionally, as more people use mobile devices and SMS services become more affordable, the issue of spam SMS messages is spreading worldwide. This study suggests using a variety of machine learning approaches to detect and get rid of spam as a solution to this problem. According to the trial findings, the TF-IDF with Random Forest classification algorithm outperformed the other examined algorithms in accuracy %. It is only possible to gauge performance on accuracy since the dataset is imbalanced. Therefore, the algorithms must have good precision, recall, and F-measure.","PeriodicalId":275796,"journal":{"name":"International Journal of Emerging Science and Engineering","volume":"518 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Emerging Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35940/ijese.b7797.0811923","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Spam issues have become worse on social media platforms and apps with the growth of IoT. To solve the problem, researchers have suggested several spam detection techniques. Spam rates are still high despite the use of anti-spam technologies and tactics, especially given the ubiquity of rogue e-mails that lead to dangerous websites. By using up memory or storage space, spam e-mails may cause servers to run slowly. One of the most essential methods for identifying and eliminating spam is filtering e-mails. To this end, various deep learning and machine learning technologies have been used, including Naive Bayes, decision trees, SVM, and random forest. E-mail and Internet of Things spam filters use various machine learning approaches and systems are categorized in this research. Additionally, as more people use mobile devices and SMS services become more affordable, the issue of spam SMS messages is spreading worldwide. This study suggests using a variety of machine learning approaches to detect and get rid of spam as a solution to this problem. According to the trial findings, the TF-IDF with Random Forest classification algorithm outperformed the other examined algorithms in accuracy %. It is only possible to gauge performance on accuracy since the dataset is imbalanced. Therefore, the algorithms must have good precision, recall, and F-measure.
电子邮件欺诈检测
随着物联网的发展,社交媒体平台和应用程序上的垃圾邮件问题变得越来越严重。为了解决这个问题,研究人员提出了几种垃圾邮件检测技术。尽管使用了反垃圾邮件技术和策略,但垃圾邮件率仍然很高,特别是考虑到导致危险网站的流氓电子邮件无处不在。垃圾邮件会耗尽内存或存储空间,从而导致服务器运行缓慢。识别和消除垃圾邮件的最基本方法之一是过滤电子邮件。为此,使用了各种深度学习和机器学习技术,包括朴素贝叶斯、决策树、支持向量机和随机森林。电子邮件和物联网垃圾邮件过滤器使用各种机器学习方法和系统在本研究中进行了分类。此外,随着越来越多的人使用移动设备,短信服务变得越来越便宜,垃圾短信的问题正在全球蔓延。这项研究建议使用各种机器学习方法来检测和摆脱垃圾邮件作为解决这个问题的方法。根据试验结果,随机森林分类算法的TF-IDF在准确率%上优于其他检测算法。由于数据集是不平衡的,所以只能通过准确性来衡量性能。因此,算法必须具有良好的精度、召回率和F-measure。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信