Malweb:一个使用机器学习算法的高效恶意网站检测系统

A. E. El-Din, Ezz El-Din Hemdan, A. El-Sayed
{"title":"Malweb:一个使用机器学习算法的高效恶意网站检测系统","authors":"A. E. El-Din, Ezz El-Din Hemdan, A. El-Sayed","doi":"10.1109/ICEEM52022.2021.9480648","DOIUrl":null,"url":null,"abstract":"These days, malware is one of the supreme acknowledged cyber threats. As data volumes increase rapidly, the number of malware threats increases. Malware not only increases in quantities but also becomes smarter and more difficult to detect. Detect malware threats on websites caused by high data traffic, becomes a challenging problem, which must be solved. Moreover, billions of dollars are lost annually due to malicious website scams. Applying analytics to discover new information, predict future malware insights, and make control decisions is a critical process that makes online websites secure. In this research, we propose and analyze a machine learning-based system to detect the behavior of malicious websites based on specific features. With these features, we classify websites as either malicious versus non-malicious. This paper employs a variety of machine learning techniques, including Logistic Regression, Decision Tree, and Naïve Bayes to detect malicious and non-malicious websites, based on various feature selection circumstances to improve results. Applying feature selection with a new way depends on the threshold of categories so Reasonable results are reached with 100% accuracy, recall, and precision when applying Logistic Regression and Decision Tree algorithms while 95% when applying a Naïve Bayes algorithm with an acceptable timing slot.","PeriodicalId":352371,"journal":{"name":"2021 International Conference on Electronic Engineering (ICEEM)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Malweb: An Efficient Malicious Websites Detection System using Machine Learning Algorithms\",\"authors\":\"A. E. El-Din, Ezz El-Din Hemdan, A. El-Sayed\",\"doi\":\"10.1109/ICEEM52022.2021.9480648\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"These days, malware is one of the supreme acknowledged cyber threats. As data volumes increase rapidly, the number of malware threats increases. Malware not only increases in quantities but also becomes smarter and more difficult to detect. Detect malware threats on websites caused by high data traffic, becomes a challenging problem, which must be solved. Moreover, billions of dollars are lost annually due to malicious website scams. Applying analytics to discover new information, predict future malware insights, and make control decisions is a critical process that makes online websites secure. In this research, we propose and analyze a machine learning-based system to detect the behavior of malicious websites based on specific features. With these features, we classify websites as either malicious versus non-malicious. This paper employs a variety of machine learning techniques, including Logistic Regression, Decision Tree, and Naïve Bayes to detect malicious and non-malicious websites, based on various feature selection circumstances to improve results. Applying feature selection with a new way depends on the threshold of categories so Reasonable results are reached with 100% accuracy, recall, and precision when applying Logistic Regression and Decision Tree algorithms while 95% when applying a Naïve Bayes algorithm with an acceptable timing slot.\",\"PeriodicalId\":352371,\"journal\":{\"name\":\"2021 International Conference on Electronic Engineering (ICEEM)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Electronic Engineering (ICEEM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEEM52022.2021.9480648\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Electronic Engineering (ICEEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEEM52022.2021.9480648","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

如今,恶意软件是公认的最高网络威胁之一。随着数据量的快速增长,恶意软件威胁的数量也在不断增加。恶意软件不仅数量增加,而且变得更聪明,更难以检测。检测恶意软件对网站造成的高数据流量威胁,成为一个具有挑战性的问题,必须解决。此外,由于恶意网站诈骗,每年损失数十亿美元。应用分析来发现新信息,预测未来恶意软件的洞察力,并做出控制决策是确保在线网站安全的关键过程。在这项研究中,我们提出并分析了一个基于机器学习的系统,以检测基于特定特征的恶意网站的行为。通过这些功能,我们将网站分为恶意网站和非恶意网站。本文采用多种机器学习技术,包括逻辑回归、决策树和Naïve贝叶斯,基于各种特征选择情况来检测恶意和非恶意网站,以提高结果。以一种新的方式应用特征选择取决于类别的阈值,所以当使用逻辑回归和决策树算法时,达到100%的正确率,召回率和精度,而当使用Naïve贝叶斯算法并具有可接受的时间槽时,达到95%的合理结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Malweb: An Efficient Malicious Websites Detection System using Machine Learning Algorithms
These days, malware is one of the supreme acknowledged cyber threats. As data volumes increase rapidly, the number of malware threats increases. Malware not only increases in quantities but also becomes smarter and more difficult to detect. Detect malware threats on websites caused by high data traffic, becomes a challenging problem, which must be solved. Moreover, billions of dollars are lost annually due to malicious website scams. Applying analytics to discover new information, predict future malware insights, and make control decisions is a critical process that makes online websites secure. In this research, we propose and analyze a machine learning-based system to detect the behavior of malicious websites based on specific features. With these features, we classify websites as either malicious versus non-malicious. This paper employs a variety of machine learning techniques, including Logistic Regression, Decision Tree, and Naïve Bayes to detect malicious and non-malicious websites, based on various feature selection circumstances to improve results. Applying feature selection with a new way depends on the threshold of categories so Reasonable results are reached with 100% accuracy, recall, and precision when applying Logistic Regression and Decision Tree algorithms while 95% when applying a Naïve Bayes algorithm with an acceptable timing slot.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信