Understanding Complex Malware

2021 IEEE Integrated STEM Education Conference (ISEC) Pub Date : 2021-03-13 DOI:10.1109/ISEC52395.2021.9763932

Daniel Edis, Taylor Hayman, A. Vatsa

{"title":"Understanding Complex Malware","authors":"Daniel Edis, Taylor Hayman, A. Vatsa","doi":"10.1109/ISEC52395.2021.9763932","DOIUrl":null,"url":null,"abstract":"With the surge of cybercrime and contribution of malware (malicious software) attacks in cybercrime, there is need to design a smart and deep engine-based Intrusion Detection Systems (IDS). The malware could be virous, worm, trojan, etc. and their behaviors are dynamic and static in nature. IDS may monitor events and activity of malware and classify them in order that prediction of potential attacks can be made for users’ sensitive data and associated computational resources. Moreover, the false positive rate alarming of IDS systems is high. Therefore, there is need to reconsider the design of IDS systems, increase its detection accuracy, and elevate prediction of vulnerable attacks. Further, new IDS must capable to deal with nonlinear behavior of malware datasets and model must have better fitting ability. Therefore, in order to protect and avoid vulnerable attacks, we would like to contribute an implementation of a deep learning algorithm - Extreme Gradient Boosting (XGBoost) and Recurrent Neural Network (RNN) - on Microsoft Malware Classification Dataset (BIG 2015) datasets. Also, BIG 2015 raw datasets will be preprocessed and resized to make the data compatible to these algorithms. Moreover, the performance of these algorithms will be measures and compared using these parameters - Accuracy, Precision, Recall, F1 score, Loss, True Positives, True Negatives, False Positives, and False Negatives, and receiver operating characteristic (ROC) curve by calculating the AUC (the area under the ROC curve).","PeriodicalId":329844,"journal":{"name":"2021 IEEE Integrated STEM Education Conference (ISEC)","volume":"2010 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Integrated STEM Education Conference (ISEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISEC52395.2021.9763932","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

With the surge of cybercrime and contribution of malware (malicious software) attacks in cybercrime, there is need to design a smart and deep engine-based Intrusion Detection Systems (IDS). The malware could be virous, worm, trojan, etc. and their behaviors are dynamic and static in nature. IDS may monitor events and activity of malware and classify them in order that prediction of potential attacks can be made for users’ sensitive data and associated computational resources. Moreover, the false positive rate alarming of IDS systems is high. Therefore, there is need to reconsider the design of IDS systems, increase its detection accuracy, and elevate prediction of vulnerable attacks. Further, new IDS must capable to deal with nonlinear behavior of malware datasets and model must have better fitting ability. Therefore, in order to protect and avoid vulnerable attacks, we would like to contribute an implementation of a deep learning algorithm - Extreme Gradient Boosting (XGBoost) and Recurrent Neural Network (RNN) - on Microsoft Malware Classification Dataset (BIG 2015) datasets. Also, BIG 2015 raw datasets will be preprocessed and resized to make the data compatible to these algorithms. Moreover, the performance of these algorithms will be measures and compared using these parameters - Accuracy, Precision, Recall, F1 score, Loss, True Positives, True Negatives, False Positives, and False Negatives, and receiver operating characteristic (ROC) curve by calculating the AUC (the area under the ROC curve).

查看原文本刊更多论文

了解复杂的恶意软件

随着网络犯罪的激增和恶意软件攻击在网络犯罪中的贡献，需要设计一种基于智能深度引擎的入侵检测系统(IDS)。恶意软件可以是病毒、蠕虫、木马等，其行为本质上有动态和静态之分。IDS可以监视恶意软件的事件和活动并对其进行分类，以便对用户的敏感数据和相关计算资源进行潜在攻击的预测。此外，IDS系统的误报率报警较高。因此，需要重新考虑IDS系统的设计，提高其检测精度，提高对脆弱攻击的预测能力。此外，新的入侵检测系统必须能够处理恶意软件数据集的非线性行为，并且模型必须具有更好的拟合能力。因此，为了保护和避免易受攻击，我们想在微软恶意软件分类数据集(BIG 2015)数据集上贡献一种深度学习算法-极端梯度增强(XGBoost)和循环神经网络(RNN)的实现。此外，BIG 2015原始数据集将被预处理和调整大小，使数据与这些算法兼容。此外，这些算法的性能将使用以下参数进行测量和比较-准确性，精度，召回率，F1分数，损失，真阳性，真阴性，假阳性和假阴性，以及通过计算AUC (ROC曲线下的面积)计算接收者工作特征(ROC)曲线。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE Integrated STEM Education Conference (ISEC)

自引率

0.00%

发文量