SINNER: A Reward-Sensitive Algorithm for Imbalanced Malware Classification Using Neural Networks with Experience Replay

Information Pub Date : 2024-07-23 DOI:10.3390/info15080425

Anthony J. Coscia, Andrea Iannacone, Antonio Maci, Alessandro Stamerra

{"title":"SINNER: A Reward-Sensitive Algorithm for Imbalanced Malware Classification Using Neural Networks with Experience Replay","authors":"Anthony J. Coscia, Andrea Iannacone, Antonio Maci, Alessandro Stamerra","doi":"10.3390/info15080425","DOIUrl":null,"url":null,"abstract":"Reports produced by popular malware analysis services showed a disparity in samples available for different malware families. The unequal distribution between such classes can be attributed to several factors, such as technological advances and the application domain that seeks to infect a computer virus. Recent studies have demonstrated the effectiveness of deep learning (DL) algorithms when learning multi-class classification tasks using imbalanced datasets. This can be achieved by updating the learning function such that correct and incorrect predictions performed on the minority class are more rewarded or penalized, respectively. This procedure can be logically implemented by leveraging the deep reinforcement learning (DRL) paradigm through a proper formulation of the Markov decision process (MDP). This paper proposes SINNER, i.e., a DRL-based multi-class classifier that approaches the data imbalance problem at the algorithmic level by exploiting a redesigned reward function, which modifies the traditional MDP model used to learn this task. Based on the experimental results, the proposed formula appears to be successful. In addition, SINNER has been compared to several DL-based models that can handle class skew without relying on data-level techniques. Using three out of four datasets sourced from the existing literature, the proposed model achieved state-of-the-art classification performance.","PeriodicalId":510156,"journal":{"name":"Information","volume":"137 16","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/info15080425","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Reports produced by popular malware analysis services showed a disparity in samples available for different malware families. The unequal distribution between such classes can be attributed to several factors, such as technological advances and the application domain that seeks to infect a computer virus. Recent studies have demonstrated the effectiveness of deep learning (DL) algorithms when learning multi-class classification tasks using imbalanced datasets. This can be achieved by updating the learning function such that correct and incorrect predictions performed on the minority class are more rewarded or penalized, respectively. This procedure can be logically implemented by leveraging the deep reinforcement learning (DRL) paradigm through a proper formulation of the Markov decision process (MDP). This paper proposes SINNER, i.e., a DRL-based multi-class classifier that approaches the data imbalance problem at the algorithmic level by exploiting a redesigned reward function, which modifies the traditional MDP model used to learn this task. Based on the experimental results, the proposed formula appears to be successful. In addition, SINNER has been compared to several DL-based models that can handle class skew without relying on data-level techniques. Using three out of four datasets sourced from the existing literature, the proposed model achieved state-of-the-art classification performance.

查看原文本刊更多论文

SINNER：利用经验回放神经网络进行不平衡恶意软件分类的奖励敏感算法

流行的恶意软件分析服务机构提供的报告显示，不同恶意软件家族的可用样本存在差异。这些类别之间的不平等分布可归因于几个因素，如技术进步和试图感染计算机病毒的应用领域。最近的研究表明，深度学习（DL）算法在使用不平衡数据集学习多类分类任务时非常有效。这可以通过更新学习函数来实现，从而使对少数类别进行的正确和错误预测分别得到更多奖励或惩罚。利用深度强化学习（DRL）范式，通过对马尔可夫决策过程（MDP）进行适当的表述，可以合乎逻辑地实现这一过程。本文提出了 SINNER，即一种基于 DRL 的多类分类器，它通过利用重新设计的奖励函数，在算法层面上解决了数据不平衡问题，并修改了用于学习这一任务的传统 MDP 模型。根据实验结果，所提出的公式似乎是成功的。此外，SINNER 还与几种基于 DL 的模型进行了比较，这些模型无需依赖数据级技术就能处理类偏斜问题。利用现有文献中的四个数据集中的三个，所提出的模型取得了最先进的分类性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information

自引率

0.00%

发文量