Zero-Day Malware Defence with Limited Samples

Journal of Communications and Information Networks Pub Date : 2024-12-01 DOI:10.23919/JCIN.2024.10820160

Yuanxiang Gong;Chiya Zhang;Yiyi Liu

{"title":"Zero-Day Malware Defence with Limited Samples","authors":"Yuanxiang Gong;Chiya Zhang;Yiyi Liu","doi":"10.23919/JCIN.2024.10820160","DOIUrl":null,"url":null,"abstract":"Zero-day malware refers to a previously unknown or newly discovered type of malware. While most existing studies rely on large malware sample sets, their performance is unknown when dealing with a limited number of samples. This paper addresses this challenge by proposing a novel approach for effective zero-day malware detection, even with a scarcity of known samples. The proposed method begins by visualizing the malware binary and converting it into an entropy image. Subsequently, a deep convolutional generative adversarial network (DCGAN) is employed to learn from the available samples and generate new, highly similar synthetic samples. By combining these generated samples with the real ones, a comprehensive training set is constructed for a convolutional neural network (CNN) classification model. The randomness introduced by DCGAN facilitates the generation of new features, even in the presence of a small sample size. This enables the classifier to learn the characteristics of unknown zero-day malware and enhance its detection capabilities. Extensive experiments validate the effectiveness of the proposed approach, demonstrating that leveraging entropy images as features and applying DCGAN for data augmentation leads to a robust zero-day malware detection system, capable of achieving promising results even with a limited number of samples.","PeriodicalId":100766,"journal":{"name":"Journal of Communications and Information Networks","volume":"9 4","pages":"340-347"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Communications and Information Networks","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10820160/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Zero-day malware refers to a previously unknown or newly discovered type of malware. While most existing studies rely on large malware sample sets, their performance is unknown when dealing with a limited number of samples. This paper addresses this challenge by proposing a novel approach for effective zero-day malware detection, even with a scarcity of known samples. The proposed method begins by visualizing the malware binary and converting it into an entropy image. Subsequently, a deep convolutional generative adversarial network (DCGAN) is employed to learn from the available samples and generate new, highly similar synthetic samples. By combining these generated samples with the real ones, a comprehensive training set is constructed for a convolutional neural network (CNN) classification model. The randomness introduced by DCGAN facilitates the generation of new features, even in the presence of a small sample size. This enables the classifier to learn the characteristics of unknown zero-day malware and enhance its detection capabilities. Extensive experiments validate the effectiveness of the proposed approach, demonstrating that leveraging entropy images as features and applying DCGAN for data augmentation leads to a robust zero-day malware detection system, capable of achieving promising results even with a limited number of samples.

查看原文本刊更多论文

零日恶意软件防御有限样本

零日恶意软件是指以前未知或新发现的恶意软件类型。虽然大多数现有的研究依赖于大型恶意软件样本集，但当处理有限数量的样本时，它们的性能是未知的。本文通过提出一种有效的零日恶意软件检测的新方法来解决这一挑战，即使已知样本稀缺。该方法首先将恶意软件二进制文件可视化，并将其转换为熵图像。随后，使用深度卷积生成对抗网络（DCGAN）从可用样本中学习并生成新的，高度相似的合成样本。将生成的样本与真实样本相结合，构建卷积神经网络（CNN）分类模型的综合训练集。DCGAN引入的随机性有利于新特征的生成，即使在小样本量的情况下也是如此。这使分类器能够学习未知零日恶意软件的特征并增强其检测能力。大量的实验验证了所提出方法的有效性，表明利用熵图像作为特征并应用DCGAN进行数据增强可以产生一个强大的零日恶意软件检测系统，即使在有限数量的样本下也能够获得有希望的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Communications and Information Networks

自引率

0.00%

发文量