MAXS: Scaling Malware Execution with Sequential Multi-Hypothesis Testing

Proceedings of the 11th ACM on Asia Conference on Computer and Communications Security Pub Date : 2016-05-30 DOI:10.1145/2897845.2897873

Phani Vadrevu, R. Perdisci

{"title":"MAXS: Scaling Malware Execution with Sequential Multi-Hypothesis Testing","authors":"Phani Vadrevu, R. Perdisci","doi":"10.1145/2897845.2897873","DOIUrl":null,"url":null,"abstract":"In an attempt to coerce useful information about the behavior of new malware families, threat analysts commonly force newly collected malicious software samples to run within a sandboxed environment. The main goal is to gather intelligence that can later be leveraged to detect and enumerate new malware infections within a network. Currently, most analysis environments \"blindly\" execute each newly collected malware sample for a predetermined amount of time (e.g., four to five minutes). However, a large majority of malware samples that are forced through sandbox execution are simply repackaged versions of previously seen (and already analyzed) malware. Consequently, a significant amount of time may be wasted in analyzing samples that do not generate new intelligence. In this paper, we propose MAXS, a novel probabilistic multi-hypothesis testing framework for scaling execution in malware analysis environments, including bare-metal execution environments. Our main goal is to automatically recognize whether a malware sample that is undergoing dynamic analysis has likely been seen before (e.g., in a \"differently packed\" form), and determine if we could therefore stop its execution early while avoiding loss of valuable malware intelligence (e.g., without missing DNS queries to never-before-seen malware command-and-control domains). We have tested our prototype implementation of MAXS over two large collections of malware execution traces obtained from two distinct production-level analysis environments. Our experimental results show that using MAXS we are able to reduce malware execution time by up to 50% in average, with less than 0.3% information loss. This roughly translates into the ability to double the capacity of malware sandbox environments, thus significantly optimizing the resources dedicated to malware execution and analysis. Our results are particularly important for bare-metal execution environments, in which it is not easy to leverage the economies of scale that characterize virtual-machine or emulation based malware sandboxes. For example, MAXS could be used to significantly cut the cost of bare-metal analysis environments by reducing the hardware resources needed to analyze a predetermined daily number of new malware samples.","PeriodicalId":166633,"journal":{"name":"Proceedings of the 11th ACM on Asia Conference on Computer and Communications Security","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 11th ACM on Asia Conference on Computer and Communications Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2897845.2897873","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

Abstract

In an attempt to coerce useful information about the behavior of new malware families, threat analysts commonly force newly collected malicious software samples to run within a sandboxed environment. The main goal is to gather intelligence that can later be leveraged to detect and enumerate new malware infections within a network. Currently, most analysis environments "blindly" execute each newly collected malware sample for a predetermined amount of time (e.g., four to five minutes). However, a large majority of malware samples that are forced through sandbox execution are simply repackaged versions of previously seen (and already analyzed) malware. Consequently, a significant amount of time may be wasted in analyzing samples that do not generate new intelligence. In this paper, we propose MAXS, a novel probabilistic multi-hypothesis testing framework for scaling execution in malware analysis environments, including bare-metal execution environments. Our main goal is to automatically recognize whether a malware sample that is undergoing dynamic analysis has likely been seen before (e.g., in a "differently packed" form), and determine if we could therefore stop its execution early while avoiding loss of valuable malware intelligence (e.g., without missing DNS queries to never-before-seen malware command-and-control domains). We have tested our prototype implementation of MAXS over two large collections of malware execution traces obtained from two distinct production-level analysis environments. Our experimental results show that using MAXS we are able to reduce malware execution time by up to 50% in average, with less than 0.3% information loss. This roughly translates into the ability to double the capacity of malware sandbox environments, thus significantly optimizing the resources dedicated to malware execution and analysis. Our results are particularly important for bare-metal execution environments, in which it is not easy to leverage the economies of scale that characterize virtual-machine or emulation based malware sandboxes. For example, MAXS could be used to significantly cut the cost of bare-metal analysis environments by reducing the hardware resources needed to analyze a predetermined daily number of new malware samples.

查看原文本刊更多论文

MAXS:扩展恶意软件执行与顺序多假设检验

为了获取有关新恶意软件家族行为的有用信息，威胁分析人员通常会强制新收集的恶意软件样本在沙盒环境中运行。其主要目标是收集情报，以便以后用于检测和列举网络中的新恶意软件感染。目前，大多数分析环境“盲目地”执行每个新收集的恶意软件样本一段预定的时间(例如，4到5分钟)。然而，大多数强制通过沙箱执行的恶意软件样本只是先前看到(和已经分析过)的恶意软件的重新打包版本。因此，大量的时间可能浪费在分析不能产生新智能的样本上。在本文中，我们提出了MAXS，一种新的概率多假设测试框架，用于在恶意软件分析环境中扩展执行，包括裸机执行环境。我们的主要目标是自动识别正在进行动态分析的恶意软件样本是否可能以前见过(例如，以“不同包装”的形式)，并确定我们是否可以因此及早停止其执行，同时避免丢失有价值的恶意软件情报(例如，不会丢失对从未见过的恶意软件命令和控制域的DNS查询)。我们已经在两个不同的生产级分析环境中获得的两个大型恶意软件执行跟踪集上测试了MAXS的原型实现。我们的实验结果表明，使用MAXS，我们能够将恶意软件的执行时间平均减少高达50%，而信息丢失不到0.3%。这大致转化为将恶意软件沙箱环境的容量增加一倍的能力，从而显着优化专用于恶意软件执行和分析的资源。我们的结果对于裸机执行环境尤其重要，在这种环境中，利用虚拟机或基于仿真的恶意软件沙箱的规模经济是不容易的。例如，MAXS可用于通过减少分析每日预定数量的新恶意软件样本所需的硬件资源，从而显著降低裸机分析环境的成本。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 11th ACM on Asia Conference on Computer and Communications Security

自引率

0.00%

发文量