More Semantics More Robust: Improving Android Malware Classifiers

Proceedings of the 9th ACM Conference on Security & Privacy in Wireless and Mobile Networks Pub Date : 2016-07-18 DOI:10.1145/2939918.2939931

Wei Chen, David Aspinall, A. Gordon, Charles Sutton, Igor Muttik

引用次数: 21

Abstract

Automatic malware classifiers often perform badly on the detection of new malware, i.e., their robustness is poor. We study the machine-learning-based mobile malware classifiers and reveal one reason: the input features used by these classifiers can't capture general behavioural patterns of malware instances. We extract the best-performing syntax-based features like permissions and API calls, and some semantics-based features like happen-befores and unwanted behaviours, and train classifiers using popular supervised and semi-supervised learning methods. By comparing their classification performance on industrial datasets collected across several years, we demonstrate that using semantics-based features can dramatically improve robustness of malware classifiers.

查看原文本刊更多论文

更多语义更健壮:改进Android恶意软件分类器

恶意软件自动分类器在检测新恶意软件方面表现不佳，鲁棒性较差。我们研究了基于机器学习的移动恶意软件分类器，并揭示了一个原因:这些分类器使用的输入特征不能捕获恶意软件实例的一般行为模式。我们提取了性能最好的基于语法的特征，如权限和API调用，以及一些基于语义的特征，如发生之前和不想要的行为，并使用流行的监督和半监督学习方法训练分类器。通过比较它们在几年来收集的工业数据集上的分类性能，我们证明使用基于语义的特征可以显着提高恶意软件分类器的鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 9th ACM Conference on Security & Privacy in Wireless and Mobile Networks

自引率

0.00%

发文量