M. Shahpasand, Len Hamey, M. Kâafar, Dinusha Vatsalan
{"title":"Feature-Based Adversarial Attacks Against Machine Learnt Mobile Malware Detectors","authors":"M. Shahpasand, Len Hamey, M. Kâafar, Dinusha Vatsalan","doi":"10.1109/ITNAC50341.2020.9315144","DOIUrl":null,"url":null,"abstract":"The success of Machine Learning (ML) techniques in security applications, such as malware detection, is highly criticized for their vulnerability to Adversarial Examples (AE): perturbed input samples (e.g. malware) can mislead ML to produce an adversary’s desired output (e.g. benign class label). AEs against ML models are broadly studied in the computer vision domain where the adversary perturbs the pixel values of an image such that the change is not perceptible, but the resulting image is misclassified by the model. We investigate the effectiveness of attack techniques proposed in the image domain to attack ML classifiers in the context of mobile malware detection. Since the feature vector representation of samples is often used in ML, a simplified evaluation of ML classifiers’ robustness to AEs is to study feature-based attack models, where the adversary perturbs the input features. We compare the methods, trade-offs, and gaps for such attack models and show that generative models (e.g. GANs) outperform a selection of existing attacks in terms of attack success rate but apply large distortion to the original sample. We also describe how we use the generated samples for increasing a classifier’s robustness through adversarial training.","PeriodicalId":131639,"journal":{"name":"2020 30th International Telecommunication Networks and Applications Conference (ITNAC)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 30th International Telecommunication Networks and Applications Conference (ITNAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITNAC50341.2020.9315144","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
Despite the success of Machine Learning (ML) techniques in security applications such as malware detection, they are heavily criticized for their vulnerability to Adversarial Examples (AEs): perturbed input samples (e.g. malware) can mislead an ML model into producing an adversary's desired output (e.g. a benign class label). AEs against ML models are broadly studied in the computer vision domain, where the adversary perturbs the pixel values of an image so that the change is imperceptible yet the resulting image is misclassified by the model. We investigate the effectiveness of attack techniques proposed in the image domain for attacking ML classifiers in the context of mobile malware detection. Since ML commonly operates on a feature-vector representation of samples, a simplified way to evaluate ML classifiers' robustness to AEs is to study feature-based attack models, in which the adversary perturbs the input features. We compare the methods, trade-offs, and gaps of such attack models and show that generative models (e.g. GANs) outperform a selection of existing attacks in terms of attack success rate, but apply larger distortion to the original sample. We also describe how we use the generated samples to increase a classifier's robustness through adversarial training.
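To make the feature-based attack setting concrete, the following is a minimal, hedged sketch (not the paper's implementation, and not its GAN-based attack): a greedy attack that flips absent binary features to present on a toy malware feature vector until a classifier predicts benign, followed by adversarial retraining on the generated sample. All data, names, and the classifier choice (X_train, y_train, clf, greedy_add_attack, RandomForestClassifier) are illustrative assumptions.

```python
# Illustrative sketch of a feature-based evasion attack and adversarial
# training on binary app features (e.g. permissions / API calls).
# Hypothetical setup; not the attack or dataset used in the paper.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

# Toy data: 1000 apps, 50 binary features; label 1 = malware (toy rule).
X_train = rng.integers(0, 2, size=(1000, 50))
y_train = (X_train[:, :5].sum(axis=1) > 2).astype(int)

clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)

def greedy_add_attack(x, clf, max_changes=10):
    """Greedily set absent features to 1 (only adding features, so the
    malware's original functionality is plausibly preserved) until the
    classifier predicts benign or the distortion budget is exhausted."""
    x_adv = x.copy()
    for _ in range(max_changes):
        if clf.predict(x_adv.reshape(1, -1))[0] == 0:
            break  # already evades the detector
        best_j, best_p = None, 1.0
        for j in np.flatnonzero(x_adv == 0):       # candidate features to add
            trial = x_adv.copy()
            trial[j] = 1
            p = clf.predict_proba(trial.reshape(1, -1))[0, 1]  # P(malware)
            if p < best_p:
                best_j, best_p = j, p
        if best_j is None:
            break
        x_adv[best_j] = 1                          # apply the best single flip
    return x_adv

# Attack one malware sample; distortion = number of flipped features.
x_mal = X_train[y_train == 1][0]
x_adv = greedy_add_attack(x_mal, clf)
print("evades detector:", clf.predict(x_adv.reshape(1, -1))[0] == 0,
      "| distortion:", int(np.sum(x_adv != x_mal)))

# Adversarial training: add the adversarial sample, still labelled malware,
# and refit so the classifier learns to resist this perturbation.
X_aug = np.vstack([X_train, x_adv.reshape(1, -1)])
y_aug = np.append(y_train, 1)
clf_robust = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_aug, y_aug)
```

The same attack-success-rate versus distortion trade-off discussed in the abstract shows up here: a larger max_changes budget makes evasion more likely but moves the adversarial sample further from the original feature vector.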