R-MFDroid: Android Malware Detection using Ranked Manifest File Components

Kartik Khariwal, Rishabh Gupta, Jatin P. Singh, Anshul Arora
{"title":"R-MFDroid: Android Malware Detection using Ranked Manifest File Components","authors":"Kartik Khariwal, Rishabh Gupta, Jatin P. Singh, Anshul Arora","doi":"10.35940/IJITEE.G8951.0510721","DOIUrl":null,"url":null,"abstract":"With the increasing fame of Android OS over the past\nfew years, the quantity of malware assaults on Android has\nadditionally expanded. In the year 2018, around 28 million\nmalicious applications were found on the Android platform and\nthese malicious apps were capable of causing huge financial\nlosses and information leakage. Such threats, caused due to these\nmalicious apps, call for a proper detection system for Android\nmalware. There exist some research works that aim to study static\nmanifest components for malware detection. However, to the best\nof our knowledge, none of the previous research works have\naimed to find the best set amongst different manifest file\ncomponents for malware detection. In this work, we focus on\nidentifying the best feature set from manifest file components\n(Permissions, Intents, Hardware Components, Activities, Services,\nBroadcast Receivers, and Content Providers) that could give better\ndetection accuracy. We apply Information Gain to rank the\nmanifest file components intending to find the best set of\ncomponents that can better classify between malware applications\nand benign applications. We put forward a novel algorithm to find\nthe best feature set by using various machine learning classifiers\nlike SVM, XGBoost, and Random Forest along with deep learning\ntechniques like classification using Neural networks. The\nexperimental results highlight that the best set obtained from the\nproposed algorithm consisted of 25 features, i.e., 5 Permissions, 2\nIntents, 9 Activities, 3 Content Providers, 4 Hardware\nComponents, 1 Service, and 1 Broadcast Receiver. The SVM\nclassifier gave the highest classification accuracy of 96.93% and\nan F1-Score of 0.97 with this best set of 25 features.","PeriodicalId":23601,"journal":{"name":"VOLUME-8 ISSUE-10, AUGUST 2019, REGULAR ISSUE","volume":"29 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"VOLUME-8 ISSUE-10, AUGUST 2019, REGULAR ISSUE","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35940/IJITEE.G8951.0510721","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

With the increasing fame of Android OS over the past few years, the quantity of malware assaults on Android has additionally expanded. In the year 2018, around 28 million malicious applications were found on the Android platform and these malicious apps were capable of causing huge financial losses and information leakage. Such threats, caused due to these malicious apps, call for a proper detection system for Android malware. There exist some research works that aim to study static manifest components for malware detection. However, to the best of our knowledge, none of the previous research works have aimed to find the best set amongst different manifest file components for malware detection. In this work, we focus on identifying the best feature set from manifest file components (Permissions, Intents, Hardware Components, Activities, Services, Broadcast Receivers, and Content Providers) that could give better detection accuracy. We apply Information Gain to rank the manifest file components intending to find the best set of components that can better classify between malware applications and benign applications. We put forward a novel algorithm to find the best feature set by using various machine learning classifiers like SVM, XGBoost, and Random Forest along with deep learning techniques like classification using Neural networks. The experimental results highlight that the best set obtained from the proposed algorithm consisted of 25 features, i.e., 5 Permissions, 2 Intents, 9 Activities, 3 Content Providers, 4 Hardware Components, 1 Service, and 1 Broadcast Receiver. The SVM classifier gave the highest classification accuracy of 96.93% and an F1-Score of 0.97 with this best set of 25 features.
R-MFDroid: Android恶意软件检测使用排名清单文件组件
随着Android操作系统在过去几年中声名鹊起,针对Android的恶意软件攻击数量也有所增加。2018年,在安卓平台上发现了大约2800万个恶意应用程序,这些恶意应用程序能够造成巨大的经济损失和信息泄露。这些恶意软件造成的威胁需要一个合适的android恶意软件检测系统。目前已有一些研究工作旨在研究用于恶意软件检测的静态清单组件。然而,据我们所知,之前的研究工作都没有旨在找到恶意软件检测的不同清单文件组件的最佳集合。在这项工作中,我们专注于从清单文件组件(权限、意图、硬件组件、活动、服务、广播接收器和内容提供者)中识别出最好的功能集,这些功能集可以提供更好的检测准确性。我们使用信息增益对清单文件组件进行排序,以找到能够更好地区分恶意应用程序和良性应用程序的最佳组件集。我们提出了一种新的算法,通过使用各种机器学习分类器(如SVM, XGBoost和Random Forest)以及深度学习技术(如使用神经网络分类)来找到最佳特征集。实验结果表明,该算法得到的最佳特征集由25个特征组成,即5个权限、2个意图、9个活动、3个内容提供者、4个硬件组件、1个服务和1个广播接收器。在这25个特征的最佳集合下,svm分类器的最高分类准确率为96.93%,F1-Score为0.97。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信