{"title":"安卓恶意软件检测的特征选择方法研究","authors":"D. Kshirsagar, Pooja Agrawal","doi":"10.1080/02522667.2022.2133218","DOIUrl":null,"url":null,"abstract":"Abstract Feature Selection (FS) provides a vital role in the android malware detection system. The researchers have presented FS methods and tested them on benchmark datasets, including static types of features extracted from applications. This paper studies FS methods used in traditional android malware detection systems. These FS methods are implemented on the benchmark datasets such as Genome project, Google PlayStore, AndroZoo, and Drebin consist of static types of extracted features from applications. These traditional methods are studied and implemented on the latest dataset, such as CIC-MalDroid2020 dataset, which includes the latest types of malware and 470 dynamic types of features. The experimentation is performed on CIC-MalDroid2020 dataset with the Random Forest (RF) classifier using traditional FS methods, and performance is compared with the original feature set. Finally, the investigation with the RF classifier on CIC-MalDroid2020 dataset using the obtained 80 features from 470 original features by the ReliefF method produces higher precision of 97.4647% and a lower FAR of 0.1409 for malware detection.","PeriodicalId":46518,"journal":{"name":"JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES","volume":"43 1","pages":"2111 - 2120"},"PeriodicalIF":1.1000,"publicationDate":"2022-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A study of feature selection methods for android malware detection\",\"authors\":\"D. Kshirsagar, Pooja Agrawal\",\"doi\":\"10.1080/02522667.2022.2133218\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Feature Selection (FS) provides a vital role in the android malware detection system. The researchers have presented FS methods and tested them on benchmark datasets, including static types of features extracted from applications. This paper studies FS methods used in traditional android malware detection systems. These FS methods are implemented on the benchmark datasets such as Genome project, Google PlayStore, AndroZoo, and Drebin consist of static types of extracted features from applications. These traditional methods are studied and implemented on the latest dataset, such as CIC-MalDroid2020 dataset, which includes the latest types of malware and 470 dynamic types of features. The experimentation is performed on CIC-MalDroid2020 dataset with the Random Forest (RF) classifier using traditional FS methods, and performance is compared with the original feature set. Finally, the investigation with the RF classifier on CIC-MalDroid2020 dataset using the obtained 80 features from 470 original features by the ReliefF method produces higher precision of 97.4647% and a lower FAR of 0.1409 for malware detection.\",\"PeriodicalId\":46518,\"journal\":{\"name\":\"JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES\",\"volume\":\"43 1\",\"pages\":\"2111 - 2120\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2022-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/02522667.2022.2133218\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"INFORMATION SCIENCE & LIBRARY SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/02522667.2022.2133218","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
A study of feature selection methods for android malware detection
Abstract Feature Selection (FS) provides a vital role in the android malware detection system. The researchers have presented FS methods and tested them on benchmark datasets, including static types of features extracted from applications. This paper studies FS methods used in traditional android malware detection systems. These FS methods are implemented on the benchmark datasets such as Genome project, Google PlayStore, AndroZoo, and Drebin consist of static types of extracted features from applications. These traditional methods are studied and implemented on the latest dataset, such as CIC-MalDroid2020 dataset, which includes the latest types of malware and 470 dynamic types of features. The experimentation is performed on CIC-MalDroid2020 dataset with the Random Forest (RF) classifier using traditional FS methods, and performance is compared with the original feature set. Finally, the investigation with the RF classifier on CIC-MalDroid2020 dataset using the obtained 80 features from 470 original features by the ReliefF method produces higher precision of 97.4647% and a lower FAR of 0.1409 for malware detection.