SNEL-DFF: Android malware detection using Siamese networks with ensemble learning

IF 2.7 Q2 MULTIDISCIPLINARY SCIENCES

Scientific African Pub Date : 2025-07-22 DOI:10.1016/j.sciaf.2025.e02816

Atif Raza Zaidi , Tahir Abbas , Sadaqat Ali Ramay , Tariq Shahzad , Zahid Hussain Qaisar , Muhammad Adnan Khan , Adnan Abu-Mahfouz , Amin Beheshti

{"title":"SNEL-DFF: Android malware detection using Siamese networks with ensemble learning","authors":"Atif Raza Zaidi , Tahir Abbas , Sadaqat Ali Ramay , Tariq Shahzad , Zahid Hussain Qaisar , Muhammad Adnan Khan , Adnan Abu-Mahfouz , Amin Beheshti","doi":"10.1016/j.sciaf.2025.e02816","DOIUrl":null,"url":null,"abstract":"<div><div>This paper proposes a new model simply known as Siamese Networks of Optimal Ensemble Learning with Deep Forest Feature (SNEL-DFF). The proposed model has the Deep Forest Feature extraction feature because of the complexity that is present in the data and to enhance the proficiency of the detection system. The feature vectors used in this study includes 215 attributes in android applications which are derived from samples sourced from Drebin dataset. Some of the performance evaluation results have been highlighted revealing that the proposed model yielded an accuracy of 0.99. The precision of 0.98 shows its ability to avoid miss-identification of negatives and the recall of 0.99 proves the effectiveness of using it for detection of the real malware samples. The F1 score is 0.99 and ROC-AUC value of 0.99 indicating the model has achieved 99% accuracy which points to the fact that it is balanced and provides a superior performance. These findings vindicate the hypothesis that SNEL-DFF has strong predictive accuracy as compared to the conventional machine learning algorithms. The proposed technique utilizes Siamese networks, deep forest feature enhancement, and ensemble learning, which makes it perform better than its competitors in terms of various evaluation criteria.</div></div>","PeriodicalId":21690,"journal":{"name":"Scientific African","volume":"29 ","pages":"Article e02816"},"PeriodicalIF":2.7000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific African","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468227625002856","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

This paper proposes a new model simply known as Siamese Networks of Optimal Ensemble Learning with Deep Forest Feature (SNEL-DFF). The proposed model has the Deep Forest Feature extraction feature because of the complexity that is present in the data and to enhance the proficiency of the detection system. The feature vectors used in this study includes 215 attributes in android applications which are derived from samples sourced from Drebin dataset. Some of the performance evaluation results have been highlighted revealing that the proposed model yielded an accuracy of 0.99. The precision of 0.98 shows its ability to avoid miss-identification of negatives and the recall of 0.99 proves the effectiveness of using it for detection of the real malware samples. The F1 score is 0.99 and ROC-AUC value of 0.99 indicating the model has achieved 99% accuracy which points to the fact that it is balanced and provides a superior performance. These findings vindicate the hypothesis that SNEL-DFF has strong predictive accuracy as compared to the conventional machine learning algorithms. The proposed technique utilizes Siamese networks, deep forest feature enhancement, and ensemble learning, which makes it perform better than its competitors in terms of various evaluation criteria.

查看原文本刊更多论文

SNEL-DFF：使用集成学习的Siamese网络检测Android恶意软件

本文提出了一种新的模型，简单地称为具有深度森林特征的Siamese网络最优集成学习（SNEL-DFF）。由于数据本身的复杂性，该模型具有深度森林特征提取特征，提高了检测系统的熟练程度。本研究中使用的特征向量包括来自Drebin数据集样本的android应用程序中的215个属性。一些性能评估结果被突出显示，表明所提出的模型产生了0.99的精度。0.98的精度表明其能够避免阴性的误识别，0.99的召回率证明了将其用于检测真实恶意软件样本的有效性。F1分数为0.99，ROC-AUC值为0.99，表明该模型达到了99%的准确率，表明该模型是平衡的，提供了优越的性能。这些发现证实了SNEL-DFF与传统机器学习算法相比具有很强的预测准确性的假设。该技术利用了暹罗网络、深度森林特征增强和集成学习，使其在各种评估标准方面表现优于竞争对手。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊