FEDetect: A Federated Learning-Based Malware Detection and Classification Using Deep Neural Network Algorithms

IF 2.9 4区综合性期刊 Q2 MULTIDISCIPLINARY SCIENCES

Arabian Journal for Science and Engineering Pub Date : 2025-03-17 DOI:10.1007/s13369-025-10043-x

Zeki Çıplak, Kazım Yıldız, Şahsene Altınkaya

{"title":"FEDetect: A Federated Learning-Based Malware Detection and Classification Using Deep Neural Network Algorithms","authors":"Zeki Çıplak, Kazım Yıldız, Şahsene Altınkaya","doi":"10.1007/s13369-025-10043-x","DOIUrl":null,"url":null,"abstract":"<div><p>The growing importance of data security in modern information systems extends beyond the preventing malicious software and includes the critical topic of data privacy. Centralized data processing in traditional machine learning methods presents significant challenges, including greater risk of data breaches and attacks on centralized systems. This study addresses the critical issue of maintaining data privacy while obtaining effective malware detection and classification. The motivation stems from the growing requirement for robust and privacy-preserving machine learning methodologies in response to rising threats to centralized data systems. Federated learning offers a novel solution that eliminates the requirement for centralized data collecting while preserving privacy. In this paper, we investigate the performance of federated learning-based models and compare them classic non-federated approaches. Using the CIC-MalMem-2022 dataset, we built 22 models with feedforward neural networks and long short-term memory methods, including four non-federated models. The results show that federated learning performed outstanding performance with an accuracy of 0.999 in binary classification and 0.845 in multiclass classification, despite different numbers of users. This study contributes significantly to understanding the practical implementation and impact of federated learning. By examining the impact of various factors on classification performance, we highlight the potential of federated learning as a privacy-preserving alternative to centralized machine learning methods, filling a major gap in the field of secure data processing.</p></div>","PeriodicalId":54354,"journal":{"name":"Arabian Journal for Science and Engineering","volume":"50 19","pages":"16107 - 16134"},"PeriodicalIF":2.9000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s13369-025-10043-x.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arabian Journal for Science and Engineering","FirstCategoryId":"103","ListUrlMain":"https://link.springer.com/article/10.1007/s13369-025-10043-x","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

The growing importance of data security in modern information systems extends beyond the preventing malicious software and includes the critical topic of data privacy. Centralized data processing in traditional machine learning methods presents significant challenges, including greater risk of data breaches and attacks on centralized systems. This study addresses the critical issue of maintaining data privacy while obtaining effective malware detection and classification. The motivation stems from the growing requirement for robust and privacy-preserving machine learning methodologies in response to rising threats to centralized data systems. Federated learning offers a novel solution that eliminates the requirement for centralized data collecting while preserving privacy. In this paper, we investigate the performance of federated learning-based models and compare them classic non-federated approaches. Using the CIC-MalMem-2022 dataset, we built 22 models with feedforward neural networks and long short-term memory methods, including four non-federated models. The results show that federated learning performed outstanding performance with an accuracy of 0.999 in binary classification and 0.845 in multiclass classification, despite different numbers of users. This study contributes significantly to understanding the practical implementation and impact of federated learning. By examining the impact of various factors on classification performance, we highlight the potential of federated learning as a privacy-preserving alternative to centralized machine learning methods, filling a major gap in the field of secure data processing.

查看原文本刊更多论文

FEDetect：基于深度神经网络算法的联邦学习恶意软件检测与分类

在现代信息系统中，数据安全的重要性日益增长，已经超出了防止恶意软件的范畴，还包括数据隐私这一关键话题。传统机器学习方法中的集中式数据处理提出了重大挑战，包括更大的数据泄露风险和对集中式系统的攻击。本研究解决了维护数据隐私的关键问题，同时获得有效的恶意软件检测和分类。其动机源于对强大且保护隐私的机器学习方法的需求不断增长，以应对集中式数据系统日益增长的威胁。联邦学习提供了一种新颖的解决方案，消除了对集中数据收集的需求，同时保护了隐私。在本文中，我们研究了基于联邦学习的模型的性能，并与经典的非联邦方法进行了比较。利用cic - malmemm -2022数据集，我们构建了22个前馈神经网络和长短期记忆方法的模型，其中包括4个非联邦模型。结果表明，在用户数量不同的情况下，联邦学习在二元分类和多类分类上的准确率分别为0.999和0.845。本研究对理解联邦学习的实际实施及其影响具有重要意义。通过研究各种因素对分类性能的影响，我们强调了联邦学习作为集中式机器学习方法的隐私保护替代方案的潜力，填补了安全数据处理领域的主要空白。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Arabian Journal for Science and Engineering MULTIDISCIPLINARY SCIENCES-

CiteScore

5.70

自引率

3.40%

发文量

993

期刊介绍： King Fahd University of Petroleum & Minerals (KFUPM) partnered with Springer to publish the Arabian Journal for Science and Engineering (AJSE). AJSE, which has been published by KFUPM since 1975, is a recognized national, regional and international journal that provides a great opportunity for the dissemination of research advances from the Kingdom of Saudi Arabia, MENA and the world.