{"title":"EMTD: Efficient encrypted malware traffic detection based on adaptive meta-path guided graph propagation","authors":"Fanyi Zeng, Dapeng Man, Yuhao Zhao, Yuchen Liu, Huanran Wang, Wu Yang","doi":"10.1016/j.comnet.2025.111636","DOIUrl":null,"url":null,"abstract":"<div><div>Given the growing challenges posed by encrypted Android malware, developing effective detection methods is crucial to understanding the evolution of malware families and designing preventive security measures. Existing detection methods for encrypted malware traffic primarily focus on extracting features at the single-flow level and multi-flow context level, failing to capture the evolutionary associations within known malware families and their previously unseen variants, which compromises detection effectiveness. Graph-based modeling methods have advantages in expressing traffic association features, but scalability issues present new challenges for detection timeliness. To address these limitations, we propose a novel two-stage detection framework named <strong>Encrypted Malware Traffic Detection (EMTD)</strong>. In the first phase, our method <strong>MFGDect</strong> models encrypted traffic as a heterogeneous information network and applies a multilayer heterogeneous attention mechanism to learn semantic associations among traffic flows. This enables adaptive family-aware representation and improves detection performance under encryption. Furthermore, we design <strong>MFGDect++</strong>, an extension of our base model that introduces adaptive meta-path guided graph propagation, enabling efficient incremental detection of new traffic samples without re-graphing or model retraining. This mechanism significantly reduces the average detection time to 135 ms per sample, demonstrating strong scalability. Experiments on public datasets demonstrate that EMTD outperforms existing baselines, achieving an average improvement of 9.62% in malicious sample recall and a 2.32% increase in F1 score, while maintaining low resource overheads and strong adaptability to large-scale graph data.</div></div>","PeriodicalId":50637,"journal":{"name":"Computer Networks","volume":"271 ","pages":"Article 111636"},"PeriodicalIF":4.6000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1389128625006036","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Given the growing challenges posed by encrypted Android malware, developing effective detection methods is crucial to understanding the evolution of malware families and designing preventive security measures. Existing detection methods for encrypted malware traffic primarily focus on extracting features at the single-flow level and multi-flow context level, failing to capture the evolutionary associations within known malware families and their previously unseen variants, which compromises detection effectiveness. Graph-based modeling methods have advantages in expressing traffic association features, but scalability issues present new challenges for detection timeliness. To address these limitations, we propose a novel two-stage detection framework named Encrypted Malware Traffic Detection (EMTD). In the first phase, our method MFGDect models encrypted traffic as a heterogeneous information network and applies a multilayer heterogeneous attention mechanism to learn semantic associations among traffic flows. This enables adaptive family-aware representation and improves detection performance under encryption. Furthermore, we design MFGDect++, an extension of our base model that introduces adaptive meta-path guided graph propagation, enabling efficient incremental detection of new traffic samples without re-graphing or model retraining. This mechanism significantly reduces the average detection time to 135 ms per sample, demonstrating strong scalability. Experiments on public datasets demonstrate that EMTD outperforms existing baselines, achieving an average improvement of 9.62% in malicious sample recall and a 2.32% increase in F1 score, while maintaining low resource overheads and strong adaptability to large-scale graph data.
期刊介绍:
Computer Networks is an international, archival journal providing a publication vehicle for complete coverage of all topics of interest to those involved in the computer communications networking area. The audience includes researchers, managers and operators of networks as well as designers and implementors. The Editorial Board will consider any material for publication that is of interest to those groups.