ER-ERT:A Method of Ensemble Representation Learning of Encrypted RAT Traffic

Edutech Pub Date : 2023-06-12 DOI:10.23919/IFIPNetworking57963.2023.10186391

Yijing Zhang, Hui Xue, Jianjun Lin, Xiaoyu Liu, Weilin Gai, Xiaodu Yang, Anqi Wang, Yinliang Yue, Bo Sun

{"title":"ER-ERT:A Method of Ensemble Representation Learning of Encrypted RAT Traffic","authors":"Yijing Zhang, Hui Xue, Jianjun Lin, Xiaoyu Liu, Weilin Gai, Xiaodu Yang, Anqi Wang, Yinliang Yue, Bo Sun","doi":"10.23919/IFIPNetworking57963.2023.10186391","DOIUrl":null,"url":null,"abstract":"Remote Access Trojan (RAT) is one of the major threats to today's network environment. It is a class of malware frequently used by hacking collectives to monitor victims' actions and steal personal information in targeted computers. Traditional machine learning algorithms have been widely used to detect malicious encrypted RAT traffic. Traditional machine learning algorithms rely deeply on expert experience, and it is difficult for current traffic classification models to design effective handcraft features. Deep learning methods have been introduced in recent years to generate representations from raw network traffic data automatically. Previous deep learning-based malicious traffic detection methods generate representations from flow sequences or packet payload bytes. None of these methods simultaneously learn embeddings from flow sequence and packet payload bytes. Thus, we propose a novel ensemble model to draw fine-grained and multi-angle traffic representations for RAT traffic. The model extract (1) temporal features with convolution neural network (CNN) and the Reproducing Kernel Hilbert Space (RKHS) embedding method to model network flow sequence, (2) spatial features with autoencoder and bidirectional gated recurrent unit (Bi-GRU) network to model packet payload bytes, and (3) some stage-based attributes to enhance the identification ability of RAT traffic behaviors. According to the experimental result, our approach achieves better performance than previous works with a precision rate of 97.0% and a recall rate of 96.5%.","PeriodicalId":31737,"journal":{"name":"Edutech","volume":"9 1","pages":"1-10"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Edutech","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/IFIPNetworking57963.2023.10186391","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Remote Access Trojan (RAT) is one of the major threats to today's network environment. It is a class of malware frequently used by hacking collectives to monitor victims' actions and steal personal information in targeted computers. Traditional machine learning algorithms have been widely used to detect malicious encrypted RAT traffic. Traditional machine learning algorithms rely deeply on expert experience, and it is difficult for current traffic classification models to design effective handcraft features. Deep learning methods have been introduced in recent years to generate representations from raw network traffic data automatically. Previous deep learning-based malicious traffic detection methods generate representations from flow sequences or packet payload bytes. None of these methods simultaneously learn embeddings from flow sequence and packet payload bytes. Thus, we propose a novel ensemble model to draw fine-grained and multi-angle traffic representations for RAT traffic. The model extract (1) temporal features with convolution neural network (CNN) and the Reproducing Kernel Hilbert Space (RKHS) embedding method to model network flow sequence, (2) spatial features with autoencoder and bidirectional gated recurrent unit (Bi-GRU) network to model packet payload bytes, and (3) some stage-based attributes to enhance the identification ability of RAT traffic behaviors. According to the experimental result, our approach achieves better performance than previous works with a precision rate of 97.0% and a recall rate of 96.5%.

查看原文本刊更多论文

ER-ERT:一种加密鼠流量的集成表示学习方法

远程访问木马(RAT)是当今网络环境的主要威胁之一。这是黑客组织经常使用的一类恶意软件，用来监视受害者的行为，窃取目标电脑中的个人信息。传统的机器学习算法已被广泛用于检测恶意加密RAT流量。传统的机器学习算法严重依赖专家经验，目前的流量分类模型难以设计出有效的手工特征。近年来引入了深度学习方法来从原始网络流量数据自动生成表示。以前基于深度学习的恶意流量检测方法是从流序列或数据包有效载荷字节生成表示。这些方法都不能同时从流序列和数据包有效载荷字节中学习嵌入。因此，我们提出了一种新的集成模型来绘制RAT流量的细粒度和多角度流量表示。模型提取(1)利用卷积神经网络(CNN)和再现核希尔伯特空间(RKHS)嵌入方法对网络流序列进行时间特征建模;(2)利用自编码器和双向门控循环单元(Bi-GRU)网络对数据包有效载荷字节进行空间特征建模;(3)利用基于阶段的属性增强RAT流量行为识别能力。实验结果表明，该方法的准确率为97.0%，召回率为96.5%，优于以往的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Edutech

自引率

0.00%

发文量

审稿时长

4 weeks