Fast and Accurate Machine Learning-based Malware Detection via RC4 Ciphertext Analysis*

2019 International Conference on Computing, Networking and Communications (ICNC) Pub Date : 2019-02-01 DOI:10.1109/ICCNC.2019.8685644

Junggab Son, Euiseong Ko, Uday Bhaskar Boyanapalli, Donghyun Kim, Youngsoon Kim, Mingon Kang

{"title":"Fast and Accurate Machine Learning-based Malware Detection via RC4 Ciphertext Analysis*","authors":"Junggab Son, Euiseong Ko, Uday Bhaskar Boyanapalli, Donghyun Kim, Youngsoon Kim, Mingon Kang","doi":"10.1109/ICCNC.2019.8685644","DOIUrl":null,"url":null,"abstract":"Recent malware increases its viability by employing ciphers which help to hide malicious intention and/or behavior against detection schemes. So far, many efforts have been made to detect malware and to prevent it from damaging clients by monitoring network packets. However, these conventional detection schemes tend to treat an encrypted packet as legitimate due to the hardness of extracting information from ciphertexts. Cryptoanalysis of each packet flowing over a network might be one feasible solution to the problem. However, this approach is computationally expensive and lacks accuracy, and thus it is consequently not a practical solution. To address the problem, we firstly introduce a discovery that a fixed encryption key generates unique statistical patterns on RC4 ciphertexts. To the best of our knowledge, this unique signature has never been discussed in the literature. Then, we propose a machine learning-based detection scheme that can identify malware packets efficiently and accurately by leveraging the discovery. The proposed scheme directly analyze network packets without decrypting ciphertexts. Moreover, our analysis demonstrates the proposed scheme requires only a tiny subset of the network packet.","PeriodicalId":161815,"journal":{"name":"2019 International Conference on Computing, Networking and Communications (ICNC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Computing, Networking and Communications (ICNC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCNC.2019.8685644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Recent malware increases its viability by employing ciphers which help to hide malicious intention and/or behavior against detection schemes. So far, many efforts have been made to detect malware and to prevent it from damaging clients by monitoring network packets. However, these conventional detection schemes tend to treat an encrypted packet as legitimate due to the hardness of extracting information from ciphertexts. Cryptoanalysis of each packet flowing over a network might be one feasible solution to the problem. However, this approach is computationally expensive and lacks accuracy, and thus it is consequently not a practical solution. To address the problem, we firstly introduce a discovery that a fixed encryption key generates unique statistical patterns on RC4 ciphertexts. To the best of our knowledge, this unique signature has never been discussed in the literature. Then, we propose a machine learning-based detection scheme that can identify malware packets efficiently and accurately by leveraging the discovery. The proposed scheme directly analyze network packets without decrypting ciphertexts. Moreover, our analysis demonstrates the proposed scheme requires only a tiny subset of the network packet.

查看原文本刊更多论文

通过RC4密文分析快速准确的基于机器学习的恶意软件检测*

最近的恶意软件通过使用有助于隐藏恶意意图和/或针对检测方案的行为的密码来增加其生存能力。到目前为止，人们已经做出了很多努力来检测恶意软件，并通过监控网络数据包来防止它破坏客户端。然而，由于从密文中提取信息的困难，这些传统的检测方案倾向于将加密的数据包视为合法的。对流经网络的每个数据包进行加密分析可能是解决该问题的一种可行方案。然而，这种方法在计算上很昂贵并且缺乏准确性，因此它不是一个实用的解决方案。为了解决这个问题，我们首先介绍一个发现，即固定的加密密钥在RC4密文上生成唯一的统计模式。据我们所知，这种独特的签名从未在文献中被讨论过。然后，我们提出了一种基于机器学习的检测方案，该方案可以利用发现高效准确地识别恶意数据包。该方案不需要对密文进行解密，直接对网络数据包进行分析。此外，我们的分析表明，所提出的方案只需要网络数据包的一小部分。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 International Conference on Computing, Networking and Communications (ICNC)

自引率

0.00%

发文量