Discovering Exfiltration Paths Using Reinforcement Learning with Attack Graphs

2022 IEEE Conference on Dependable and Secure Computing (DSC) Pub Date : 2022-01-28 DOI:10.1109/DSC54232.2022.9888919

Tyler Cody, Abdul Rahman, Christopher Redino, Lanxiao Huang, Ryan Clark, A. Kakkar, Deepak Kushwaha, Paul Park, P. Beling, E. Bowen

引用次数: 9

Abstract

Reinforcement learning (RL), in conjunction with attack graphs and cyber terrain, are used to develop reward and state associated with determination of optimal paths for exfiltration of data in enterprise networks. This work builds on previous crown jewels (CJ) identification that focused on the target goal of computing optimal paths that adversaries may traverse toward compromising CJs or hosts within their proximity. This work inverts the previous CJ approach based on the assumption that data has been stolen and now must be quietly exfiltrated from the network. RL is utilized to support the development of a reward function based on the identification of those paths where adversaries desire reduced detection. Results demonstrate promising performance for a sizable network environment.

查看原文本刊更多论文

使用攻击图的强化学习发现泄露路径

强化学习(RL)与攻击图和网络地形相结合，用于开发与确定企业网络中数据泄露的最佳路径相关的奖励和状态。这项工作建立在先前的皇冠珠宝(CJ)识别的基础上，该识别的重点是计算攻击者可能穿越的最优路径，以破坏其附近的CJ或主机。这项工作颠覆了之前的CJ方法，该方法基于数据已经被盗，现在必须悄悄地从网络中泄漏的假设。RL用于支持基于识别那些对手希望减少检测的路径的奖励函数的开发。结果表明，在一个相当大的网络环境中，性能是有希望的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE Conference on Dependable and Secure Computing (DSC)

自引率

0.00%

发文量