A Hybrid Game Theory and Reinforcement Learning Approach for Cyber-Physical Systems Security

NOMS 2020 - 2020 IEEE/IFIP Network Operations and Management Symposium Pub Date : 2020-04-01 DOI:10.1109/NOMS47738.2020.9110453

Joseph Khoury, M. Nassar

{"title":"A Hybrid Game Theory and Reinforcement Learning Approach for Cyber-Physical Systems Security","authors":"Joseph Khoury, M. Nassar","doi":"10.1109/NOMS47738.2020.9110453","DOIUrl":null,"url":null,"abstract":"Cyber-Physical Systems (CPS) are monitored and controlled by Supervisory Control and Data Acquisition (SCADA) systems that use advanced computing, sensors, control systems, and communication networks. At first, CPS and SCADA systems were protected and secured by isolation. However, with recent industrial technology advances, the increased connectivity of CPSs and SCADA systems to enterprise networks has uncovered them to new cybersecurity threats and made them a primary target for cyber-attacks with the potential of causing catastrophic economic, social, and environmental damage. Recent research focuses on new methodologies for risk modeling and assessment using game theory and reinforcement learning.This paperwork proposes to frame CPS security on two different levels, strategic and battlefield, by meeting ideas from game theory and Multi-Agent Reinforcement Learning (MARL). The strategic level is modeled as imperfect information, extensive form game. Here, the human administrator and the malware author decide on the strategies of defense and attack, respectively. At the battlefield level, strategies are implemented by machine learning agents that derive optimal policies for run-time decisions. The outcomes of these policies manifest as the utility at a higher level, where we aim to reach a Nash Equilibrium (NE) in favor of the defender. We simulate the scenario of a virus spreading in the context of a CPS network. We present experiments using the MiniCPS simulator and the OpenAI Gym toolkit and discuss the results.","PeriodicalId":169457,"journal":{"name":"NOMS 2020 - 2020 IEEE/IFIP Network Operations and Management Symposium","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NOMS 2020 - 2020 IEEE/IFIP Network Operations and Management Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NOMS47738.2020.9110453","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

Cyber-Physical Systems (CPS) are monitored and controlled by Supervisory Control and Data Acquisition (SCADA) systems that use advanced computing, sensors, control systems, and communication networks. At first, CPS and SCADA systems were protected and secured by isolation. However, with recent industrial technology advances, the increased connectivity of CPSs and SCADA systems to enterprise networks has uncovered them to new cybersecurity threats and made them a primary target for cyber-attacks with the potential of causing catastrophic economic, social, and environmental damage. Recent research focuses on new methodologies for risk modeling and assessment using game theory and reinforcement learning.This paperwork proposes to frame CPS security on two different levels, strategic and battlefield, by meeting ideas from game theory and Multi-Agent Reinforcement Learning (MARL). The strategic level is modeled as imperfect information, extensive form game. Here, the human administrator and the malware author decide on the strategies of defense and attack, respectively. At the battlefield level, strategies are implemented by machine learning agents that derive optimal policies for run-time decisions. The outcomes of these policies manifest as the utility at a higher level, where we aim to reach a Nash Equilibrium (NE) in favor of the defender. We simulate the scenario of a virus spreading in the context of a CPS network. We present experiments using the MiniCPS simulator and the OpenAI Gym toolkit and discuss the results.

查看原文本刊更多论文

网络物理系统安全的混合博弈论和强化学习方法

信息物理系统(CPS)由使用先进计算、传感器、控制系统和通信网络的监督控制和数据采集(SCADA)系统监视和控制。首先，CPS和SCADA系统通过隔离保护和安全。然而，随着最近工业技术的进步，cps和SCADA系统与企业网络的连接性增加，使它们面临新的网络安全威胁，并使它们成为网络攻击的主要目标，有可能造成灾难性的经济、社会和环境破坏。最近的研究集中在利用博弈论和强化学习进行风险建模和评估的新方法上。该文件提出通过满足博弈论和多智能体强化学习(MARL)的思想，在战略和战场两个不同的层面上构建CPS安全。战略层面被建模为不完全信息、广泛形式的博弈。在这里，人工管理员和恶意软件作者分别决定防御和攻击策略。在战场层面，策略由机器学习代理实现，这些代理为运行时决策派生出最佳策略。这些政策的结果在更高的层次上表现为效用，我们的目标是达到有利于防御者的纳什均衡(NE)。我们模拟了病毒在CPS网络环境中传播的场景。我们介绍了使用MiniCPS模拟器和OpenAI Gym工具包的实验，并讨论了结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

NOMS 2020 - 2020 IEEE/IFIP Network Operations and Management Symposium

自引率

0.00%

发文量