{"title":"基于部分信息多智能体强化学习的网络-物理电力系统混合博弈论安全评估","authors":"Zahra Azimi, Ahmad Afshar","doi":"10.1016/j.segan.2025.101727","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, we develop a Hybrid Non-zero-sum Multi-stage Partial-information Stochastic (HNMPS) game theoretic model for assessing the security of interdependent cyber-physical power systems (CPPS). In the cyber layer, HNMPS encapsulates the discrete dynamics of attacks, with tactics and techniques defined in the ICS MITRE ATT&CK framework. In the physical layer, it evaluates the consequence of Denial of Service (DoS) attacks on the transient stability of power networks through non-linear continuous dynamic analysis. Next, we propose an Imperfect-Information Multi-Agent Q-learning (IMQL) algorithm to solve the game when opposing players' actions and strategies are unknown. Unlike existing methods, IMQL doesn't require joint Nash strategy computation and therefore relaxes the strong assumption on existing global optima or saddle point. We further prove the convergence of the proposed algorithm. Based on the outcome of the HNMPS, we introduce the Cyber Security Index Level (CIL), a novel metric that quantifies the probability of physical layer intrusion following a breach in the cyber layer. To validate our model, we conduct simulations on the Western System Coordinating Council (WECC) 9-bus system coupled with a cyber network, employing an attack scenario inspired by the BlackEnergy v3 malware. Results indicate the successful convergence and robustness of the learning process under partial information settings compared to existing algorithms.</div></div>","PeriodicalId":56142,"journal":{"name":"Sustainable Energy Grids & Networks","volume":"43 ","pages":"Article 101727"},"PeriodicalIF":4.8000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hybrid game-theoretic security assessment of cyber-physical power systems using partial-information multi-agent reinforcement learning\",\"authors\":\"Zahra Azimi, Ahmad Afshar\",\"doi\":\"10.1016/j.segan.2025.101727\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In this paper, we develop a Hybrid Non-zero-sum Multi-stage Partial-information Stochastic (HNMPS) game theoretic model for assessing the security of interdependent cyber-physical power systems (CPPS). In the cyber layer, HNMPS encapsulates the discrete dynamics of attacks, with tactics and techniques defined in the ICS MITRE ATT&CK framework. In the physical layer, it evaluates the consequence of Denial of Service (DoS) attacks on the transient stability of power networks through non-linear continuous dynamic analysis. Next, we propose an Imperfect-Information Multi-Agent Q-learning (IMQL) algorithm to solve the game when opposing players' actions and strategies are unknown. Unlike existing methods, IMQL doesn't require joint Nash strategy computation and therefore relaxes the strong assumption on existing global optima or saddle point. We further prove the convergence of the proposed algorithm. Based on the outcome of the HNMPS, we introduce the Cyber Security Index Level (CIL), a novel metric that quantifies the probability of physical layer intrusion following a breach in the cyber layer. To validate our model, we conduct simulations on the Western System Coordinating Council (WECC) 9-bus system coupled with a cyber network, employing an attack scenario inspired by the BlackEnergy v3 malware. Results indicate the successful convergence and robustness of the learning process under partial information settings compared to existing algorithms.</div></div>\",\"PeriodicalId\":56142,\"journal\":{\"name\":\"Sustainable Energy Grids & Networks\",\"volume\":\"43 \",\"pages\":\"Article 101727\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Sustainable Energy Grids & Networks\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352467725001092\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sustainable Energy Grids & Networks","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352467725001092","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
Hybrid game-theoretic security assessment of cyber-physical power systems using partial-information multi-agent reinforcement learning
In this paper, we develop a Hybrid Non-zero-sum Multi-stage Partial-information Stochastic (HNMPS) game theoretic model for assessing the security of interdependent cyber-physical power systems (CPPS). In the cyber layer, HNMPS encapsulates the discrete dynamics of attacks, with tactics and techniques defined in the ICS MITRE ATT&CK framework. In the physical layer, it evaluates the consequence of Denial of Service (DoS) attacks on the transient stability of power networks through non-linear continuous dynamic analysis. Next, we propose an Imperfect-Information Multi-Agent Q-learning (IMQL) algorithm to solve the game when opposing players' actions and strategies are unknown. Unlike existing methods, IMQL doesn't require joint Nash strategy computation and therefore relaxes the strong assumption on existing global optima or saddle point. We further prove the convergence of the proposed algorithm. Based on the outcome of the HNMPS, we introduce the Cyber Security Index Level (CIL), a novel metric that quantifies the probability of physical layer intrusion following a breach in the cyber layer. To validate our model, we conduct simulations on the Western System Coordinating Council (WECC) 9-bus system coupled with a cyber network, employing an attack scenario inspired by the BlackEnergy v3 malware. Results indicate the successful convergence and robustness of the learning process under partial information settings compared to existing algorithms.
期刊介绍:
Sustainable Energy, Grids and Networks (SEGAN)is an international peer-reviewed publication for theoretical and applied research dealing with energy, information grids and power networks, including smart grids from super to micro grid scales. SEGAN welcomes papers describing fundamental advances in mathematical, statistical or computational methods with application to power and energy systems, as well as papers on applications, computation and modeling in the areas of electrical and energy systems with coupled information and communication technologies.