{"title":"Reinforcement Learning for Cyber-Physical Security Assessment of Power Systems","authors":"Xiaorui Liu, Charalambos Konstantinou","doi":"10.1109/PTC.2019.8810568","DOIUrl":null,"url":null,"abstract":"The protection of power systems is of paramount significance for the supply of electricity. Contingency analysis allows to access the impact of power grid components failures. Typically, power systems are designed to handle $N-2$ contingencies. Existing algorithms mainly focus on performance and computational efficiency. There has been little effort in designing contingency methods from a cybersecurity perspective. To address this limitation, we study contingency analysis in the context of power system planning and operation towards cyber-physical security assessment. The proposed methodology considers attackers transitions in the network based on the $N-2$ critical contingency pairs. We develop an online reinforcement $Q -$learning scheme to solve a Markov decision process that models adversarial actions. In this model, the adversary aims to maximize the cumulative reward before making any action and learns adaptively how to optimize the attack strategies. We validate and test the algorithm on eleven literature-based and synthetic power grid test cases.","PeriodicalId":187144,"journal":{"name":"2019 IEEE Milan PowerTech","volume":"83 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Milan PowerTech","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PTC.2019.8810568","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
The protection of power systems is of paramount significance for the supply of electricity. Contingency analysis allows to access the impact of power grid components failures. Typically, power systems are designed to handle $N-2$ contingencies. Existing algorithms mainly focus on performance and computational efficiency. There has been little effort in designing contingency methods from a cybersecurity perspective. To address this limitation, we study contingency analysis in the context of power system planning and operation towards cyber-physical security assessment. The proposed methodology considers attackers transitions in the network based on the $N-2$ critical contingency pairs. We develop an online reinforcement $Q -$learning scheme to solve a Markov decision process that models adversarial actions. In this model, the adversary aims to maximize the cumulative reward before making any action and learns adaptively how to optimize the attack strategies. We validate and test the algorithm on eleven literature-based and synthetic power grid test cases.