{"title":"环境和强迫振荡下未知网络模型电力系统的强化学习控制","authors":"Sayak Mukherjee, H. Bai, A. Chakrabortty","doi":"10.1109/CCTA41146.2020.9206271","DOIUrl":null,"url":null,"abstract":"We present a model-free optimal control design for electric power systems with unknown transmission network and load models to improve its dynamic performance using techniques from reinforcement learning (RL) and adaptive dynamic programming (ADP). We consider different persistent disturbances in the grid including ambient oscillations resulting from load fluctuations and their effects on exciter voltage regulation loops. We also consider forced oscillation scenarios that frequently occur due to malfunctioning of governor valves. Our proposed RL algorithm recovers the optimal feedback response in spite of all of these disturbances in a completely model-free way using online measurements of the states, inputs, and the disturbances. The design is validated using the IEEE benchmark 39-bus, 10-generator New England power system model perturbed with different ambient and forced oscillations.","PeriodicalId":241335,"journal":{"name":"2020 IEEE Conference on Control Technology and Applications (CCTA)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Reinforcement Learning Control of Power Systems with Unknown Network Model under Ambient and Forced Oscillations\",\"authors\":\"Sayak Mukherjee, H. Bai, A. Chakrabortty\",\"doi\":\"10.1109/CCTA41146.2020.9206271\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a model-free optimal control design for electric power systems with unknown transmission network and load models to improve its dynamic performance using techniques from reinforcement learning (RL) and adaptive dynamic programming (ADP). We consider different persistent disturbances in the grid including ambient oscillations resulting from load fluctuations and their effects on exciter voltage regulation loops. We also consider forced oscillation scenarios that frequently occur due to malfunctioning of governor valves. Our proposed RL algorithm recovers the optimal feedback response in spite of all of these disturbances in a completely model-free way using online measurements of the states, inputs, and the disturbances. The design is validated using the IEEE benchmark 39-bus, 10-generator New England power system model perturbed with different ambient and forced oscillations.\",\"PeriodicalId\":241335,\"journal\":{\"name\":\"2020 IEEE Conference on Control Technology and Applications (CCTA)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Conference on Control Technology and Applications (CCTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCTA41146.2020.9206271\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Conference on Control Technology and Applications (CCTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCTA41146.2020.9206271","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning Control of Power Systems with Unknown Network Model under Ambient and Forced Oscillations
We present a model-free optimal control design for electric power systems with unknown transmission network and load models to improve its dynamic performance using techniques from reinforcement learning (RL) and adaptive dynamic programming (ADP). We consider different persistent disturbances in the grid including ambient oscillations resulting from load fluctuations and their effects on exciter voltage regulation loops. We also consider forced oscillation scenarios that frequently occur due to malfunctioning of governor valves. Our proposed RL algorithm recovers the optimal feedback response in spite of all of these disturbances in a completely model-free way using online measurements of the states, inputs, and the disturbances. The design is validated using the IEEE benchmark 39-bus, 10-generator New England power system model perturbed with different ambient and forced oscillations.