{"title":"Maintaining the Frequency of AI-based Power System Model using Twin Delayed DDPG(TD3) Implementation","authors":"Rohan Dubey, Renuka Loka, A. M. Parimi","doi":"10.1109/PARC52418.2022.9726615","DOIUrl":null,"url":null,"abstract":"This paper proposes a multi-agent deep reinforcement learning (MA-DRL) method for load frequency control of a renewable energy single-area power system in a continuous action-space domain. This method can non-linearly adapt the control strategies for cooperative LFC control through off-policy learning. Multi-agent twin delayed deep deterministic policy gradient (TD3) is proposed to adjust and refine the control system parameters considering variational load and source behaviour. Implementation of the model requires only local information for each control area to achieve an optimal control state. Comparison between TD3 and DDPG model proves the edge of TD3 model. Simulations and numerical data comparison on a renewable energy single-area power system demonstrate that the proposed model can successfully reduce control errors and stochastic frequency deviations caused by load and renewable power fluctuations.","PeriodicalId":158896,"journal":{"name":"2022 2nd International Conference on Power Electronics & IoT Applications in Renewable Energy and its Control (PARC)","volume":"123 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Power Electronics & IoT Applications in Renewable Energy and its Control (PARC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PARC52418.2022.9726615","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper proposes a multi-agent deep reinforcement learning (MA-DRL) method for load frequency control of a renewable energy single-area power system in a continuous action-space domain. This method can non-linearly adapt the control strategies for cooperative LFC control through off-policy learning. Multi-agent twin delayed deep deterministic policy gradient (TD3) is proposed to adjust and refine the control system parameters considering variational load and source behaviour. Implementation of the model requires only local information for each control area to achieve an optimal control state. Comparison between TD3 and DDPG model proves the edge of TD3 model. Simulations and numerical data comparison on a renewable energy single-area power system demonstrate that the proposed model can successfully reduce control errors and stochastic frequency deviations caused by load and renewable power fluctuations.