{"title":"基于无模型强化学习的电力电子变流器控制方法","authors":"Dajr Alfred, D. Czarkowski, Jiaxin Teng","doi":"10.1109/GreenTech48523.2021.00024","DOIUrl":null,"url":null,"abstract":"This paper presents a novel reinforcement learning (RL) based discrete-time closed-loop control methodology for switch-mode, pulse-width-modulated (PWM) power electronic converters. This method of closed-loop optimal output regulation is achieved by utilizing measured data to approximate system dynamics, thus obviating the need for prior knowledge of system/plant dynamics. The underlying RL algorithm is then utilized to obtain the optimal feedback controller. The derived controller is obtained in a manner akin to that of a Linear Quadratic Regulator (LQR) and involves the iterative solution of an algebraic Riccati equation (ARE). This closed-loop control methodology is implemented on both buck and boost converters and its robustness to load and line variation is tested. A Type-III compensator was also developed in order to compare its performance with that of the proposed controller. Simulation results are provided to verify the effectiveness and examine the limitations of the proposed control strategy.","PeriodicalId":146759,"journal":{"name":"2021 IEEE Green Technologies Conference (GreenTech)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Model-Free Reinforcement-Learning-Based Control Methodology for Power Electronic Converters\",\"authors\":\"Dajr Alfred, D. Czarkowski, Jiaxin Teng\",\"doi\":\"10.1109/GreenTech48523.2021.00024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a novel reinforcement learning (RL) based discrete-time closed-loop control methodology for switch-mode, pulse-width-modulated (PWM) power electronic converters. This method of closed-loop optimal output regulation is achieved by utilizing measured data to approximate system dynamics, thus obviating the need for prior knowledge of system/plant dynamics. The underlying RL algorithm is then utilized to obtain the optimal feedback controller. The derived controller is obtained in a manner akin to that of a Linear Quadratic Regulator (LQR) and involves the iterative solution of an algebraic Riccati equation (ARE). This closed-loop control methodology is implemented on both buck and boost converters and its robustness to load and line variation is tested. A Type-III compensator was also developed in order to compare its performance with that of the proposed controller. Simulation results are provided to verify the effectiveness and examine the limitations of the proposed control strategy.\",\"PeriodicalId\":146759,\"journal\":{\"name\":\"2021 IEEE Green Technologies Conference (GreenTech)\",\"volume\":\"116 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Green Technologies Conference (GreenTech)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GreenTech48523.2021.00024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Green Technologies Conference (GreenTech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GreenTech48523.2021.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Model-Free Reinforcement-Learning-Based Control Methodology for Power Electronic Converters
This paper presents a novel reinforcement learning (RL) based discrete-time closed-loop control methodology for switch-mode, pulse-width-modulated (PWM) power electronic converters. This method of closed-loop optimal output regulation is achieved by utilizing measured data to approximate system dynamics, thus obviating the need for prior knowledge of system/plant dynamics. The underlying RL algorithm is then utilized to obtain the optimal feedback controller. The derived controller is obtained in a manner akin to that of a Linear Quadratic Regulator (LQR) and involves the iterative solution of an algebraic Riccati equation (ARE). This closed-loop control methodology is implemented on both buck and boost converters and its robustness to load and line variation is tested. A Type-III compensator was also developed in order to compare its performance with that of the proposed controller. Simulation results are provided to verify the effectiveness and examine the limitations of the proposed control strategy.