{"title":"Safe Reinforcement Learning-Based Control in Power Electronic Systems","authors":"Daniel Weber, Maximilian Schenke, O. Wallscheid","doi":"10.1109/FES57669.2023.10182718","DOIUrl":null,"url":null,"abstract":"Data-driven approaches such as reinforcement learning (RL) allow a controller design without a priori system knowledge with minimal human effort as well as seamless self-adaptation to varying system characteristics. However, RL does not inherently consider input and state constraints, i.e., satisfying safety-relevant system limits during training and test. This is challenging in power electronic systems where it is necessary to avoid overcurrents and overvoltages. To overcome this issue, a standard RL algorithm is extended by a combination of constrained optimal control and online model identification to ensure safety during and after the learning process. In an exemplary three-level voltage source inverter for islanded electrical power grid application, it is shown that the approach does not only significantly improves safety but also improves the overall learning-based control performance.","PeriodicalId":165790,"journal":{"name":"2023 International Conference on Future Energy Solutions (FES)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Future Energy Solutions (FES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FES57669.2023.10182718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Data-driven approaches such as reinforcement learning (RL) allow a controller design without a priori system knowledge with minimal human effort as well as seamless self-adaptation to varying system characteristics. However, RL does not inherently consider input and state constraints, i.e., satisfying safety-relevant system limits during training and test. This is challenging in power electronic systems where it is necessary to avoid overcurrents and overvoltages. To overcome this issue, a standard RL algorithm is extended by a combination of constrained optimal control and online model identification to ensure safety during and after the learning process. In an exemplary three-level voltage source inverter for islanded electrical power grid application, it is shown that the approach does not only significantly improves safety but also improves the overall learning-based control performance.