{"title":"基于q -学习的lcl耦合参数偏差逆变器有限控制集模型预测控制","authors":"Lei Zhang, Yunjian Peng, Weijie Sun, Jinze Li","doi":"10.1109/DDCLS58216.2023.10167014","DOIUrl":null,"url":null,"abstract":"Finite Control Set (FCS) Model Predictive Control (MPC), as an efficient method used for current tracking of LCL-Coupled three-phase inverters, runs into high computational complexity while finding its optimal version with a long predictive interval. For such a difficult problem we take a value function with discounted factors as an indicator to measure the pros and cons of control and propose a novel alternative method based on Q-learning algorithm. In the control scheme, the value function is approximated by reinforcement learning(RL) algorithm and furthermore, the long horizons prediction is transformed into an iterative multi-step matrix calculation. At the same time, the optimal switching position is directly obtained without a modulation link, which greatly reduces the computational complexity. Accordingly, a data-driven Q-learning algorithm is designed with a proof of convergence. Last, the proposed algorithm's performance in the case of complete deviation from the (unknown) system parameters is verified by simulations.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"156 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Q-Learning-based Finite Control Set Model Predictive Control for LCL-Coupled Inverters with Deviated Parameters\",\"authors\":\"Lei Zhang, Yunjian Peng, Weijie Sun, Jinze Li\",\"doi\":\"10.1109/DDCLS58216.2023.10167014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Finite Control Set (FCS) Model Predictive Control (MPC), as an efficient method used for current tracking of LCL-Coupled three-phase inverters, runs into high computational complexity while finding its optimal version with a long predictive interval. For such a difficult problem we take a value function with discounted factors as an indicator to measure the pros and cons of control and propose a novel alternative method based on Q-learning algorithm. In the control scheme, the value function is approximated by reinforcement learning(RL) algorithm and furthermore, the long horizons prediction is transformed into an iterative multi-step matrix calculation. At the same time, the optimal switching position is directly obtained without a modulation link, which greatly reduces the computational complexity. Accordingly, a data-driven Q-learning algorithm is designed with a proof of convergence. 
Last, the proposed algorithm's performance in the case of complete deviation from the (unknown) system parameters is verified by simulations.\",\"PeriodicalId\":415532,\"journal\":{\"name\":\"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)\",\"volume\":\"156 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DDCLS58216.2023.10167014\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DDCLS58216.2023.10167014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Q-Learning-based Finite Control Set Model Predictive Control for LCL-Coupled Inverters with Deviated Parameters
Abstract: Finite Control Set (FCS) Model Predictive Control (MPC) is an efficient method for current tracking in LCL-coupled three-phase inverters, but its computational complexity grows rapidly when the optimal control sequence is sought over a long prediction horizon. To address this problem, we take a value function with a discount factor as the indicator of control quality and propose a novel alternative method based on the Q-learning algorithm. In the proposed scheme, the value function is approximated by a reinforcement learning (RL) algorithm, and the long-horizon prediction is transformed into an iterative multi-step matrix calculation. At the same time, the optimal switching position is obtained directly, without a modulation stage, which greatly reduces the computational complexity. Accordingly, a data-driven Q-learning algorithm is designed and its convergence is proved. Finally, the performance of the proposed algorithm under complete deviation from the (unknown) system parameters is verified by simulations.
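To make the idea in the abstract more concrete, the following is a minimal Python sketch (not the authors' implementation) of the two ingredients it describes: a Q-function approximated from data with a discounted cost, and direct selection of a switching state from the finite control set without a modulator. The filter/model dimensions, the quadratic feature map, the cost, and all function names here are illustrative assumptions; the paper itself uses an iterative multi-step matrix formulation rather than this generic feature-based form.

```python
# Hedged sketch of Q-learning-based FCS selection for an inverter (illustrative only).
import numpy as np

GAMMA = 0.95   # discount factor of the value function (assumed)
ALPHA = 0.01   # learning rate for the Q-function weights (assumed)

# Finite control set: the 8 switching states of a two-level three-phase inverter,
# represented here as normalized per-phase switch positions (illustrative).
CONTROL_SET = np.array(
    [[a, b, c] for a in (0, 1) for b in (0, 1) for c in (0, 1)], dtype=float
)

def features(x, u):
    """Quadratic feature vector phi(x, u) for a linear-in-weights Q approximation."""
    z = np.concatenate([x, u, [1.0]])
    return np.outer(z, z)[np.triu_indices(len(z))]  # upper-triangular quadratic terms

def select_switch_state(w, x):
    """Pick the switching state minimizing the approximated Q-value (no modulation stage)."""
    q_values = [features(x, u) @ w for u in CONTROL_SET]
    return int(np.argmin(q_values))

def q_learning_update(w, x, u_idx, stage_cost, x_next):
    """One temporal-difference update of the Q-function weights from measured data."""
    u = CONTROL_SET[u_idx]
    best_next = min(features(x_next, v) @ w for v in CONTROL_SET)
    td_error = stage_cost + GAMMA * best_next - features(x, u) @ w
    return w + ALPHA * td_error * features(x, u)

if __name__ == "__main__":
    # Toy usage with a 4-dimensional state (e.g., filter currents/voltages); the
    # plant interaction is replaced by random placeholders purely for illustration.
    rng = np.random.default_rng(0)
    x = rng.normal(size=4)
    w = np.zeros(len(features(np.zeros(4), CONTROL_SET[0])))
    u_idx = select_switch_state(w, x)
    x_next = rng.normal(size=4)          # placeholder for the measured next state
    cost = float(x_next @ x_next)        # placeholder current-tracking error cost
    w = q_learning_update(w, x, u_idx, cost, x_next)
```

The design point the sketch tries to capture is the one stressed in the abstract: because the control set is finite, the "optimization" at each step is just an argmin over a handful of Q-value evaluations, so long-horizon performance is encoded in the learned value function rather than in an explicit multi-step enumeration.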