面向性能的数据驱动控制：融合Koopman算子和基于mpc的强化学习

IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS

IEEE Control Systems Letters Pub Date : 2024-12-19 DOI:10.1109/LCSYS.2024.3520904

Hossein Nejatbakhsh Esfahani;Umesh Vaidya;Javad Mohammadpour Velni

{"title":"面向性能的数据驱动控制：融合Koopman算子和基于mpc的强化学习","authors":"Hossein Nejatbakhsh Esfahani;Umesh Vaidya;Javad Mohammadpour Velni","doi":"10.1109/LCSYS.2024.3520904","DOIUrl":null,"url":null,"abstract":"This letter develops the machinery of Koopman-based Model Predictive Control (KMPC) design, where the Koopman derived model is unable to capture the real nonlinear system perfectly. We then propose to use an MPC-based reinforcement learning within the Koopman framework combining the strengths of MPC, Reinforcement Learning (RL), and the Koopman Operator (KO) theory for an efficient data-driven control and performance-oriented learning of complex nonlinear systems. We show that the closed-loop performance of the KMPC is improved by modifying the KMPC objective function. In practice, we design a fully parameterized KMPC and employ RL to adjust the corresponding parameters aiming at achieving the best achievable closed-loop performance.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"8 ","pages":"3021-3026"},"PeriodicalIF":2.4000,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance-Oriented Data-Driven Control: Fusing Koopman Operator and MPC-Based Reinforcement Learning\",\"authors\":\"Hossein Nejatbakhsh Esfahani;Umesh Vaidya;Javad Mohammadpour Velni\",\"doi\":\"10.1109/LCSYS.2024.3520904\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This letter develops the machinery of Koopman-based Model Predictive Control (KMPC) design, where the Koopman derived model is unable to capture the real nonlinear system perfectly. We then propose to use an MPC-based reinforcement learning within the Koopman framework combining the strengths of MPC, Reinforcement Learning (RL), and the Koopman Operator (KO) theory for an efficient data-driven control and performance-oriented learning of complex nonlinear systems. We show that the closed-loop performance of the KMPC is improved by modifying the KMPC objective function. In practice, we design a fully parameterized KMPC and employ RL to adjust the corresponding parameters aiming at achieving the best achievable closed-loop performance.\",\"PeriodicalId\":37235,\"journal\":{\"name\":\"IEEE Control Systems Letters\",\"volume\":\"8 \",\"pages\":\"3021-3026\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2024-12-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Control Systems Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10808167/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Control Systems Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10808167/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

这封信发展了基于Koopman的模型预测控制（KMPC）设计的机制，其中Koopman导出的模型无法完美地捕获实际的非线性系统。然后，我们建议在Koopman框架内使用基于MPC的强化学习，结合MPC、强化学习（RL）和Koopman算子（KO）理论的优势，对复杂非线性系统进行有效的数据驱动控制和面向性能的学习。结果表明，通过修改KMPC目标函数，可以提高KMPC的闭环性能。在实践中，我们设计了一个全参数化的KMPC，并利用RL来调整相应的参数，以达到最佳的闭环性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Performance-Oriented Data-Driven Control: Fusing Koopman Operator and MPC-Based Reinforcement Learning

This letter develops the machinery of Koopman-based Model Predictive Control (KMPC) design, where the Koopman derived model is unable to capture the real nonlinear system perfectly. We then propose to use an MPC-based reinforcement learning within the Koopman framework combining the strengths of MPC, Reinforcement Learning (RL), and the Koopman Operator (KO) theory for an efficient data-driven control and performance-oriented learning of complex nonlinear systems. We show that the closed-loop performance of the KMPC is improved by modifying the KMPC objective function. In practice, we design a fully parameterized KMPC and employ RL to adjust the corresponding parameters aiming at achieving the best achievable closed-loop performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Control Systems Letters Mathematics-Control and Optimization

CiteScore

4.40

自引率

13.30%

发文量

471