{"title":"Performance-Oriented Data-Driven Control: Fusing Koopman Operator and MPC-Based Reinforcement Learning","authors":"Hossein Nejatbakhsh Esfahani;Umesh Vaidya;Javad Mohammadpour Velni","doi":"10.1109/LCSYS.2024.3520904","DOIUrl":null,"url":null,"abstract":"This letter develops the machinery of Koopman-based Model Predictive Control (KMPC) design, where the Koopman derived model is unable to capture the real nonlinear system perfectly. We then propose to use an MPC-based reinforcement learning within the Koopman framework combining the strengths of MPC, Reinforcement Learning (RL), and the Koopman Operator (KO) theory for an efficient data-driven control and performance-oriented learning of complex nonlinear systems. We show that the closed-loop performance of the KMPC is improved by modifying the KMPC objective function. In practice, we design a fully parameterized KMPC and employ RL to adjust the corresponding parameters aiming at achieving the best achievable closed-loop performance.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"8 ","pages":"3021-3026"},"PeriodicalIF":2.4000,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Control Systems Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10808167/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This letter develops the machinery of Koopman-based Model Predictive Control (KMPC) design, where the Koopman derived model is unable to capture the real nonlinear system perfectly. We then propose to use an MPC-based reinforcement learning within the Koopman framework combining the strengths of MPC, Reinforcement Learning (RL), and the Koopman Operator (KO) theory for an efficient data-driven control and performance-oriented learning of complex nonlinear systems. We show that the closed-loop performance of the KMPC is improved by modifying the KMPC objective function. In practice, we design a fully parameterized KMPC and employ RL to adjust the corresponding parameters aiming at achieving the best achievable closed-loop performance.