{"title":"面向性能的数据驱动控制:融合Koopman算子和基于mpc的强化学习","authors":"Hossein Nejatbakhsh Esfahani;Umesh Vaidya;Javad Mohammadpour Velni","doi":"10.1109/LCSYS.2024.3520904","DOIUrl":null,"url":null,"abstract":"This letter develops the machinery of Koopman-based Model Predictive Control (KMPC) design, where the Koopman derived model is unable to capture the real nonlinear system perfectly. We then propose to use an MPC-based reinforcement learning within the Koopman framework combining the strengths of MPC, Reinforcement Learning (RL), and the Koopman Operator (KO) theory for an efficient data-driven control and performance-oriented learning of complex nonlinear systems. We show that the closed-loop performance of the KMPC is improved by modifying the KMPC objective function. In practice, we design a fully parameterized KMPC and employ RL to adjust the corresponding parameters aiming at achieving the best achievable closed-loop performance.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"8 ","pages":"3021-3026"},"PeriodicalIF":2.4000,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance-Oriented Data-Driven Control: Fusing Koopman Operator and MPC-Based Reinforcement Learning\",\"authors\":\"Hossein Nejatbakhsh Esfahani;Umesh Vaidya;Javad Mohammadpour Velni\",\"doi\":\"10.1109/LCSYS.2024.3520904\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This letter develops the machinery of Koopman-based Model Predictive Control (KMPC) design, where the Koopman derived model is unable to capture the real nonlinear system perfectly. We then propose to use an MPC-based reinforcement learning within the Koopman framework combining the strengths of MPC, Reinforcement Learning (RL), and the Koopman Operator (KO) theory for an efficient data-driven control and performance-oriented learning of complex nonlinear systems. We show that the closed-loop performance of the KMPC is improved by modifying the KMPC objective function. In practice, we design a fully parameterized KMPC and employ RL to adjust the corresponding parameters aiming at achieving the best achievable closed-loop performance.\",\"PeriodicalId\":37235,\"journal\":{\"name\":\"IEEE Control Systems Letters\",\"volume\":\"8 \",\"pages\":\"3021-3026\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2024-12-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Control Systems Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10808167/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Control Systems Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10808167/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Performance-Oriented Data-Driven Control: Fusing Koopman Operator and MPC-Based Reinforcement Learning
This letter develops the machinery of Koopman-based Model Predictive Control (KMPC) design, where the Koopman derived model is unable to capture the real nonlinear system perfectly. We then propose to use an MPC-based reinforcement learning within the Koopman framework combining the strengths of MPC, Reinforcement Learning (RL), and the Koopman Operator (KO) theory for an efficient data-driven control and performance-oriented learning of complex nonlinear systems. We show that the closed-loop performance of the KMPC is improved by modifying the KMPC objective function. In practice, we design a fully parameterized KMPC and employ RL to adjust the corresponding parameters aiming at achieving the best achievable closed-loop performance.