{"title":"完全未知非线性时变系统的基于数据的约束最优控制","authors":"Peyman Ahmadi, Mehdi Rahmani, Aref Shahmansoorian","doi":"10.1016/j.jfranklin.2025.107958","DOIUrl":null,"url":null,"abstract":"<div><div>Although the model-based control for nonlinear time-varying (NTV) systems is a challenging problem, this paperproposes a data-based approach to solve the constrained optimal control problem for continuous-time nonlinear polynomial time-varying systems with completely unknown dynamics. This approach does not rely on computationally expensive numerical solutions for model approximation methods. Instead, the optimal control policy is obtained by an adaptive dynamic programming (ADP)-based sum-of-squares (SOS) programming which is computationally tractable. The proposed model-free optimal control approach can apply constraints on the control input. By a novel idea, the input limitations are applied using the concept of inverse optimal control (IOC). The Lyapunov method theoretically ensures the stability of the proposed optimal control scheme. Additionally, it is an off-policy algorithm and avoids the repeat of experiments for control design. The efficacy of the suggested method is investigated through two numerical examples.</div></div>","PeriodicalId":17283,"journal":{"name":"Journal of The Franklin Institute-engineering and Applied Mathematics","volume":"362 14","pages":"Article 107958"},"PeriodicalIF":4.2000,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data-based constrained optimal control for completely unknown nonlinear time-varying systems\",\"authors\":\"Peyman Ahmadi, Mehdi Rahmani, Aref Shahmansoorian\",\"doi\":\"10.1016/j.jfranklin.2025.107958\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Although the model-based control for nonlinear time-varying (NTV) systems is a challenging problem, this paperproposes a data-based approach to solve the constrained optimal control problem for continuous-time nonlinear polynomial time-varying systems with completely unknown dynamics. This approach does not rely on computationally expensive numerical solutions for model approximation methods. Instead, the optimal control policy is obtained by an adaptive dynamic programming (ADP)-based sum-of-squares (SOS) programming which is computationally tractable. The proposed model-free optimal control approach can apply constraints on the control input. By a novel idea, the input limitations are applied using the concept of inverse optimal control (IOC). The Lyapunov method theoretically ensures the stability of the proposed optimal control scheme. Additionally, it is an off-policy algorithm and avoids the repeat of experiments for control design. The efficacy of the suggested method is investigated through two numerical examples.</div></div>\",\"PeriodicalId\":17283,\"journal\":{\"name\":\"Journal of The Franklin Institute-engineering and Applied Mathematics\",\"volume\":\"362 14\",\"pages\":\"Article 107958\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of The Franklin Institute-engineering and Applied Mathematics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S001600322500451X\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The Franklin Institute-engineering and Applied Mathematics","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S001600322500451X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Data-based constrained optimal control for completely unknown nonlinear time-varying systems
Although the model-based control for nonlinear time-varying (NTV) systems is a challenging problem, this paperproposes a data-based approach to solve the constrained optimal control problem for continuous-time nonlinear polynomial time-varying systems with completely unknown dynamics. This approach does not rely on computationally expensive numerical solutions for model approximation methods. Instead, the optimal control policy is obtained by an adaptive dynamic programming (ADP)-based sum-of-squares (SOS) programming which is computationally tractable. The proposed model-free optimal control approach can apply constraints on the control input. By a novel idea, the input limitations are applied using the concept of inverse optimal control (IOC). The Lyapunov method theoretically ensures the stability of the proposed optimal control scheme. Additionally, it is an off-policy algorithm and avoids the repeat of experiments for control design. The efficacy of the suggested method is investigated through two numerical examples.
期刊介绍:
The Journal of The Franklin Institute has an established reputation for publishing high-quality papers in the field of engineering and applied mathematics. Its current focus is on control systems, complex networks and dynamic systems, signal processing and communications and their applications. All submitted papers are peer-reviewed. The Journal will publish original research papers and research review papers of substance. Papers and special focus issues are judged upon possible lasting value, which has been and continues to be the strength of the Journal of The Franklin Institute.