完全未知非线性时变系统的基于数据的约束最优控制

IF 4.2 3区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

Journal of The Franklin Institute-engineering and Applied Mathematics Pub Date : 2025-08-09 DOI:10.1016/j.jfranklin.2025.107958

Peyman Ahmadi, Mehdi Rahmani, Aref Shahmansoorian

{"title":"完全未知非线性时变系统的基于数据的约束最优控制","authors":"Peyman Ahmadi, Mehdi Rahmani, Aref Shahmansoorian","doi":"10.1016/j.jfranklin.2025.107958","DOIUrl":null,"url":null,"abstract":"<div><div>Although the model-based control for nonlinear time-varying (NTV) systems is a challenging problem, this paperproposes a data-based approach to solve the constrained optimal control problem for continuous-time nonlinear polynomial time-varying systems with completely unknown dynamics. This approach does not rely on computationally expensive numerical solutions for model approximation methods. Instead, the optimal control policy is obtained by an adaptive dynamic programming (ADP)-based sum-of-squares (SOS) programming which is computationally tractable. The proposed model-free optimal control approach can apply constraints on the control input. By a novel idea, the input limitations are applied using the concept of inverse optimal control (IOC). The Lyapunov method theoretically ensures the stability of the proposed optimal control scheme. Additionally, it is an off-policy algorithm and avoids the repeat of experiments for control design. The efficacy of the suggested method is investigated through two numerical examples.</div></div>","PeriodicalId":17283,"journal":{"name":"Journal of The Franklin Institute-engineering and Applied Mathematics","volume":"362 14","pages":"Article 107958"},"PeriodicalIF":4.2000,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data-based constrained optimal control for completely unknown nonlinear time-varying systems\",\"authors\":\"Peyman Ahmadi, Mehdi Rahmani, Aref Shahmansoorian\",\"doi\":\"10.1016/j.jfranklin.2025.107958\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Although the model-based control for nonlinear time-varying (NTV) systems is a challenging problem, this paperproposes a data-based approach to solve the constrained optimal control problem for continuous-time nonlinear polynomial time-varying systems with completely unknown dynamics. This approach does not rely on computationally expensive numerical solutions for model approximation methods. Instead, the optimal control policy is obtained by an adaptive dynamic programming (ADP)-based sum-of-squares (SOS) programming which is computationally tractable. The proposed model-free optimal control approach can apply constraints on the control input. By a novel idea, the input limitations are applied using the concept of inverse optimal control (IOC). The Lyapunov method theoretically ensures the stability of the proposed optimal control scheme. Additionally, it is an off-policy algorithm and avoids the repeat of experiments for control design. The efficacy of the suggested method is investigated through two numerical examples.</div></div>\",\"PeriodicalId\":17283,\"journal\":{\"name\":\"Journal of The Franklin Institute-engineering and Applied Mathematics\",\"volume\":\"362 14\",\"pages\":\"Article 107958\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of The Franklin Institute-engineering and Applied Mathematics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S001600322500451X\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The Franklin Institute-engineering and Applied Mathematics","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S001600322500451X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

尽管非线性时变系统的基于模型的控制是一个具有挑战性的问题，但本文提出了一种基于数据的方法来解决具有完全未知动力学的连续非线性多项式时变系统的约束最优控制问题。这种方法不依赖于模型近似方法的计算昂贵的数值解。采用基于自适应动态规划（ADP）的平方和规划方法获得最优控制策略，该方法在计算上易于处理。提出的无模型最优控制方法可以对控制输入施加约束。通过一种新颖的思想，利用逆最优控制（IOC）的概念，对输入限制进行了应用。李雅普诺夫方法从理论上保证了所提出的最优控制方案的稳定性。此外，它是一种非策略算法，避免了控制设计的重复实验。通过两个算例验证了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Data-based constrained optimal control for completely unknown nonlinear time-varying systems

Although the model-based control for nonlinear time-varying (NTV) systems is a challenging problem, this paperproposes a data-based approach to solve the constrained optimal control problem for continuous-time nonlinear polynomial time-varying systems with completely unknown dynamics. This approach does not rely on computationally expensive numerical solutions for model approximation methods. Instead, the optimal control policy is obtained by an adaptive dynamic programming (ADP)-based sum-of-squares (SOS) programming which is computationally tractable. The proposed model-free optimal control approach can apply constraints on the control input. By a novel idea, the input limitations are applied using the concept of inverse optimal control (IOC). The Lyapunov method theoretically ensures the stability of the proposed optimal control scheme. Additionally, it is an off-policy algorithm and avoids the repeat of experiments for control design. The efficacy of the suggested method is investigated through two numerical examples.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of The Franklin Institute-engineering and Applied Mathematics 工程技术-工程：电子与电气

CiteScore

7.30

自引率

14.60%

发文量

586

审稿时长

6.9 months

期刊介绍： The Journal of The Franklin Institute has an established reputation for publishing high-quality papers in the field of engineering and applied mathematics. Its current focus is on control systems, complex networks and dynamic systems, signal processing and communications and their applications. All submitted papers are peer-reviewed. The Journal will publish original research papers and research review papers of substance. Papers and special focus issues are judged upon possible lasting value, which has been and continues to be the strength of the Journal of The Franklin Institute.