USLC：通过物理性能策略优化神经网络实现通用自学习控制

IF 2.3 4区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

IET Control Theory and Applications Pub Date : 2024-11-11 DOI:10.1049/cth2.12758

Yanhui Zhang, Xiaoling Liang, Weifang Chen, Kunfeng Lu, Chao Xu, Shuzhi Sam Ge

{"title":"USLC：通过物理性能策略优化神经网络实现通用自学习控制","authors":"Yanhui Zhang, Xiaoling Liang, Weifang Chen, Kunfeng Lu, Chao Xu, Shuzhi Sam Ge","doi":"10.1049/cth2.12758","DOIUrl":null,"url":null,"abstract":"<p>This article proposes an online universal self-learning control (USLC) algorithm based on a physical performance policy-optimization neural network, which aims to solve the problem of universal self-learning optimal control laws for nonlinear systems with various uncertain dynamics. As a key system characterization, this algorithm predicts the discrepancy between the optimal and current control laws by evaluating overall performance in each iterative learning cycle, leveraging an offline-trained universal policy network. This approach is universal, as it does not rely on an exact system model and can adaptively control performance preferences across various tasks by customizing the physical performance cost weights. Using the established control law-performance surface and contraction Lyapunov function, the necessary assumptions and proofs for the stable convergence of the system within a three-dimensional manifold space are provided. To demonstrate the universality of USLC, simulation experiments are conducted on two different systems: a low-order circuit system and a high-order variable-span aircraft attitude control system. The stable control achieved under varying initial values and boundary conditions in each system illustrates the effectiveness of the proposed method. Finally, the limitations of this study are discussed.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"18 18","pages":"2944-2960"},"PeriodicalIF":2.3000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12758","citationCount":"0","resultStr":"{\"title\":\"USLC: Universal self-learning control via physical performance policy-optimization neural network\",\"authors\":\"Yanhui Zhang, Xiaoling Liang, Weifang Chen, Kunfeng Lu, Chao Xu, Shuzhi Sam Ge\",\"doi\":\"10.1049/cth2.12758\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This article proposes an online universal self-learning control (USLC) algorithm based on a physical performance policy-optimization neural network, which aims to solve the problem of universal self-learning optimal control laws for nonlinear systems with various uncertain dynamics. As a key system characterization, this algorithm predicts the discrepancy between the optimal and current control laws by evaluating overall performance in each iterative learning cycle, leveraging an offline-trained universal policy network. This approach is universal, as it does not rely on an exact system model and can adaptively control performance preferences across various tasks by customizing the physical performance cost weights. Using the established control law-performance surface and contraction Lyapunov function, the necessary assumptions and proofs for the stable convergence of the system within a three-dimensional manifold space are provided. To demonstrate the universality of USLC, simulation experiments are conducted on two different systems: a low-order circuit system and a high-order variable-span aircraft attitude control system. The stable control achieved under varying initial values and boundary conditions in each system illustrates the effectiveness of the proposed method. Finally, the limitations of this study are discussed.</p>\",\"PeriodicalId\":50382,\"journal\":{\"name\":\"IET Control Theory and Applications\",\"volume\":\"18 18\",\"pages\":\"2944-2960\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12758\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Control Theory and Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/cth2.12758\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Control Theory and Applications","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cth2.12758","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种基于物理性能策略优化神经网络的在线通用自学习控制（USLC）算法，旨在解决具有各种不确定动态的非线性系统的通用自学习最优控制律问题。作为一个关键的系统特征，该算法通过评估每个迭代学习周期中的整体性能来预测最优控制律和当前控制律之间的差异，利用离线训练的通用策略网络。这种方法是通用的，因为它不依赖于精确的系统模型，并且可以通过自定义物理性能成本权重来自适应地控制各种任务的性能首选项。利用所建立的控制律性能曲面和收缩Lyapunov函数，给出了系统在三维流形空间稳定收敛的必要假设和证明。为了证明USLC的通用性，对低阶电路系统和高阶变跨度飞行器姿态控制系统进行了仿真实验。各系统在不同初始值和边界条件下的稳定控制表明了所提方法的有效性。最后，讨论了本研究的局限性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

USLC: Universal self-learning control via physical performance policy-optimization neural network

查看原文本刊更多论文

USLC: Universal self-learning control via physical performance policy-optimization neural network

This article proposes an online universal self-learning control (USLC) algorithm based on a physical performance policy-optimization neural network, which aims to solve the problem of universal self-learning optimal control laws for nonlinear systems with various uncertain dynamics. As a key system characterization, this algorithm predicts the discrepancy between the optimal and current control laws by evaluating overall performance in each iterative learning cycle, leveraging an offline-trained universal policy network. This approach is universal, as it does not rely on an exact system model and can adaptively control performance preferences across various tasks by customizing the physical performance cost weights. Using the established control law-performance surface and contraction Lyapunov function, the necessary assumptions and proofs for the stable convergence of the system within a three-dimensional manifold space are provided. To demonstrate the universality of USLC, simulation experiments are conducted on two different systems: a low-order circuit system and a high-order variable-span aircraft attitude control system. The stable control achieved under varying initial values and boundary conditions in each system illustrates the effectiveness of the proposed method. Finally, the limitations of this study are discussed.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IET Control Theory and Applications 工程技术-工程：电子与电气

CiteScore

5.70

自引率

7.70%

发文量

167

审稿时长

5.1 months

期刊介绍： IET Control Theory & Applications is devoted to control systems in the broadest sense, covering new theoretical results and the applications of new and established control methods. Among the topics of interest are system modelling, identification and simulation, the analysis and design of control systems (including computer-aided design), and practical implementation. The scope encompasses technological, economic, physiological (biomedical) and other systems, including man-machine interfaces. Most of the papers published deal with original work from industrial and government laboratories and universities, but subject reviews and tutorial expositions of current methods are welcomed. Correspondence discussing published papers is also welcomed. Applications papers need not necessarily involve new theory. Papers which describe new realisations of established methods, or control techniques applied in a novel situation, or practical studies which compare various designs, would be of interest. Of particular value are theoretical papers which discuss the applicability of new work or applications which engender new theoretical applications.