Yanhui Zhang, Xiaoling Liang, Weifang Chen, Kunfeng Lu, Chao Xu, Shuzhi Sam Ge
{"title":"USLC:通过物理性能策略优化神经网络实现通用自学习控制","authors":"Yanhui Zhang, Xiaoling Liang, Weifang Chen, Kunfeng Lu, Chao Xu, Shuzhi Sam Ge","doi":"10.1049/cth2.12758","DOIUrl":null,"url":null,"abstract":"<p>This article proposes an online universal self-learning control (USLC) algorithm based on a physical performance policy-optimization neural network, which aims to solve the problem of universal self-learning optimal control laws for nonlinear systems with various uncertain dynamics. As a key system characterization, this algorithm predicts the discrepancy between the optimal and current control laws by evaluating overall performance in each iterative learning cycle, leveraging an offline-trained universal policy network. This approach is universal, as it does not rely on an exact system model and can adaptively control performance preferences across various tasks by customizing the physical performance cost weights. Using the established control law-performance surface and contraction Lyapunov function, the necessary assumptions and proofs for the stable convergence of the system within a three-dimensional manifold space are provided. To demonstrate the universality of USLC, simulation experiments are conducted on two different systems: a low-order circuit system and a high-order variable-span aircraft attitude control system. The stable control achieved under varying initial values and boundary conditions in each system illustrates the effectiveness of the proposed method. Finally, the limitations of this study are discussed.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"18 18","pages":"2944-2960"},"PeriodicalIF":2.2000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12758","citationCount":"0","resultStr":"{\"title\":\"USLC: Universal self-learning control via physical performance policy-optimization neural network\",\"authors\":\"Yanhui Zhang, Xiaoling Liang, Weifang Chen, Kunfeng Lu, Chao Xu, Shuzhi Sam Ge\",\"doi\":\"10.1049/cth2.12758\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This article proposes an online universal self-learning control (USLC) algorithm based on a physical performance policy-optimization neural network, which aims to solve the problem of universal self-learning optimal control laws for nonlinear systems with various uncertain dynamics. As a key system characterization, this algorithm predicts the discrepancy between the optimal and current control laws by evaluating overall performance in each iterative learning cycle, leveraging an offline-trained universal policy network. This approach is universal, as it does not rely on an exact system model and can adaptively control performance preferences across various tasks by customizing the physical performance cost weights. Using the established control law-performance surface and contraction Lyapunov function, the necessary assumptions and proofs for the stable convergence of the system within a three-dimensional manifold space are provided. To demonstrate the universality of USLC, simulation experiments are conducted on two different systems: a low-order circuit system and a high-order variable-span aircraft attitude control system. The stable control achieved under varying initial values and boundary conditions in each system illustrates the effectiveness of the proposed method. Finally, the limitations of this study are discussed.</p>\",\"PeriodicalId\":50382,\"journal\":{\"name\":\"IET Control Theory and Applications\",\"volume\":\"18 18\",\"pages\":\"2944-2960\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12758\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Control Theory and Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/cth2.12758\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Control Theory and Applications","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cth2.12758","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
USLC: Universal self-learning control via physical performance policy-optimization neural network
This article proposes an online universal self-learning control (USLC) algorithm based on a physical performance policy-optimization neural network, which aims to solve the problem of universal self-learning optimal control laws for nonlinear systems with various uncertain dynamics. As a key system characterization, this algorithm predicts the discrepancy between the optimal and current control laws by evaluating overall performance in each iterative learning cycle, leveraging an offline-trained universal policy network. This approach is universal, as it does not rely on an exact system model and can adaptively control performance preferences across various tasks by customizing the physical performance cost weights. Using the established control law-performance surface and contraction Lyapunov function, the necessary assumptions and proofs for the stable convergence of the system within a three-dimensional manifold space are provided. To demonstrate the universality of USLC, simulation experiments are conducted on two different systems: a low-order circuit system and a high-order variable-span aircraft attitude control system. The stable control achieved under varying initial values and boundary conditions in each system illustrates the effectiveness of the proposed method. Finally, the limitations of this study are discussed.
期刊介绍:
IET Control Theory & Applications is devoted to control systems in the broadest sense, covering new theoretical results and the applications of new and established control methods. Among the topics of interest are system modelling, identification and simulation, the analysis and design of control systems (including computer-aided design), and practical implementation. The scope encompasses technological, economic, physiological (biomedical) and other systems, including man-machine interfaces.
Most of the papers published deal with original work from industrial and government laboratories and universities, but subject reviews and tutorial expositions of current methods are welcomed. Correspondence discussing published papers is also welcomed.
Applications papers need not necessarily involve new theory. Papers which describe new realisations of established methods, or control techniques applied in a novel situation, or practical studies which compare various designs, would be of interest. Of particular value are theoretical papers which discuss the applicability of new work or applications which engender new theoretical applications.