基于数据驱动的低计算成本优化输出反馈控制

IF 3.9 4区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS
Xinyi Yu, Jiaqi Yu, Yongqi Zhang, Jiaxin Wu, Yan Wei, Linlin Ou
{"title":"基于数据驱动的低计算成本优化输出反馈控制","authors":"Xinyi Yu,&nbsp;Jiaqi Yu,&nbsp;Yongqi Zhang,&nbsp;Jiaxin Wu,&nbsp;Yan Wei,&nbsp;Linlin Ou","doi":"10.1002/acs.3832","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>A partial model-free, data-driven adaptive optimal output feedback (OPFB) control scheme with low computational cost continuous-time is proposed in this paper. The design objective is to obtain the optimal control law by using measurable input and output data, without some knowledge of system model information. Firstly, the system states are decoupled into measurable and unmeasurable parts, and a new state-space equation is built to estimate the unmeasurable states by using a reduced-order observer. Based on this, a parametrization method is utilized to reconstruct the system states. Subsequently, by using the reconstructed states, the adaptive dynamic programming (ADP) Bellman equations based on policy-iteration (PI) and value-iteration (VI) are presented to solve the control problems with initially stable and unstable conditions, respectively. Then, the convergence of the system is proved. Compared with the early proposed OPFB algorithms, only the unknown internal state needs to be reconstructed. Therefore, the computation cost and design complexity are reduced for the proposed scheme. The effectiveness of the proposed scheme is verified through two numerical simulations. In addition, a practical inverted pendulum experiment is carried out to demonstrate the performance of the proposed scheme.</p>\n </div>","PeriodicalId":50347,"journal":{"name":"International Journal of Adaptive Control and Signal Processing","volume":"38 8","pages":"2790-2809"},"PeriodicalIF":3.9000,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data-driven based optimal output feedback control with low computation cost\",\"authors\":\"Xinyi Yu,&nbsp;Jiaqi Yu,&nbsp;Yongqi Zhang,&nbsp;Jiaxin Wu,&nbsp;Yan Wei,&nbsp;Linlin Ou\",\"doi\":\"10.1002/acs.3832\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>A partial model-free, data-driven adaptive optimal output feedback (OPFB) control scheme with low computational cost continuous-time is proposed in this paper. The design objective is to obtain the optimal control law by using measurable input and output data, without some knowledge of system model information. Firstly, the system states are decoupled into measurable and unmeasurable parts, and a new state-space equation is built to estimate the unmeasurable states by using a reduced-order observer. Based on this, a parametrization method is utilized to reconstruct the system states. Subsequently, by using the reconstructed states, the adaptive dynamic programming (ADP) Bellman equations based on policy-iteration (PI) and value-iteration (VI) are presented to solve the control problems with initially stable and unstable conditions, respectively. Then, the convergence of the system is proved. Compared with the early proposed OPFB algorithms, only the unknown internal state needs to be reconstructed. Therefore, the computation cost and design complexity are reduced for the proposed scheme. The effectiveness of the proposed scheme is verified through two numerical simulations. In addition, a practical inverted pendulum experiment is carried out to demonstrate the performance of the proposed scheme.</p>\\n </div>\",\"PeriodicalId\":50347,\"journal\":{\"name\":\"International Journal of Adaptive Control and Signal Processing\",\"volume\":\"38 8\",\"pages\":\"2790-2809\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-05-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Adaptive Control and Signal Processing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/acs.3832\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Adaptive Control and Signal Processing","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/acs.3832","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

摘要 本文提出了一种部分无模型、数据驱动、低计算成本的连续时间自适应最优输出反馈(OPFB)控制方案。设计目标是在不了解系统模型信息的情况下,利用可测量的输入和输出数据获得最优控制法则。首先,将系统状态解耦为可测量和不可测量两部分,并建立一个新的状态空间方程,利用降阶观测器来估计不可测量的状态。在此基础上,利用参数化方法重建系统状态。随后,利用重建的状态,提出基于策略迭代(PI)和值迭代(VI)的自适应动态编程(ADP)贝尔曼方程,分别解决初始条件稳定和不稳定的控制问题。然后,证明了系统的收敛性。与早期提出的 OPFB 算法相比,该算法只需重建未知的内部状态。因此,提出的方案降低了计算成本和设计复杂度。通过两次数值模拟验证了所提方案的有效性。此外,还进行了实际的倒立摆实验,以证明所提方案的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Data-driven based optimal output feedback control with low computation cost

A partial model-free, data-driven adaptive optimal output feedback (OPFB) control scheme with low computational cost continuous-time is proposed in this paper. The design objective is to obtain the optimal control law by using measurable input and output data, without some knowledge of system model information. Firstly, the system states are decoupled into measurable and unmeasurable parts, and a new state-space equation is built to estimate the unmeasurable states by using a reduced-order observer. Based on this, a parametrization method is utilized to reconstruct the system states. Subsequently, by using the reconstructed states, the adaptive dynamic programming (ADP) Bellman equations based on policy-iteration (PI) and value-iteration (VI) are presented to solve the control problems with initially stable and unstable conditions, respectively. Then, the convergence of the system is proved. Compared with the early proposed OPFB algorithms, only the unknown internal state needs to be reconstructed. Therefore, the computation cost and design complexity are reduced for the proposed scheme. The effectiveness of the proposed scheme is verified through two numerical simulations. In addition, a practical inverted pendulum experiment is carried out to demonstrate the performance of the proposed scheme.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.30
自引率
16.10%
发文量
163
审稿时长
5 months
期刊介绍: The International Journal of Adaptive Control and Signal Processing is concerned with the design, synthesis and application of estimators or controllers where adaptive features are needed to cope with uncertainties.Papers on signal processing should also have some relevance to adaptive systems. The journal focus is on model based control design approaches rather than heuristic or rule based control design methods. All papers will be expected to include significant novel material. Both the theory and application of adaptive systems and system identification are areas of interest. Papers on applications can include problems in the implementation of algorithms for real time signal processing and control. The stability, convergence, robustness and numerical aspects of adaptive algorithms are also suitable topics. The related subjects of controller tuning, filtering, networks and switching theory are also of interest. Principal areas to be addressed include: Auto-Tuning, Self-Tuning and Model Reference Adaptive Controllers Nonlinear, Robust and Intelligent Adaptive Controllers Linear and Nonlinear Multivariable System Identification and Estimation Identification of Linear Parameter Varying, Distributed and Hybrid Systems Multiple Model Adaptive Control Adaptive Signal processing Theory and Algorithms Adaptation in Multi-Agent Systems Condition Monitoring Systems Fault Detection and Isolation Methods Fault Detection and Isolation Methods Fault-Tolerant Control (system supervision and diagnosis) Learning Systems and Adaptive Modelling Real Time Algorithms for Adaptive Signal Processing and Control Adaptive Signal Processing and Control Applications Adaptive Cloud Architectures and Networking Adaptive Mechanisms for Internet of Things Adaptive Sliding Mode Control.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信