Adaptive dynamic programming and distributionally robust optimal control of linear stochastic system using the Wasserstein metric

IF 3.9 4区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS
Qingpeng Liang, Jiangping Hu, Linying Xiang, Kaibo Shi, Yanzhi Wu
{"title":"Adaptive dynamic programming and distributionally robust optimal control of linear stochastic system using the Wasserstein metric","authors":"Qingpeng Liang,&nbsp;Jiangping Hu,&nbsp;Linying Xiang,&nbsp;Kaibo Shi,&nbsp;Yanzhi Wu","doi":"10.1002/acs.3830","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In this paper, we consider the optimal control of unknown stochastic dynamical system for both the finite-horizon and infinite-horizon cases. The objective of this paper is to find an optimal controller to minimize the expected value of a function which depends on the random disturbance. Throughout this paper, it is assumed that the mean vector and covariance matrix of the disturbance distribution is unknown. An uncertainty set in the space of mean vector and the covariance matrix is introduced. For the finite-horizon case, we derive a closed-form expression of the unique optimal policy and the opponents policy that generates the worst-case distribution. For the infinite-horizon case, we simplify the Riccati equation obtained in the finite-hozion setting to an algebraic Riccati equation, which can guarantee the existence of the solution of the Riccati equation. It is shown that the resulting optimal policies obtained in these two cases can stabilize the expected value of the system state under the worst-case distribution. Furthermore, the unknown system matrices can also be explicitly computed using the adaptive dynamic programming technique, which can help compute the optimal control policy by solving the algebraic Riccati equation. Finally, a simulation example is presented to demonstrate the effectiveness of our theoretical results.</p>\n </div>","PeriodicalId":50347,"journal":{"name":"International Journal of Adaptive Control and Signal Processing","volume":"38 8","pages":"2810-2832"},"PeriodicalIF":3.9000,"publicationDate":"2024-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Adaptive Control and Signal Processing","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/acs.3830","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, we consider the optimal control of unknown stochastic dynamical system for both the finite-horizon and infinite-horizon cases. The objective of this paper is to find an optimal controller to minimize the expected value of a function which depends on the random disturbance. Throughout this paper, it is assumed that the mean vector and covariance matrix of the disturbance distribution is unknown. An uncertainty set in the space of mean vector and the covariance matrix is introduced. For the finite-horizon case, we derive a closed-form expression of the unique optimal policy and the opponents policy that generates the worst-case distribution. For the infinite-horizon case, we simplify the Riccati equation obtained in the finite-hozion setting to an algebraic Riccati equation, which can guarantee the existence of the solution of the Riccati equation. It is shown that the resulting optimal policies obtained in these two cases can stabilize the expected value of the system state under the worst-case distribution. Furthermore, the unknown system matrices can also be explicitly computed using the adaptive dynamic programming technique, which can help compute the optimal control policy by solving the algebraic Riccati equation. Finally, a simulation example is presented to demonstrate the effectiveness of our theoretical results.

利用瓦瑟斯坦度量对线性随机系统进行自适应动态编程和分布稳健优化控制
摘要 本文考虑了有限视距和无限视距情况下未知随机动力系统的最优控制问题。本文的目标是找到一个最优控制器,以最小化取决于随机扰动的函数的期望值。本文假定扰动分布的均值向量和协方差矩阵是未知的。本文引入了均值向量和协方差矩阵空间中的不确定性集。对于有限视距情况,我们推导出唯一最优策略的闭式表达式,以及产生最坏情况分布的对立策略。对于无限视距情形,我们将有限视距情形下得到的里卡提方程简化为代数里卡提方程,从而保证了里卡提方程解的存在性。结果表明,在这两种情况下得到的最优策略可以在最坏情况分布下稳定系统状态的期望值。此外,还可以利用自适应动态编程技术显式计算未知系统矩阵,这有助于通过求解代数 Riccati 方程来计算最优控制策略。最后,我们以一个仿真实例来证明我们理论结果的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.30
自引率
16.10%
发文量
163
审稿时长
5 months
期刊介绍: The International Journal of Adaptive Control and Signal Processing is concerned with the design, synthesis and application of estimators or controllers where adaptive features are needed to cope with uncertainties.Papers on signal processing should also have some relevance to adaptive systems. The journal focus is on model based control design approaches rather than heuristic or rule based control design methods. All papers will be expected to include significant novel material. Both the theory and application of adaptive systems and system identification are areas of interest. Papers on applications can include problems in the implementation of algorithms for real time signal processing and control. The stability, convergence, robustness and numerical aspects of adaptive algorithms are also suitable topics. The related subjects of controller tuning, filtering, networks and switching theory are also of interest. Principal areas to be addressed include: Auto-Tuning, Self-Tuning and Model Reference Adaptive Controllers Nonlinear, Robust and Intelligent Adaptive Controllers Linear and Nonlinear Multivariable System Identification and Estimation Identification of Linear Parameter Varying, Distributed and Hybrid Systems Multiple Model Adaptive Control Adaptive Signal processing Theory and Algorithms Adaptation in Multi-Agent Systems Condition Monitoring Systems Fault Detection and Isolation Methods Fault Detection and Isolation Methods Fault-Tolerant Control (system supervision and diagnosis) Learning Systems and Adaptive Modelling Real Time Algorithms for Adaptive Signal Processing and Control Adaptive Signal Processing and Control Applications Adaptive Cloud Architectures and Networking Adaptive Mechanisms for Internet of Things Adaptive Sliding Mode Control.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信