非合作随机微分博弈的自适应稳定化

IF 2.2 2区 数学 Q2 AUTOMATION & CONTROL SYSTEMS
Nian Liu, Lei Guo
{"title":"非合作随机微分博弈的自适应稳定化","authors":"Nian Liu, Lei Guo","doi":"10.1137/22m1530549","DOIUrl":null,"url":null,"abstract":"SIAM Journal on Control and Optimization, Volume 62, Issue 3, Page 1317-1342, June 2024. <br/>Abstract. In this paper, we consider the adaptive stabilization problem for a basic class of linear-quadratic noncooperative stochastic differential games when the systems matrices are unknown to the regulator and the players. This is a typical problem of game-based control systems (GBCS) introduced and studied recently, which have a hierarchical decision-making structure: there is a controller at the upper level acting as a global regulator which makes its decision first, and the players at the lower level are assumed to play a typical zero-sum differential games. The main purpose of the paper is to study how the adaptive regulator can be designed to make the GBCS globally stable and at the same time to ensure a Nash equilibrium reached by the players, where the adaptive strategies of the players are assumed to be constructed based on the standard least squares estimators. The design of the global regulator is an integration of the weighted least squares parameter estimator, random regularization and diminishing excitation methods. Under the assumption that the system matrix pair [math] is controllable and there exists a stabilizing solution for the corresponding algebraic Riccati equation, it is shown that the closed-loop adaptive GBCS will be globally stable, and at the same time reach a Nash equilibrium by the players.","PeriodicalId":49531,"journal":{"name":"SIAM Journal on Control and Optimization","volume":null,"pages":null},"PeriodicalIF":2.2000,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive Stabilization of Noncooperative Stochastic Differential Games\",\"authors\":\"Nian Liu, Lei Guo\",\"doi\":\"10.1137/22m1530549\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"SIAM Journal on Control and Optimization, Volume 62, Issue 3, Page 1317-1342, June 2024. <br/>Abstract. In this paper, we consider the adaptive stabilization problem for a basic class of linear-quadratic noncooperative stochastic differential games when the systems matrices are unknown to the regulator and the players. This is a typical problem of game-based control systems (GBCS) introduced and studied recently, which have a hierarchical decision-making structure: there is a controller at the upper level acting as a global regulator which makes its decision first, and the players at the lower level are assumed to play a typical zero-sum differential games. The main purpose of the paper is to study how the adaptive regulator can be designed to make the GBCS globally stable and at the same time to ensure a Nash equilibrium reached by the players, where the adaptive strategies of the players are assumed to be constructed based on the standard least squares estimators. The design of the global regulator is an integration of the weighted least squares parameter estimator, random regularization and diminishing excitation methods. Under the assumption that the system matrix pair [math] is controllable and there exists a stabilizing solution for the corresponding algebraic Riccati equation, it is shown that the closed-loop adaptive GBCS will be globally stable, and at the same time reach a Nash equilibrium by the players.\",\"PeriodicalId\":49531,\"journal\":{\"name\":\"SIAM Journal on Control and Optimization\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-05-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SIAM Journal on Control and Optimization\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1137/22m1530549\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SIAM Journal on Control and Optimization","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1137/22m1530549","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

SIAM 控制与优化期刊》第 62 卷第 3 期第 1317-1342 页,2024 年 6 月。摘要本文考虑了一类基本的线性二次非合作随机微分博弈的自适应稳定问题,当系统矩阵对调节者和博弈者来说都是未知的。这是最近引入和研究的基于博弈的控制系统(GBCS)的一个典型问题,它具有分层决策结构:上层有一个控制器作为全局调节器,它首先做出决策,下层的博弈者被假定为玩典型的零和微分博弈。本文的主要目的是研究如何设计自适应调节器,使 GBCS 全局稳定,同时确保博弈者达到纳什均衡,其中博弈者的自适应策略假定是基于标准最小二乘估计器构建的。全局调节器的设计是加权最小二乘参数估计器、随机正则化和递减激励方法的集成。在系统矩阵对[math]可控且相应代数里卡提方程存在稳定解的假设下,研究表明闭环自适应 GBCS 将是全局稳定的,同时博弈者将达到纳什均衡。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Adaptive Stabilization of Noncooperative Stochastic Differential Games
SIAM Journal on Control and Optimization, Volume 62, Issue 3, Page 1317-1342, June 2024.
Abstract. In this paper, we consider the adaptive stabilization problem for a basic class of linear-quadratic noncooperative stochastic differential games when the systems matrices are unknown to the regulator and the players. This is a typical problem of game-based control systems (GBCS) introduced and studied recently, which have a hierarchical decision-making structure: there is a controller at the upper level acting as a global regulator which makes its decision first, and the players at the lower level are assumed to play a typical zero-sum differential games. The main purpose of the paper is to study how the adaptive regulator can be designed to make the GBCS globally stable and at the same time to ensure a Nash equilibrium reached by the players, where the adaptive strategies of the players are assumed to be constructed based on the standard least squares estimators. The design of the global regulator is an integration of the weighted least squares parameter estimator, random regularization and diminishing excitation methods. Under the assumption that the system matrix pair [math] is controllable and there exists a stabilizing solution for the corresponding algebraic Riccati equation, it is shown that the closed-loop adaptive GBCS will be globally stable, and at the same time reach a Nash equilibrium by the players.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.00
自引率
4.50%
发文量
143
审稿时长
12 months
期刊介绍: SIAM Journal on Control and Optimization (SICON) publishes original research articles on the mathematics and applications of control theory and certain parts of optimization theory. Papers considered for publication must be significant at both the mathematical level and the level of applications or potential applications. Papers containing mostly routine mathematics or those with no discernible connection to control and systems theory or optimization will not be considered for publication. From time to time, the journal will also publish authoritative surveys of important subject areas in control theory and optimization whose level of maturity permits a clear and unified exposition. The broad areas mentioned above are intended to encompass a wide range of mathematical techniques and scientific, engineering, economic, and industrial applications. These include stochastic and deterministic methods in control, estimation, and identification of systems; modeling and realization of complex control systems; the numerical analysis and related computational methodology of control processes and allied issues; and the development of mathematical theories and techniques that give new insights into old problems or provide the basis for further progress in control theory and optimization. Within the field of optimization, the journal focuses on the parts that are relevant to dynamic and control systems. Contributions to numerical methodology are also welcome in accordance with these aims, especially as related to large-scale problems and decomposition as well as to fundamental questions of convergence and approximation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信