游戏中的安全追踪:在未知动态和约束条件下实现最优控制

IF 2.7 4区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS
Xiaohong Cui, Wenjie Chen, Binrui Wang, Kun Zhou
{"title":"游戏中的安全追踪:在未知动态和约束条件下实现最优控制","authors":"Xiaohong Cui,&nbsp;Wenjie Chen,&nbsp;Binrui Wang,&nbsp;Kun Zhou","doi":"10.1002/asjc.3397","DOIUrl":null,"url":null,"abstract":"<p>This paper introduces mix-zero-sum differential (MZSD) game theory to address multi-player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off-policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.</p>","PeriodicalId":55453,"journal":{"name":"Asian Journal of Control","volume":"26 6","pages":"3190-3209"},"PeriodicalIF":2.7000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Safe tracking in games: Achieving optimal control with unknown dynamics and constraints\",\"authors\":\"Xiaohong Cui,&nbsp;Wenjie Chen,&nbsp;Binrui Wang,&nbsp;Kun Zhou\",\"doi\":\"10.1002/asjc.3397\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This paper introduces mix-zero-sum differential (MZSD) game theory to address multi-player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off-policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.</p>\",\"PeriodicalId\":55453,\"journal\":{\"name\":\"Asian Journal of Control\",\"volume\":\"26 6\",\"pages\":\"3190-3209\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asian Journal of Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/asjc.3397\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Control","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/asjc.3397","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

本文将混合零和差分博弈论(MZSD)引入多玩家跟踪系统,从而更好地理解玩家之间合作与竞争并存的关系。在此框架内,我们提出了一种最优安全跟踪控制(OSTC)方法,该方法在值函数中加入了控制障碍函数(CBF),以确保跟踪误差保持在指定范围内,从而在实现优化的同时保证安全性。同时,为了消除对系统动力学的需求,我们提出了一种利用非政策积分强化学习(IRL)技术的新方法,以获得 MZSD 博弈的纳什均衡解。我们建立了一种独特的批评者-行动者神经网络(NN)结构,可同时更新。此外,我们还利用 Lyapunov 方法分析了稳定性和收敛性。我们进行了两次模拟,以证明所提算法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Safe tracking in games: Achieving optimal control with unknown dynamics and constraints

This paper introduces mix-zero-sum differential (MZSD) game theory to address multi-player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off-policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Asian Journal of Control
Asian Journal of Control 工程技术-自动化与控制系统
CiteScore
4.80
自引率
25.00%
发文量
253
审稿时长
7.2 months
期刊介绍: The Asian Journal of Control, an Asian Control Association (ACA) and Chinese Automatic Control Society (CACS) affiliated journal, is the first international journal originating from the Asia Pacific region. The Asian Journal of Control publishes papers on original theoretical and practical research and developments in the areas of control, involving all facets of control theory and its application. Published six times a year, the Journal aims to be a key platform for control communities throughout the world. The Journal provides a forum where control researchers and practitioners can exchange knowledge and experiences on the latest advances in the control areas, and plays an educational role for students and experienced researchers in other disciplines interested in this continually growing field. The scope of the journal is extensive. Topics include: The theory and design of control systems and components, encompassing: Robust and distributed control using geometric, optimal, stochastic and nonlinear methods Game theory and state estimation Adaptive control, including neural networks, learning, parameter estimation and system fault detection Artificial intelligence, fuzzy and expert systems Hierarchical and man-machine systems All parts of systems engineering which consider the reliability of components and systems Emerging application areas, such as: Robotics Mechatronics Computers for computer-aided design, manufacturing, and control of various industrial processes Space vehicles and aircraft, ships, and traffic Biomedical systems National economies Power systems Agriculture Natural resources.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信